David Kwan

← Back to david-kwan.com

A collection of key projects from 2025, showcasing my work beyond traditional Python and R machine learning and data science. This portfolio demonstrates interdisciplinary expertise spanning cognitive psychology, evolutionary computation, and specialized NLP applications, representing both novel developments and significant enhancements to prior research.

Featured Projects

Sentiment Analysis System

2023 2025

My HKU MSc Capstone project addressing "hidden dissatisfaction" in e-commerce through advanced sentiment analysis. Analyzed extensive Amazon reviews using multiple sophisticated models including Advanced Ensemble, RoBERTa Transformer, and Feature-Based approaches with extensively engineered features. Integrated Transformer-based ABSA processing comprehensive aspect-sentiments, uncovering subtle sentiment patterns that traditional analysis misses and enabling proactive business intelligence.

View Project →

Purchase Decision Modeling

2025

Implemented Drift Diffusion Models (DDM) to decode consumer choice behavior through evidence accumulation modeling. Analyzed multiple distinct decision-making scenarios across extensive simulated trials, revealing how bias amplification, speed-accuracy tradeoffs, and pressure sensitivity influence purchasing decisions. Achieved comprehensive insights into consumer archetypes from impulsive buyers to cautious deliberators, enabling targeted marketing strategies and optimized choice architecture.

View Project →

Multi-Objective Optimization

2025

Developed a comprehensive framework comparing state-of-the-art Multi-Objective Evolutionary Algorithms (NSGA-II, NSGA-III, MOEA/D) on benchmark problems ZDT1 and DTLZ1. MOEA/D achieved superior performance with extensive Pareto-optimal solutions in tri-objective optimization, demonstrating advanced capabilities in handling conflicting objectives. Implemented hypervolume analysis and statistical performance metrics, providing insights into algorithm selection for complex optimization challenges.

View Project →

Buddhist-Specific Neural Machine Translation

2025

Developed a specialized neural machine translation system using an extensive Buddhist-specific parallel corpus, achieving significant improvements in domain-specific translation quality. Implemented custom tokenization for CJK characters and Buddhist terminology optimization. Demonstrated substantial specificity improvement over general-purpose models, with particular strength in translating technical terms like "prajñāpāramitā" and "five aggregates" that generic translators typically misinterpret.

View Project →

My Technical Expertise & Technology Stack

Technologies and frameworks I actively work with, including but not limited to:

Programming Languages

  • Python
  • R
  • JavaScript
  • HTML/CSS
  • SQL

Machine Learning & Deep Learning

  • Scikit-learn
  • PyTorch
  • TensorFlow/Keras
  • XGBoost
  • LightGBM
  • CatBoost
  • Hugging Face
  • PyCaret
  • Imbalanced-learn
  • Feature-engine
  • Statsmodels

Natural Language Processing & Computer Vision

  • spaCy
  • NLTK
  • OpenCV
  • Transformers
  • LangChain

Model Optimization & AutoML

  • Optuna
  • Hyperopt
  • TPOT
  • H2O.ai
  • auto-sklearn
  • PyCaret
  • Ray Tune
  • SHAP
  • LIME

Data Management & Processing

  • Pandas
  • Polars
  • NumPy
  • PySpark
  • PostgreSQL
  • MySQL
  • MongoDB

Web Development & APIs

  • React
  • Bootstrap
  • Flask
  • Django
  • FastAPI
  • Node.js (Express)
  • REST APIs
  • Gradio

Time Series & Forecasting

  • Prophet
  • ARIMA
  • Statsforecast
  • sktime

Web Scraping & Data Collection

  • BeautifulSoup
  • Scrapy
  • Selenium
  • Requests
  • API Integration

Data Visualization & Business Intelligence

  • Tableau
  • Power BI
  • Matplotlib
  • Seaborn
  • Plotly
  • ggplot2
  • Streamlit
  • Dash
  • Altair
  • Bokeh
  • D3.js
  • Highcharts