Skip to content
View khazar08's full-sized avatar
  • University of California, Berkeley
  • Berkeley, California
  • 09:07 (UTC +04:00)
  • LinkedIn in/khazar-huseynov

Block or report khazar08

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
khazar08/README.md

Khazar Huseynov

Data Science + Economics (Honors) @ UC Berkeley, minor in Mathematics · Full Merit Scholarship

I build at the intersection of data, economics, and machine learning, focused on quantitative finance and applied ML.

Open to: Data Science · Quantitative Research · ML Internships

LinkedIn Email


Featured Projects

Pairs-trading engine with Kalman-filter dynamic hedge ratios and walk-forward OOS validation over ~14 years — 1.02 out-of-sample Sharpe, survived Deflated Sharpe Ratio correction for multiple-testing bias. Python statsmodels pytest

LoRA fine-tune of Qwen2.5-3B for a low-resource language — +8pp on Belebele (22→30%) and 4× SIB-200 macro-F1 vs. few-shot. A native-authored eval harness exposes chrF++ as a poor quality proxy (Cohen's κ = 0.000). Python PyTorch LoRA

S/T/X/R/DR meta-learners for treatment-effect (CATE) estimation on Criteo's 14M-row benchmark, with Qini/AUUC metrics from scratch. The X-learner beats a response-model baseline by 28% on top-decile targeting, exposing outcome ROC-AUC as the wrong objective for incrementality. Python EconML scikit-uplift

3-state Gaussian HMM (EM) driving VaR/CVaR estimation with 94.7% empirical coverage on SPY. Performance-critical path implemented in C++. Python C++ NumPy

Character-network and sentiment analysis across 134 episodes of Suits (VADER, spaCy, PageRank, LSTM, XGBoost) with an interactive Streamlit dashboard. Python spaCy Streamlit


Tech Stack

Languages

Python R SQL MATLAB TypeScript C++

Machine Learning / AI

PyTorch TensorFlow scikit-learn XGBoost LightGBM CatBoost Hugging Face PEFT / LoRA EconML

Data Science / Statistics

pandas NumPy SciPy statsmodels NLTK spaCy NetworkX

Visualization

Plotly Matplotlib Seaborn Streamlit

MLOps / Tooling

Jupyter Docker MLflow Weights & Biases pytest Git LaTeX

Frontend / Web

React Next.js HTML/CSS

Pinned Loading

  1. azerbaijani-LLM azerbaijani-LLM Public

    Fine-tuned a 2B multilingual LLM for Azerbaijani instruction-following (QLoRA, single GPU); built a native-authored eval set and a human + LLM-judge harness with documented failure-mode analysis.

    Python 1

  2. regime-monte-carlo regime-monte-carlo Public

    A regime-switching Monte Carlo engine for simulating equity price-path distributions and quantifying tail risk.

    Python 1

  3. statarb-engine statarb-engine Public

    An event-driven backtesting framework that runs a cointegration-based pairs-trading strategy on US equities with realistic frictions and a validation methodology designed to defeat the lookahead, s…

    Python 1

  4. criteo-uplift criteo-uplift Public

    Incrementality & CATE estimation on Criteo's 14M-row randomized benchmark, proving ROC-AUC is the wrong metric for ad targeting and identifying persuadable users via S/T/X/R/DR meta-learners and ca…

    Jupyter Notebook

  5. suits-nlp suits-nlp Public

    Reverse-engineering of power dynamics of the best TV show of all time — Suits — using web scraping, NLP, graph machine learning, and predictive modeling.

    Python 1

  6. quantr quantr Public

    Interactive R/Shiny dashboard for portfolio optimization, Monte Carlo simulation, risk metrics, and ARIMA forecasting with live Yahoo Finance data. Built this for some R practice

    R 1