Data Scientist & AI Engineer

Hang Chen

Building intelligent systems at the intersection of LLM agents, machine learning, quantitative research, and market data infrastructure.

About

Turning data into intelligent solutions

I'm a Data Scientist and AI Engineer passionate about building systems that learn and adapt. My work spans the full spectrum from research to production, with a focus on creating practical AI solutions that deliver measurable impact.

With expertise in LLM agents, machine learning, and quantitative research, I specialize in developing intelligent systems that can reason, analyze, and act on complex data. My background in market data infrastructure ensures these solutions are built for scale and reliability.

I believe in the power of combining rigorous statistical methods with modern AI techniques to solve challenging problems in finance, technology, and beyond.

LLM Agents

Designing and deploying autonomous AI agents using large language models for complex reasoning and task automation.

Machine Learning

Building and optimizing ML pipelines, from feature engineering to model deployment at scale.

Quantitative Research

Developing statistical models and trading strategies with rigorous backtesting frameworks.

Market Data Infrastructure

Architecting high-performance data pipelines for high-frequency tick data, reference data, and corporate actions across global equities and futures markets.

Experience

Where I've worked

2023 — Present

Senior Data Scientist

Quantitative Research Firm

Lead research on various data (high-frequency data, corp actions, reference data, supply chain, short interests, stock options, etc.), supporting ML-driven alpha discovery. Architected and deployed agentic LLM workflows to automate data preprocessing.

PythonLLMSQLAWSDagsterDuckDBC++
2022 — 2023

Senior Data Scientist

Block

Developed ML models for payment risk detection. Built pipelines (training ML, fine-tuning LLM) to evaluate value of 3rd party data such as D&B data and Google/Yelp reviews data.

PythonMachine LearningMLFlowAWSSparkSQLPrefect
2019 — 2022

Collaborated with traders to develop ML driven pre-trade strategy and post trade analytics. Alt data research (weather, fundamental, esg, etc.). Infrastructure development such as various investment data pipellines and backtesting system.

PythonRSQLBloomberg APIMATLAB
2017 — 2019

Manager, Decision Sciences

Scotiabank

Leveraged big data analytics/models to create value/insights for customer strategy, product/marketing and risk management.

PythonSQLGCPMILP Optimization
2015 — 2017

Data Scientist, Business Advisory

Moneris Solutions

Provided analytics consulting service for sales improvement, marketing retention/growth, and risk management.

PythonSQLTableauSAS
Projects

Featured work

LLM Agents

Multiple LLM Agents for market data quality and market data research.

PythonLangChainLLMFastAPI

Market Data Pipeline

High-performance market data processing system handling millions of events per second. Delivered high-frequency data and reference/corp-actions data with extreme high quality.

PythonC++SQL

Corporate Actions Demo

A demo for US Corporate Actions.

PythonLLMLangfuseSQL

Quantitative Backtesting

Quantitative backtesting of signals.

Python

Market Microstructure

Market Microstructure Research and Insights.

Python
Resume

Background & credentials

Education

Master of Applied Science

University of Toronto

2014Engineering

Bachelor of Science

Huazhong Univ. of Sci. & Tech.

2012Engineering

Certifications

CFA Charter Holder

Deep Learning Specialization (Coursera)

Publications

Contact

Get in touch

I'm always interested in discussing new opportunities, collaborations, or just having a conversation about AI and data science.