Multi-agent orchestration
Designing agent pipelines that coordinate inbox parsing, evidence extraction, state updates, and follow-up planning while keeping important mutations behind review gates.
AI Engineering
I build AI systems as software systems: agents with tools, retrieval, memory, evaluation, observability, and controlled state changes. The goal is not a simple chatbot demo; it is turning messy context into reliable workflows that can be inspected, measured, and improved.
Designing agent pipelines that coordinate inbox parsing, evidence extraction, state updates, and follow-up planning while keeping important mutations behind review gates.
Building systems around citations, durable memory, failure cases, retrieval quality, and eval contracts instead of treating a model call as the product.
Optimizing latency, cost, tool-call budgets, prompt/data contracts, model-serving constraints, and observability so AI features can survive real workflows.
A Gemma-powered recruiting mailbox pipeline with Gmail evidence, career memory, multi-agent extraction, traceable review gates, and controlled application-state updates.
A local-first public proof package with sanitized recruiter threads, optional Gemma/Ollama analysis, deterministic fallback, model traces, and pipeline eval results.
A compact AlphaZero-style research baseline with Xiangqi rules, PyTorch policy-value modeling, legal policy masking, MCTS, self-play, training, and evaluation scripts.
An AI-assisted travel planning app that turns trip context into structured itinerary recommendations and product-ready planning flows.
For broader engineering work, review the full project list or the cloud and data systems page. For opportunities, contact Hanbin directly.