work

Google
Software Engineer
2025.11 - Now

Improving Gemini training and serving performance on TPU Improving third party TPU experience by TorchTPU

Amazon, Annapurna Labs
Software Engineer
2025.3 - 2025.11

Worked at Annapurna Labs, targeting LLM training/inference acceleration on Trainium chips. Primarily worked in ML Compiler Optimization, Kernel Language Design, Kernel Optimization.

Amazon, Annapurna Labs
Software Engineer Intern
2024.5 - 2024.8

Developed the first-generation ML-based cost model and autotuning infra for Compiler and Kernels. Significantly improved the training and inference performance (14.7% for Llama3.1) on the Trainium chips. Recognized as one of the most impactful internship and earned a Certificate of Appreciation.

NFTGo
Machine Learning Engineer Intern
2023.2 - 2023.6

Built an NFT pricing service powered by machine learning. Developed regression models using historical transaction data and NFT features. Deployed the system using FastAPI, MongoDB, Redis, Apache Airflow, Docker, and Kubernetes.

TikTok
Software Engineer Intern
2022.5 - 2022.8

Developed TikTok 3D Game Engine for interactive AR/VR stickers. Implemented Motion Matching animation system in C++ for realistic avatar control. Built SDK for Skeleton Retargeting in C++ and Lua, and integrated Text-to-Animation algorithms.