About Me
I am a first-year PhD student at UCB Sky Lab. My current research focuses on building systems to improve AI workloads. Previously, I graduated from the University of Chicago with a BS in Math and CS (both with honors), mainly working with Prof. Junchen Jiang.
I previously worked on product and growth for repositories such as LMCache and vLLM Production Stack. Research-wise, I was one of the very first people to optimize KV cache reuse for LLMs.
I am always open to collaborations and to working with undergraduate students. Please check out the collaborations page if you are interested!
Selected Publications
* indicates equal contribution
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion
Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang
EuroSys 2025 (Best Paper Award!) [Paper]
CacheGen: Fast Context Loading for Language Model Applications
Yuhan Liu, Hanchen Li, Kuntai Du, Jiayi Yao, Yihua Cheng, Yuyang Huang, Shan Lu, Michael Maire, Henry Hoffmann, Ari Holtzman, Ganesh Ananthanarayanan, Junchen Jiang
SIGCOMM 2024 [Paper] [Talk] [Slides]
Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache
Hanchen Li*, Yuhan Liu*, Yihua Cheng, Kuntai Du, Junchen Jiang
NSDI Poster 2024 [Link]
All Publications (check Google Scholar)
Life
- My name in Chinese is 李翰宸 and I grew up in Nanjing, Jiangsu.
- I play soccer and basketball, and I lift weights. I also enjoy 掼蛋 (Guan Dan, a popular card game in Jiangsu, China) with friends.