About Me

I am a 1st year PhD student at UCB Sky Lab. My current research focus is on building systems to improve AI workloads. Previously, I graduated from University of Chicago with BS in Math and CS (both honors), mainly working with Prof. Junchen Jiang.

I previously worked for product and growth for repositories like LMCache and vLLM Production Stack. Research-wise, I was one of the very first people to optimize KV cache reuse for LLMs.

I am always open to collaborations and working with undergraduate students. Please checkout the collaborations page if you are interested!

Selected Publications

* indicates equivalent contribution

  • CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion
    Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang
    EuroSys 2025 (Best Paper Award!) [Paper]

  • CacheGen: Fast Context Loading for Language Model Applications
    Yuhan Liu, Hanchen Li, Kuntai Du, Jiayi Yao, Yihua Cheng, Yuyang Huang, Shan Lu, Michael Maire, Henry Hoffmann, Ari Holtzman, Ganesh Ananthanarayanan, Junchen Jiang
    SIGCOMM 2024 [Paper] [Talk] [Slides]

  • Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache
    Hanchen Li*, Yuhan Liu*, Yihua Cheng, Kuntai Du, Junchen Jiang
    NSDI Poster 2024 [Link]

All Publications (check Google Scholar)

Life

  • My name in Chinese is 李翰宸 and I grew up in Nanjing, Jiangsu.
  • I play soccer, basketball, and weightlift. I also enjoy 掼蛋(Guan Dan, a popular card game in JiangsuChina) with friends.