Hi, I'm Xiyu Liang.
I obtained my bachelor's degree from the University of Electronic Science and Technology of China (UESTC). Currently, I am a research intern at Georgia Tech, under the guidance of Prof. Yingyan (Celine) Lin. Previously, I was a research assistant in the Spatio-Temporal Big Data and Intelligence Lab, where I worked on spatio-temporal data mining under the supervision of Prof. Shuo Shang and Prof. Lisi Chen. I am also a member of ColAI (now DreamSoul-AI, an AI startup), where I work closely with Dr. Enmao Diao.
My research interests lie primarily in machine learning and natural language processing. I am particularly passionate about building large-scale machine learning models and systems that achieve lower cost, higher speed, and better performance. I am also exploring trustworthy AI, with the aim of making AI systems more reliable and accessible for humans.

Yuzhe Gu, Xiyu Liang, Jiaojiao Zhao, Enmao Diao
Under review. 2026
Optimal Brain Cache (OBCache) is a principled framework that formulates KV cache eviction as a layer-wise structured pruning problem. OBCache quantifies token saliency by measuring the perturbation in attention outputs induced by pruning tokens, with closed-form scores derived for isolated keys, isolated values, and joint key-value pairs.