Publications

* denotes equal contribution

2026

SkyWalker: A Locality-Aware Cross-Region Load Balancer for LLM Inference
Tian Xia, Ziming Mao, Jamison Kerney, Ethan J. Jackson, Zhifei Li, Jiarong Xing, Scott Shenker, Ion Stoica
EuroSys 2026 (Preprint)

SkyNomad: Cost-Effective Multi-Region Scheduling for Deadline-Sensitive Workloads on Spot Instances
Zhifei Li*, Tian Xia*, and others, Ion Stoica
OSDI 2026 (In submission) [Paper]

2025

LEANN: A Low-Storage Vector Index for Personal Devices
Yichuan Wang, Zhifei Li, Shu Liu, Yongji Wu, Ziming Mao, Yilong Zhao, Xiao Yan, Zhiying Xu, Yang Zhou, Ion Stoica, Sewon Min, Matei Zaharia, Joseph E. Gonzalez
MLSys 2026 (To appear) [Paper]

2024

Barbarians at the Gate: How AI is Upending Systems Research
Audrey Cheng*, Shu Liu*, Melissa Pan*, Zhifei Li, Bowen Wang, Alex Krentsel, Tian Xia, Mert Cemri, Jongseok Park, Shuo Yang, Jeff Chen, Aditya Desai, Jiarong Xing, Koushik Sen, Matei Zaharia, Ion Stoica
arXiv 2024

FrontierCS: The Next Frontier of Computer Science
Qiuyang Mang*, Wenhao Cai*, Zhifei Li*, Huanzhi Mao*, and others, Ion Stoica, Jingbo Shang, Zhuang Liu, Alvin Cheung
ICML 2026 (In submission) [Paper]