Biography

I am a second-year Ph.D. candidate in Computer Science at the University of Virginia, working under the supervision of Prof. Yue Cheng in the DS² Lab. My research interests span machine-learning systems, storage systems, and distributed systems.

Prior to UVA, I received my M.S. in Computer Science from Boston University and my B.S. from Hangzhou Dianzi University. I am dedicated to building efficient and scalable systems for next-generation data-intensive applications.

My research focuses on addressing challenges in real-world storage systems, driven by the complexities of modern data-intensive computer systems. I am particularly interested in serverless AI, storage systems for AI, and serverless computing, taking an end-to-end approach that spans applications, middleware, platforms, and low-level operating systems.

Education

  • 2024 – present
    Ph.D. in Computer Science
    University of Virginia
  • 2022 – 2024
    M.S. in Computer Science
    Boston University
  • 2018 – 2022
    B.S. in Computer Science
    Hangzhou Dianzi University

News

Publications

  • MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing
    Zhaoyuan Su, Zeyu Zhang, Tingfeng Lan, Zirui Wang, Haiying Shen, Juncheng Yang, Yue Cheng
    Ninth Annual Conference on Machine Learning and Systems (MLSys 2026)
  • λScale: Enabling Fast Scaling for Serverless Large Language Model Inference
    Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Zirui Wang, Yue Cheng, Wei Wang, Ao Wang, Ruichuan Chen
    Ninth Annual Conference on Machine Learning and Systems (MLSys 2026)
  • Towards Efficient LLM Storage Reduction via Tensor Deduplication and Delta Compression
    Zirui Wang, Tingfeng Lan, Zhaoyuan Su, Juncheng Yang, Yue Cheng
    23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI 2026)
  • Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask
    Zhaoyuan Su, Ammar Ahmed, Zirui Wang, Ali Anwar, Yue Cheng
    50th International Conference on Very Large Data Bases (VLDB 2024)
  • Temporal Cue Guided Video Highlight Detection with Low-Rank Audio-Visual Fusion
    Qinghao Ye*, Xiyue Shen*, Yuan Gao*, Zirui Wang*, Qi Bi, Ping Li, Guang Yang
    International Conference on Computer Vision (ICCV 2021)

Academic Service

  • 2026
    Artifact Evaluation Committee — USENIX OSDI 2026
  • 2026
    Artifact Evaluation Committee — USENIX NSDI 2026 (Fall)
  • 2026
    Shadow Program Committee — EuroSys 2026
  • 2026
    Reviewer — ACM Transactions on Storage (TOS)

Outside Research

Playing basketball Watching F1 Watching CS2 esports Age of Empires IV — World Top 1000 Playing League of Legends