Recent News

Publications

ATC'24
Power-aware Deep Learning Model Serving with µ-Serve.

In Proceedings of the 2024 USENIX Annual Technical Conference (ATC 2024).
  Artifact Available, Functional, Reproduced

PDF Preprint Code Slides Video

DSN'24
When Green Computing Meets Performance and Resilience SLOs.

In Proceedings of the 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2024 Distrupt Track).
  Selected at the NSF Workshop on Sustainable Computing for Sustainability 2024

PDF DOI

AIOps'24 @ASPLOS'24
QLM: Queue Management for Large Language Model Serving.

In Proceedings of the 5th International Workshop on Cloud Intelligence / AIOps (AIOps 2024).

PDF Preprint

MLSys Workshop @NeurIPS'23
On the Promise and Challenges of Foundation Models for Learning-based Cloud Systems Management.

In Proceedings of the 7th Workshop on ML for Systems at NeurIPS 2023 (MLSys Workshop 2023).
  Selected for Spotlight Presentation

PDF Slides

MLSys Workshop @NeurIPS'23
PARM: Adaptive Resource Allocation for Datacenter Power Capping.

In Proceedings of the 7th Workshop on ML for Systems at NeurIPS 2023 (MLSys Workshop 2023).

PDF

ATC'23
AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems.

In Proceedings of the 2023 USENIX Annual Technical Conference (ATC 2023).
  Artifact Available, Functional, Reproduced
  Selected for Presentation at KubeCon + CloudNativeCon North America 2023

PDF Code Slides Video

NeurIPS'23
Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity.

In Proceedings of the 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023).

PDF

NeurIPS'22
A Mean-Field Game Approach to Cloud Resource Management with Function Approximation.

In Proceedings of the 36th Annual Conference on Neural Information Processing Systems (NeurIPS 2022).

PDF Slides Video

CompSys Workshop @IPDPS'22
Evaluating Hardware Memory Disaggregation under Delay and Contention.

In Proceedings of the 1st Workshop on Composable Systems Co-located with IPDPS 2022 (COMPSYS 2022).
  Best Presentation Award

PDF Video DOI

EuroMLSys Workshop @EuroSys'22
Reinforcement Learning for Resource Management in Multi-tenant Serverless Platforms.

In Proceedings of the 2nd European Workshop on Machine Learning and Systems Co-located with EuroSys 2022 (EuroMLSys 2022).

PDF Slides Video DOI

WoSC @Middleware'21
Is Function-as-a-Service a Good Fit for Latency-Critical Services?.

In Proceedings of the 7th International Workshop on Serverless Computing Co-located with ACM/IFIP Middleware 2021 (WoSC7).

PDF Code Slides Video DOI

OSDI'20
FIRM: An Intelligent Fine-Grained Resource Management Frameworkfor SLO-Oriented Microservices.

In Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2020).
  Artifact Available, Functional, Reproduced
  Selected as ACM ICPE 2024 Data Challenge

PDF Code Dataset Slides Video DOI

DSN'18
OWL: Understanding and Detecting Concurrency Attacks.

In Proceedings of the 48th IEEE International Conference on Dependable Systems and Networks (DSN 2018).
  CVE-2017-12193, CVE-2017-7533

PDF Code Slides DOI

NSDI'18
PLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance.

In Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 2018).

PDF Code Slides Video DOI

Honors & Awards

  • Best Paper Award Finalist, L4DC 2024
  • ML and Systems Rising Stars, MLCommons, 2023
  • Mavis Future Faculty Fellowship, UIUC, 2023–24
  • UIUC CS PhD Fellowship, UIUC, 2023–24
  • Best Presentation Award, Workshop on Composable Systems (IPDPS), 2022
  • Yunni & Maxine Pao Memorial Fellowship, UIUC, 2021
  • Conference Presentation Award, UIUC, 2020
  • Travel Grants: MLSys 2024, USENIX OSDI/ATC 2023, DSN 2022, ACM SIGMETRICS 2021
  • Best Undergraduate Thesis 2nd Runner-up, HKU, 2019
  • Dean’s Honour List, HKU, 2016–19
  • International Student Academic Excellence Award, University of Wisconsin-Madison, 2018
  • Lee Shau Kee Scholarships for Student Enrichment, HKU, 2017
  • Honorable Mention in Mathematical Contest In Modeling, COMAP, 2017
  • HKU Foundation Scholarships For Outstanding Students, 2015–19

Services

Review Services:

  • Program Committee, ATC 2025
  • Program Committee, ICDCS 2025
  • Organizing Committee, ML for Systems Workshop, NeurIPS 2024
  • Program Committee, Workshop on Cloud Intelligence / AIOps, ASPLOS 2024
  • Program Committee, ML for Systems Workshop, NeurIPS 2023
  • Program Committee, DSN 2023 Doctoral Forum
  • Reviewer, IEEE Transactions on Parallel and Distributed Systems (TPDS), 2024
  • Reviewer, ACM Transactions on Architecture and Code Optimization (TACO), 2024
  • Reviewer, Journal of Systems Research (JSys), 2024
  • Reviewer, ACM Trasactions on Software Engineering Methodology, 2024
  • Reviewer, IEEE Transactions on Automation Science and Engineering, 2024
  • Reviewer, Sustainable Computing: Informatics and Systems, 2024
  • Reviewer, ACM Transactions on Architecture and Code Optimization (TACO), 2023
  • Reviewer, IEEE Internet of Things Journal, 2022
  • External Program Committee: EuroSys 2025, ATC 2024, EuroSys 2024, EuroSys 2022
  • Artifact Evaluation Committee: SOSP 2023, EuroSys 2023, MLSys 2023, OSDI/ATC 2022
  • Session Chair, Machine Learning Session, UIUC CSL Student Conference 2021

Community Services:

  • Mentor, Promoting Undergraduate Research in Engineering (PURE), UIUC, 2023
  • Mentor, Illinois Science and Technology Coalition (ISTC), 2023–24
  • Member, Institute for Inclusion, Diversity, Equity & Access (IDEA), UIUC, 2023
  • Mentor, Undergraduate Research Experience (URE) at IIDAI, 2022–23

Teaching

  • Fall 2023, UIUC CS 598 ML and Data Systems, Teaching Assistant
  • Fall 2023, UIUC ECE 598 Dependable AI Systems, Course Assistant
  • Fall 2022, UIUC ECE 471 Data Science Analytics, Course Assistant
  • Fall 2021, UIUC CS 536 Design of Fault-Tolerant Digital Systems, Course Assistant
  • Fall 2017, HKU COMP 2396 Object-oriented Programming, Teaching Assistant

Experience

 
 
 
 
 

Systems Researcher, Microsoft Azure Research – Systems

  Microsoft Azure Research

Jul 2024 – Present Redmond, WA
 
 
 
 
 

PhD Intern, SystemsResearch@Google (SRG)

  Google LLC

May 2023 – Aug 2023 Mountain View, CA
 
 
 
 
 

Visiting Researcher, Hybrid Cloud and AI

  IBM Research

Sep 2022 – Dec 2022 Yorktown Heights, NY
 
 
 
 
 

PhD Intern, Google Cloud Infrastructure

  Google LLC

May 2022 – Aug 2022 Sunnyvale, CA
 
 
 
 
 

Research Intern, Systems Research Group

  Microsoft Research

May 2021 – Aug 2021 Redmond, WA (Remote)
 
 
 
 
 

PhD Intern, Google Cloud Infrastructure

  Google LLC

May 2020 – Aug 2020 Sunnyvale, CA (Remote)