Haoran Qiu | Microsoft AzRS

Rodrigo Fonseca

Latest

StreamWise: Serving Multi-Modal Generation in Real-Time at Scale
Sherlock: Reliable and Efficient Agentic Workflow Execution
Murakkab: Resource-Efficient Agentic Workflow Orchestration in Cloud Platforms
ModServe: Modality- and Stage-Aware Resource Disaggregation for Scalable Multimodal Model Serving
Towards Efficient Large Multimodal Model Serving
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms

© 2026 Haoran Qiu · Powered by the Academic theme for Hugo.
