Publications

  • MSCCL++: Rethinking GPU Communication Abstractions for AI Inference
    ASPLOS 2026 (Honourable Best Paper Award mention)

    PDF BibTeX

  • Splitwise: Efficient Generative LLM Inference Using Phase Splitting
    ISCA 2024, IEEE MICRO Top Picks 2024

    PDF BibTeX

  • TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
    NSDI 2023

    PDF BibTeX

  • Memory Optimization for Deep Networks
    ICLR 2021 (Spotlight Presentation)

    PDF Video Code BibTeX

  • RainBlock: Faster Transaction Processing in Public Blockchains
    Usenix ATC 2021

    PDF Video Code BibTeX

  • Analyzing the Impact of GDPR on Storage Systems
    HotStorage 2019

    PDF Slides BibTeX

Latest Updates

Mar 2026MSCCL++ received honourable mention for Best Paper Award at ASPLOS.
Jan 2026MSCCL++ accepted at ASPLOS'26!
Oct 2025Gave a guest lecture at UW on optimizing collective communication libraries.
Mar 2025Splitwise chosen in IEEE Micro Top Picks 2025.
Apr 2024Gave a guest lecture at Cornell on optimizing collective communication in large-scale ML systems.
Mar 2024Splitwise accepted at ISCA'24!
Aug 2023Started as a Senior Researcher at Microsoft Research, Redmond!