Publication SRC: A Scalable Reliable Connection for RDMA with Decoupled QPs and Connections Yiren Zhao, Ran Shu, Yongqiang Xiong APNet’25 | August 2025
Publication MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention Yucheng Li, Huiqiang Jiang, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu ICML 2025 | July 2025 Github Project
Publication LongRoPE2: Near-Lossless LLM Context Window Scaling Ning Shang , Li Lyna Zhang, Siyuan Wang, Gaokai Zhang, Gilsinia Lopez, Fan Yang, Weizhu Chen, Mao Yang ICML 2025 | July 2025
Publication Offline Learning for Combinatorial Multi-armed Bandits Xutong Liu, Xiangxiang Dai, Jinhang Zuo, Siwei Wang, Carlee Joe-Wong, John C.S. Lui, Wei Chen Proceedings of the 42nd International Conference on Machine Learning (ICML) | July 2025
Publication PipeThreader: Software-Defined Pipelining for Efficient DNN Execution Yu Cheng, Lei Wang, Yining Shi, Yuqing Xia, Lingxiao Ma, Jilong Xue, Yang Wang, Zhiwen Mo, Feiyang Chen, Fan Yang, Mao Yang, Zhi Yang OSDI | July 2025
Publication Pretraining Context Compressor for Large Language Models with Embedding-Based Memory Yuhong Dai, Jianxun Lian, Yitian Huang, Wei Zhang, Mingyang Zhou, Mingqi Wu, Xing Xie, Hao Liao ACL 2025 | July 2025
Publication SocialCC: Interactive Evaluation for Cultural Competence in Language Agents Jincenzi Wu, Jianxun Lian, DingDong Wang, Helen Meng ACL 2025 | July 2025
Publication Reduction Fusion for Optimized Distributed Data-Parallel Computations via Inverse Recomputation Haoxiang Lin, Yang Wang, Yanjie Gao, Hongyu Zhang, Ming Wu, Mao Yang FSE 2025 | June 2025 The ACM International Conference on the Foundations of Software Engineering, Ideas, Visions and Reflections Track Project
Publication dl²: Detecting Communication Deadlocks in Deep Learning Jobs Yanjie Gao, Jiyu Luo, Haoxiang Lin, Hongyu Zhang, Ming Wu, Mao Yang FSE 2025 | June 2025 The ACM International Conference on the Foundations of Software Engineering, Industry Track Project
Publication An Empirical Study of Issues in Large Language Model Training Systems Yanjie Gao, Ruiming Lu, Haoxiang Lin, Yueguo Chen FSE 2025 | June 2025 The ACM International Conference on the Foundations of Software Engineering, Industry Track Project