Publication Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement Qimin Zhong, Hao Liao, Haiming Qin, Mingyang Zhou, Rui Mao, Wei Chen, Naipeng Chao The 64th Annual Meeting of the Association for Computational Linguistics (ACL), Main Conference | July 2026
Publication PaT: Planning-after-Trial for Efficient Test-Time Code Generation Youngsik Yoon, Sungjae Lee, Seockbean Song, Siwei Wang, Wei Chen, Jungseul Ok The 64th Annual Meeting of the Association for Computational Linguistics (ACL), Main Conference | July 2026
Publication GraCE: Unlocking CUDA Graphs with Compiler Support for ML Workloads Abhishek Ghosh, Ajay Nayak, Ashish Panwar, Arkaprava Basu 2026 Operating Systems Design and Implementation | July 2026
Publication An Eye Tracking Study: Are AI Overviews Changing Search Behavior? Sara Allawati, Dana McKay, Mark Sanderson, Paul Thomas, Johanne R Trippas 2026 International ACM SIGIR Conference on Research and Development in Information Retrieval | July 2026
Publication EgoMemory: Memory-Augmented Personalized Retrieval for Long-Context Egocentric Video Yuanmin Tang, Jue Zhang, Xiaoting Qin, Jing Yu, Meikang Qiu, Gaopeng Gou, Gang Xiong, Qingwei Lin 林庆维, Saravan Rajmohan, Dongmei Zhang, Qi Wu ACL Findings | July 2026
Publication ExVerus: Verus Proof Repair via Counterexample Reasoning Jun Yang, Yuechun Sun, Yi Wu, Rodrigo Caridad, Yongwei Yuan, Jianan Yao, Shan Lu, Kexin Pei International Conference on Machine Learning | July 2026
Publication Differentially Private Synthetic Data via APIs 4: Tabular Data Toan Tran, Arturs Backurs, Zinan Lin, Victor Reis, Li Xiong, Sergey Yekhanin ICML 2026 | July 2026
Publication Rearchitecting the Datacenter Lifecycle for AI Jovan Stojkovic, Chaojie Zhang, Íñigo Goiri, Ricardo Bianchini ISCA | June 2026
Publication Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky ACM SIGMETRICS 2026 | June 2026
Publication PhaseWeave: Phase-Aware Execution on Heterogeneous Chiplet Architectures for Datacenters Joshua Kim, Chaojie Zhang, Íñigo Goiri, Chris Rossbach, Jovan Stojkovic ISCA | June 2026