Publication Worker Discretion Advised: Co-designing Risk Disclosure in Crowdsourced Responsible AI (RAI) Content Work Alice Qian, Ziqi Yang, Ryland Shaw, Jina Suh, Laura Dabbish, Hong Shen September 2025
Publication TableTalk: Scaffolding Spreadsheet Development with a Language Agent Jenny T. Liang, Aayush Kumar, Yasharth Bajpai, Sumit Gulwani, Vu Le, Chris Parnin, Arjun Radhakrishna, Ashish Tiwari, Emerson Murphy-Hill, Gustavo Soares ACM Transactions on Computer-Human Interaction | September 2025, Vol abs/2502.09787
Publication EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing Tianyu Chen, Yasi Zhang, Zhi Zhang, Peiyu Yu, Shu Wang, Zhendong Wang, Kevin Lin, Xiaofei Wang, Zhengyuan Yang, Linjie Li, Chung-Ching Lin, Jianwen Xie, Oscar Leong, Lijuan Wang, Ying Nian Wu, Mingyuan Zhou September 2025
Publication HistoryBankQA: Multilingual Temporal Question Answering on Historical Events Biswadip Mandal, Anant Khandelwal, Manish Gupta September 2025
Publication Good Vibrations? A Qualitative Study of Co-Creation, Communication, Flow, and Trust in Vibe Coding Veronica Pimenova, Sarah Fakhoury, Christian Bird, Margaret-Anne Storey, Madeline Endres September 2025
Publication MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering Wen-wai Yim, Asma Ben Abacha, Zixuan Yu, Robert Doerning, Fei Xia, Meliha Yetisgen September 2025
Publication Lost in Embeddings: Information Loss in Vision-Language Models Wenyan Li, Raphael Tang, Chengzu Li, Caiqi Zhang, Ivan Vuli'c, Anders Søgaard September 2025
Publication Graph-Enhanced Retrieval-Augmented Question Answering for E-Commerce Customer Support Piyushkumar Patel September 2025
Publication UnLoc: Leveraging Depth Uncertainties for Floorplan Localization Matthias Wüest, Francis Engelmann, Ondrej Miksik, Marc Pollefeys, Dániel Baráth ICLR 2026 | September 2025