Publication TSRFormer: Table Structure Recognition with Transformers Weihong Lin, Zheng Sun, Chixiang Ma, Mingze Li, Jiawei Wang, Lei Sun, Qiang Huo 2022 ACM Multimedia | October 2022
Publication Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada European Conference on Computer Vision (ECCV) | October 2022
Publication Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel Coelho de Castro, Anton Schwaighofer, Stephanie Hyland, Maria Teodora Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez-Valle, Hoifung Poon, Ozan Oktay The European Conference on Computer Vision (ECCV) | October 2022 Video Access Access Project
Publication RaVÆn: unsupervised change detection of extreme events using ML on-board satellites Vít Růžička, Anna Vaughan, Daniele De Martini, James Fulton, Valentina Salvatelli, Chris Bridges, Gonzalo Mateo-Garcia, Valentina Zantedeschi Nature Scientific Reports | October 2022
Publication Bringing Rolling Shutter Images Alive with Dual Reversed Distortion Zhihang Zhong, Mingdeng Cao, Xiao Sun, Zhirong Wu, Zhongyi Zhou, Yinqiang Zheng, Stephen Lin, Imari Sato European Conference on Computer Vision (ECCV) | October 2022
Publication LaMAR: Benchmarking Localization and Mapping for Augmented Reality Paul-Edouard Sarlin, Mihai Dusmanu, Johannes L. Schönberger, Pablo Speciale, Lukas Gruber, Viktor Larsson, Ondrej Miksik, Marc Pollefeys ECCV 2022 | October 2022
Publication DaViT: Dual Attention Vision Transformers Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan European Conference on Computer Vision (ECCV 2022) | October 2022
Publication VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning Che Wang, Xufang Luo, Keith Ross, Dongsheng Li 2022 Neural Information Processing Systems | September 2022
Publication PACT: Perception-Action Causal Transformer for Autoregressive Robotics Pre-Training Rogerio Bonatti, Sai Vemprala, Shuang Ma, Felipe Vieira Frujeri, Shuhang Chen, Ashish Kapoor IROS 2023 | September 2022 MoBT 17.1 / MoBIP 17.1
Publication Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning Yujia Xie, Luowei Zhou, Xiyang Dai, Lu Yuan, Nguyen Bach, Ce Liu, Michael Zeng 2022 Neural Information Processing Systems | September 2022