Publication
Searching the Search Space of Vision Transformer
Publication
BEVT: BERT Pretraining of Video Transformers
Microsoft Research Blog
Unlocking new dimensions in image-generation research with Manifold Matching via Metric Learning
Generative image models offer a unique value by creating new images. Such images can be sharp super-resolution versions of existing images or even realistic-looking synthetic photographs. Generative Adversarial Networks (GANs) and their variants have demonstrated…
Video
Full-Body Motion from a Single Head-Mounted Device: Generating SMPL Poses from Partial Observations
The increased availability and maturity of head-mounted and wearable devices opens up opportunities for remote communication and collaboration. However, the signal streams provided by these devices (e.g., head pose, hand pose, and gaze direction) do…