Publication
Kosmos-2.5: A Multimodal Literate Model
Project
DragNUWA
DragNUWA is a video generation model that utilizes text, images, and trajectory as three essential control factors to facilitate highly controllable video generation. DragNUWA is a video generation model that utilizes text, images, and trajectory…
Publication