Search Results for author: Lingmin Ran

Found 3 papers, 2 papers with code

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

no code implementations • 4 Dec 2023 • Lingmin Ran, Xiaodong Cun, Jia-Wei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou

To enhance the guidance ability of X-Adapter, we employ a null-text training strategy for the upgraded model.

Denoising

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

1 code implementation • 27 Sep 2023 • David Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao, Lingmin Ran, YuChao Gu, Difei Gao, Mike Zheng Shou

In this paper, we are the first to propose a hybrid model, dubbed Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation.

Ranked #2 on Text-to-Video Generation on EvalCrafter Text-to-Video (ECTV) Dataset (using extra training data)

Text-to-Video Generation, Video Alignment (+1 more)

1,071

AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant

2 code implementations • 30 Nov 2021 • Stan Weixian Lei, Difei Gao, Yuxuan Wang, Dongxing Mao, Zihan Liang, Lingmin Ran, Mike Zheng Shou

In contrast, we present a new task called Task-oriented Question-driven Video Segment Retrieval (TQVSR).

Question Answering, Retrieval (+2 more)
