no code implementations • 21 Mar 2024 • Max Ku, Cong Wei, Weiming Ren, Harry Yang, Wenhu Chen
In the second stage, AnyV2V can plug in any existing image-to-video model to perform DDIM inversion and intermediate feature injection, maintaining appearance and motion consistency with the source video.
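The DDIM inversion mentioned above runs the deterministic sampler backwards to recover a noisy latent from which the source video can be re-generated and edited. A minimal single-step sketch (illustrative only; `abar_t`/`abar_next` denote the cumulative noise-schedule products, and the function names are assumptions, not AnyV2V's API):

```python
import numpy as np

def ddim_inversion_step(x_t, eps_pred, abar_t, abar_next):
    """One deterministic DDIM inversion step (minimal sketch).

    Maps a sample at noise level abar_t to the noisier level abar_next
    using the model's noise prediction eps_pred. Iterating this over all
    timesteps recovers an initial latent for the source video.
    """
    # Predicted clean sample implied by the current latent and noise estimate.
    x0_pred = (x_t - np.sqrt(1.0 - abar_t) * eps_pred) / np.sqrt(abar_t)
    # Re-noise the predicted clean sample to the next (noisier) level.
    return np.sqrt(abar_next) * x0_pred + np.sqrt(1.0 - abar_next) * eps_pred
```

If the noise prediction is exact, the step is invertible: the clean sample can be recovered from the output at the new noise level.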
1 code implementation • 6 Feb 2024 • Weiming Ren, Harry Yang, Ge Zhang, Cong Wei, Xinrun Du, Stephen Huang, Wenhu Chen
To verify the effectiveness of our method, we propose I2V-Bench, a comprehensive evaluation benchmark for I2V generation.
no code implementations • 17 Apr 2023 • Jie An, Songyang Zhang, Harry Yang, Sonal Gupta, Jia-Bin Huang, Jiebo Luo, Xi Yin
In contrast, we propose a parameter-free temporal shift module that can leverage the spatial U-Net as is for video generation.
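A parameter-free temporal shift in this spirit can be sketched as follows: a fraction of channels is shifted forward or backward along the time axis, so an unmodified spatial layer sees information from neighboring frames at zero parameter cost. This is an illustrative sketch, not the paper's implementation; the `shift_div` fraction is an assumption:

```python
import numpy as np

def temporal_shift(x, shift_div=8):
    """Parameter-free temporal shift (minimal sketch).

    x: array of shape (batch, time, channels, height, width).
    Shifts 1/shift_div of the channels one step forward in time,
    the next 1/shift_div one step backward, and leaves the rest as-is.
    """
    b, t, c, h, w = x.shape
    fold = c // shift_div
    out = np.zeros_like(x)
    out[:, 1:, :fold] = x[:, :-1, :fold]                    # shift forward in time
    out[:, :-1, fold:2 * fold] = x[:, 1:, fold:2 * fold]    # shift backward in time
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]               # remaining channels unchanged
    return out
```

Because the shift only moves activations, the surrounding spatial layers keep their pretrained weights untouched.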
2 code implementations • 29 Sep 2022 • Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman
We propose Make-A-Video -- an approach for directly translating the tremendous recent progress in Text-to-Image (T2I) generation to Text-to-Video (T2V).
Ranked #3 on Text-to-Video Generation on MSR-VTT (CLIP-FID metric)
2 code implementations • 29 Jun 2022 • Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania
We show that the effectiveness of the well-celebrated Mixup [Zhang et al., 2018] can be further improved if, instead of being used as the sole learning objective, it is applied as an additional regularizer alongside the standard cross-entropy loss.
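The objective described above pairs a standard cross-entropy term on clean inputs with a Mixup cross-entropy term on interpolated inputs. A minimal sketch (the weighting factor `eta` and function names are assumptions for illustration, not the paper's exact formulation):

```python
import numpy as np

def mixup_regularized_loss(logits_clean, logits_mixed, y, y_perm, lam, eta=1.0):
    """Cross-entropy on clean inputs plus a Mixup term as a regularizer.

    logits_clean: model outputs on the original batch, shape (n, k).
    logits_mixed: model outputs on lam * x + (1 - lam) * x[perm].
    y, y_perm:    integer labels for the batch and its permutation.
    lam:          Mixup coefficient, typically drawn from Beta(alpha, alpha).
    eta:          weight on the Mixup term (assumed hyperparameter).
    """
    def ce(logits, labels):
        z = logits - logits.max(axis=1, keepdims=True)  # numerical stability
        logp = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
        return -logp[np.arange(len(labels)), labels].mean()

    # Mixup cross-entropy: convex combination of losses against both labels.
    mix_ce = lam * ce(logits_mixed, y) + (1.0 - lam) * ce(logits_mixed, y_perm)
    return ce(logits_clean, y) + eta * mix_ce
```

Keeping the plain cross-entropy term means the network still sees undistorted targets, while the Mixup term acts purely as a regularizer.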
2 code implementations • 17 Apr 2022 • Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh
Altogether, MUGEN can help progress research in many tasks in multimodal understanding and generation.
1 code implementation • 7 Apr 2022 • Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh
Videos are created to express emotion, exchange information, and share experiences.
Ranked #16 on Video Generation on UCF-101
no code implementations • 29 Sep 2021 • Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip Torr, Puneet K. Dokania
We propose an extremely simple approach to regularize a single deterministic neural network to obtain improved accuracy and reliable uncertainty estimates.
no code implementations • ICCV 2021 • Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge Belongie, Ser-Nam Lim
Adversarial training with these examples enables the model to withstand a wide range of attacks by observing a variety of input alterations during training.
no code implementations • 20 Nov 2019 • Omid Poursaeed, Tianxing Jiang, Yordanos Goshu, Harry Yang, Serge Belongie, Ser-Nam Lim
We propose a novel approach for generating unrestricted adversarial examples by manipulating fine-grained aspects of image generation.