Search Results for author: Wenming Yang

Found 51 papers, 32 papers with code

BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network

1 code implementation • 27 May 2024 • Zongkai Zhang, Zidong Xu, Wenming Yang, Qingmin Liao, Jing-Hao Xue

To bridge these gaps, we propose a novel binarized deep convolution (BDC) unit that effectively enhances performance while increasing the number of binarized convolutional layers.

Binarization

Paper
Code

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

1 code implementation • 16 May 2024 • Zhilin Huang, Quanmin Liang, Yijie Yu, Chujun Qin, Xiawu Zheng, Kai Huang, Zikun Zhou, Wenming Yang

In this paper, we propose a bilateral event mining and complementary network (BMCNet) to fully leverage the potential of each event and capture the shared information to complement each other simultaneously.

Object Recognition Super-Resolution +1

Paper
Code

Motion-aware Latent Diffusion Models for Video Frame Interpolation

no code implementations • 21 Apr 2024 • Zhilin Huang, Yijie Yu, Ling Yang, Chujun Qin, Bing Zheng, Xiawu Zheng, Zikun Zhou, YaoWei Wang, Wenming Yang

With the advancement of AIGC, video frame interpolation (VFI) has become a crucial component in existing video generation frameworks, attracting widespread research interest.

Motion Estimation Video Frame Interpolation +1

Paper
Add Code

OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering

no code implementations • 12 Apr 2024 • Jingrui Ye, Zongkai Zhang, Yujiao Jiang, Qingmin Liao, Wenming Yang, Zongqing Lu

OccGaussian initializes 3D Gaussian distributions in the canonical space, and we perform occlusion feature query at occluded regions, the aggregated pixel-align feature is extracted to compensate for the missing information.

Paper
Add Code

Efficient Heatmap-Guided 6-Dof Grasp Detection in Cluttered Scenes

1 code implementation • IEEE ROBOTICS AND AUTOMATION LETTERS 2023 • Siang Chen, Wei Tang, Pengwei Xie, Wenming Yang, Guijin Wang

Specifically, Gaussian encoding and the grid-based strategy are applied to predict grasp heatmaps as guidance to aggregate local points into graspable regions and provide global semantic information.

Ranked #4 on Robotic Grasping on GraspNet-1Billion

Grasp Generation

Paper
Code

Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging

1 code implementation • 25 Mar 2024 • Jintong Hu, Hui Che, Zishuo Li, Wenming Yang

Ultrasound imaging is crucial for evaluating organ morphology and function, yet depth adjustment can degrade image quality and field-of-view, presenting a depth-dependent dilemma.

Decoder Image Enhancement +1

Paper
Code

Low-Trace Adaptation of Zero-shot Self-supervised Blind Image Denoising

no code implementations • 19 Mar 2024 • Jintong Hu, Bin Xia, Bingchen Li, Wenming Yang

Deep learning-based denoiser has been the focus of recent development on image denoising.

Image Denoising Self-Supervised Learning

Paper
Add Code

VmambaIR: Visual State Space Model for Image Restoration

1 code implementation • 18 Mar 2024 • Yuan Shi, Bin Xia, Xiaoyu Jin, Xing Wang, Tianyu Zhao, Xin Xia, Xuefeng Xiao, Wenming Yang

To address these challenges, we propose VmambaIR, which introduces State Space Models (SSMs) with linear complexity into comprehensive image restoration tasks.

Denoising Image Restoration +2

139

Paper
Code

VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction

no code implementations • 27 Feb 2024 • Jiaqi Lin, Zhihao LI, Xiao Tang, Jianzhuang Liu, Shiyong Liu, Jiayue Liu, Yangdi Lu, Xiaofei Wu, Songcen Xu, Youliang Yan, Wenming Yang

Existing NeRF-based methods for large scene reconstruction often have limitations in visual quality and rendering speed.

Paper
Add Code

DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication

no code implementations • 3 Feb 2024 • Yanjun Liu, Wenming Yang, Qingmin Liao

To fill this gap, we introduce DiffVein, a unified diffusion model-based framework which simultaneously addresses vein segmentation and authentication tasks.

Denoising Segmentation +1

Paper
Add Code

LLMRA: Multi-modal Large Language Model based Restoration Assistant

no code implementations • 21 Jan 2024 • Xiaoyu Jin, Yuan Shi, Bin Xia, Wenming Yang

By employing a pretrained multi-modal large language model and a vision language model, we generate text descriptions and encode them as context embedding with degradation information for the degraded image.

Image Restoration Language Modelling +1

Paper
Add Code

Binding-Adaptive Diffusion Models for Structure-Based Drug Design

1 code implementation • 15 Jan 2024 • Zhilin Huang, Ling Yang, Zaixi Zhang, Xiangxin Zhou, Yu Bao, Xiawu Zheng, Yuwei Yang, Yu Wang, Wenming Yang

Then the selected protein-ligand subcomplex is processed with SE(3)-equivariant neural networks, and transmitted back to each atom of the complex for augmenting the target-aware 3D molecule diffusion generation with binding interaction information.

Avg

Paper
Code

Diffusion-based Pose Refinement and Muti-hypothesis Generation for 3D Human Pose Estimaiton

1 code implementation • 10 Jan 2024 • Hongbo Kang, Yong Wang, Mengyuan Liu, Doudou Wu, Peng Liu, Xinlin Yuan, Wenming Yang

To address these two challenges, we propose a diffusion-based refinement framework called DRPose, which refines the output of deterministic models by reverse diffusion and achieves more suitable multi-hypothesis prediction for the current pose benchmark by multi-step refinement with multiple noises.

3D Human Pose Estimation Denoising

Paper
Code

RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation

1 code implementation • 12 Dec 2023 • Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang

Real-time multi-person pose estimation presents significant challenges in balancing speed and precision.

Ranked #1 on Multi-Person Pose Estimation on CrowdPose (using extra training data)

Multi-Person Pose Estimation

5,162

Paper
Code

DSR-Diff: Depth Map Super-Resolution with Diffusion Model

no code implementations • 16 Nov 2023 • Yuan Shi, Bin Xia, Rui Zhu, Qingmin Liao, Wenming Yang

Color-guided depth map super-resolution (CDSR) improve the spatial resolution of a low-quality depth map with the corresponding high-quality color map, benefiting various applications such as 3D reconstruction, virtual reality, and augmented reality.

3D Reconstruction Depth Map Super-Resolution

Paper
Add Code

LAVSS: Location-Guided Audio-Visual Spatial Audio Separation

no code implementations • 31 Oct 2023 • Yuxin Ye, Wenming Yang, Yapeng Tian

LAVSS is inspired by the correlation between spatial audio and visual location.

Paper
Add Code

CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval

no code implementations • 18 Sep 2023 • Yating Liu, Yaowei Li, Zimo Liu, Wenming Yang, YaoWei Wang, Qingmin Liao

Text-based Person Retrieval (TPR) aims to retrieve the target person images given a textual query.

Person Retrieval Retrieval +3

Paper
Add Code

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

1 code implementation • ICCV 2023 • Xiuzhe Wu, Pengfei Hu, Yang Wu, Xiaoyang Lyu, Yan-Pei Cao, Ying Shan, Wenming Yang, Zhongqian Sun, Xiaojuan Qi

Therefore, directly learning a mapping function from speech to the entire head image is prone to ambiguity, particularly when using a short video for training.

Image Generation

Paper
Code

DiffI2I: Efficient Diffusion Model for Image-to-Image Translation

no code implementations • 26 Aug 2023 • Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, Radu Timotfe, Luc van Gool

Compared to traditional DMs, the compact IPR enables DiffI2I to obtain more accurate outcomes and employ a lighter denoising network and fewer iterations.

Denoising Image-to-Image Translation +2

Paper
Add Code

Dynamic Low-Rank Instance Adaptation for Universal Neural Image Compression

1 code implementation • 15 Aug 2023 • Yue Lv, Jinxi Xiang, Jun Zhang, Wenming Yang, Xiao Han, Wei Yang

We thus introduce a dynamic gating network on top of the low-rank adaptation method, in order to decide which decoder layer should employ adaptation.

Decoder Image Compression

Paper
Code

Double-chain Constraints for 3D Human Pose Estimation in Images and Videos

1 code implementation • 10 Aug 2023 • Hongbo Kang, Yong Wang, Mengyuan Liu, Doudou Wu, Peng Liu, Wenming Yang

Notably, our model achieves state-of-the-art performance on all action categories in the Human3. 6M dataset using detected 2D poses from CPN, and our code is available at: https://github. com/KHB1698/DC-GCT.

Ranked #32 on 3D Human Pose Estimation on MPI-INF-3DHP (AUC metric)

Monocular 3D Human Pose Estimation

Paper
Code

Dual Arbitrary Scale Super-Resolution for Multi-Contrast MRI

1 code implementation • 5 Jul 2023 • Jiamiao Zhang, Yichen Chi, Jun Lyu, Wenming Yang, Yapeng Tian

Limited by imaging systems, the reconstruction of Magnetic Resonance Imaging (MRI) images from partial measurement is essential to medical imaging research.

Decoder Super-Resolution

Paper
Code

Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution

no code implementations • 29 May 2023 • Ruofan Zhang, Jinjin Gu, Haoyu Chen, Chao Dong, Yulun Zhang, Wenming Yang

In this work, we introduce a novel approach to craft training degradation distributions using a small set of reference images.

Super-Resolution

Paper
Add Code

EgoVSR: Towards High-Quality Egocentric Video Super-Resolution

1 code implementation • 24 May 2023 • Yichen Chi, Junhao Gu, Jiamiao Zhang, Wenming Yang, Yapeng Tian

We explicitly tackle motion blurs in egocentric videos using a Dual Branch Deblur Network (DB$^2$Net) in the VSR framework.

Video Super-Resolution

Paper
Code

DiffIR: Efficient Diffusion Model for Image Restoration

1 code implementation • ICCV 2023 • Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, Luc van Gool

Diffusion model (DM) has achieved SOTA performance by modeling the image synthesis process into a sequential application of a denoising network.

Denoising Image Generation +1

388

Paper
Code

Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection

no code implementations • 13 Feb 2023 • Yanjun Liu, Wenming Yang

Instead of using ground-truth labels as direct supervision, our relative and corner loss are derived from the homogeneous transformation, which renders the model to learn the geometric consistency between objects.

3D Object Detection Graph Generation +5

Paper
Add Code

MVKT-ECG: Efficient Single-lead ECG Classification on Multi-Label Arrhythmia by Multi-View Knowledge Transferring

no code implementations • 28 Jan 2023 • Yuzhen Qin, Li Sun, Hui Chen, Wei-Qiang Zhang, Wenming Yang, Jintao Fei, Guijin Wang

However, it is challenging to develop a single-lead-based ECG interpretation model for multiple diseases diagnosis due to the lack of some key disease information.

ECG Classification Knowledge Distillation

Paper
Add Code

Local and Global Logit Adjustments for Long-Tailed Learning

no code implementations • ICCV 2023 • Yingfan Tao, Jingna Sun, Hao Yang, Li Chen, Xu Wang, Wenming Yang, Daniel Du, Min Zheng

LGLA consists of two core components: a Class-aware Logit Adjustment (CLA) strategy and an Adaptive Angular Weighted (AAW) loss.

Paper
Add Code

Knowledge Distillation based Degradation Estimation for Blind Super-Resolution

1 code implementation • 30 Nov 2022 • Bin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc van Gool

It consists of a knowledge distillation based implicit degradation estimator network (KD-IDE) and an efficient SR network.

Blind Super-Resolution Image Super-Resolution +1

134

Paper
Code

A Dual-scale Lead-seperated Transformer With Lead-orthogonal Attention And Meta-information For Ecg Classification

no code implementations • 23 Nov 2022 • Yang Li, Guijin Wang, Zhourui Xia, Wenming Yang, Li Sun

Auxiliary diagnosis of cardiac electrophysiological status can be obtained through the analysis of 12-lead electrocardiograms (ECGs).

ECG Classification

Paper
Add Code

Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images

no code implementations • 9 Oct 2022 • Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan

We finally use a guided fusion operation to integrate the sharp edges generated by the network and flat areas by the interpolation method to get the final SR image.

Quantization Super-Resolution

Paper
Add Code

Basic Binary Convolution Unit for Binarized Image Restoration Network

2 code implementations • 2 Oct 2022 • Bin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc van Gool

In this study, we reconsider components in binary convolution, such as residual connection, BatchNorm, activation function, and structure, for IR tasks.

Binarization Image Restoration +1

114

Paper
Code

Meta-Learning based Degradation Representation for Blind Super-Resolution

1 code implementation • 28 Jul 2022 • Bin Xia, Yapeng Tian, Yulun Zhang, Yucheng Hang, Wenming Yang, Qingmin Liao

The most of CNN based super-resolution (SR) methods assume that the degradation is known (\eg, bicubic).

Blind Super-Resolution Knowledge Distillation +2

Paper
Code

Structured Sparsity Learning for Efficient Video Super-Resolution

1 code implementation • CVPR 2023 • Bin Xia, Jingwen He, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Luc van Gool

In SSL, we design pruning schemes for several key components in VSR models, including residual blocks, recurrent networks, and upsampling networks.

Video Super-Resolution

Paper
Code

SCS-Co: Self-Consistent Style Contrastive Learning for Image Harmonization

1 code implementation • CVPR 2022 • Yucheng Hang, Bin Xia, Wenming Yang, Qingmin Liao

In addition, we propose a background-attentional adaptive instance normalization (BAIN) to achieve an attention-weighted background feature distribution according to the foreground-background feature similarity.

Contrastive Learning Image Harmonization

Paper
Code

STDAN: Deformable Attention Network for Space-Time Video Super-Resolution

1 code implementation • 14 Mar 2022 • Hai Wang, Xiaoyu Xiang, Yapeng Tian, Wenming Yang, Qingmin Liao

Second, we put forward a spatial-temporal deformable feature aggregation (STDFA) module, in which spatial and temporal contexts in dynamic video frames are adaptively captured and aggregated to enhance SR reconstruction.

Space-time Video Super-resolution Video Super-Resolution

Paper
Code

Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-based Super-Resolution

1 code implementation • 12 Jan 2022 • Bin Xia, Yapeng Tian, Yucheng Hang, Wenming Yang, Qingmin Liao, Jie zhou

To improve matching efficiency, we design a novel Embedded PatchMacth scheme with random samples propagation, which involves end-to-end training with asymptotic linear computational cost to the input size.

Reference-based Super-Resolution

Paper
Code

Efficient Non-Local Contrastive Attention for Image Super-Resolution

1 code implementation • 11 Jan 2022 • Bin Xia, Yucheng Hang, Yapeng Tian, Wenming Yang, Qingmin Liao, Jie zhou

To demonstrate the effectiveness of ENLCA, we build an architecture called Efficient Non-Local Contrastive Network (ENLCN) by adding a few of our modules in a simple backbone.

Contrastive Learning Feature Correlation +1

Paper
Code

Group Fisher Pruning for Practical Network Compression

2 code implementations • 2 Aug 2021 • Liyang Liu, Shilong Zhang, Zhanghui Kuang, Aojun Zhou, Jing-Hao Xue, Xinjiang Wang, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Our method can be used to prune any structures including those with coupled channels.

Ranked #4 on Network Pruning on ImageNet

Image Classification Network Pruning +2

150

Paper
Code

NTIRE 2021 Challenge on Perceptual Image Quality Assessment

no code implementations • 7 May 2021 • Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte, Manri Cheon, SungJun Yoon, Byungyeon Kang, Junwoo Lee, Qing Zhang, Haiyang Guo, Yi Bin, Yuqing Hou, Hengliang Luo, Jingyu Guo, ZiRui Wang, Hai Wang, Wenming Yang, Qingyan Bai, Shuwei Shi, Weihao Xia, Mingdeng Cao, Jiahao Wang, Yifan Chen, Yujiu Yang, Yang Li, Tao Zhang, Longtao Feng, Yiting Liao, Junlin Li, William Thong, Jose Costa Pereira, Ales Leonardis, Steven McDonagh, Kele Xu, Lehan Yang, Hengxing Cai, Pengfei Sun, Seyed Mehdi Ayyoubzadeh, Ali Royat, Sid Ahmed Fezza, Dounia Hammou, Wassim Hamidouche, Sewoong Ahn, Gwangjin Yoon, Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa

This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2021.

Image Quality Assessment Image Restoration

Paper
Add Code

ER-IQA: Boosting Perceptual Quality Assessment Using External Reference Images

no code implementations • 6 May 2021 • Jingyu Guo, Wei Wang, Wenming Yang, Qingmin Liao, Jie zhou

In this paper, we introduce a brand new scheme, namely external-reference image quality assessment (ER-IQA), by introducing external reference images to bridge the gap between FR and NR-IQA.

Image Quality Assessment NR-IQA

Paper
Add Code

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.

Paper
Code

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

1 code implementation • 26 Mar 2021 • Wenhao Li, Hong Liu, Runwei Ding, Mengyuan Liu, Pichao Wang, Wenming Yang

The modified VTE is termed as Strided Transformer Encoder (STE), which is built upon the outputs of VTE.

Ranked #2 on 3D Human Pose Estimation on HumanEva-I

Monocular 3D Human Pose Estimation

331

Paper
Code

Towards Impartial Multi-task Learning

2 code implementations • ICLR 2021 • Liyang Liu, Yi Li, Zhanghui Kuang, Jing-Hao Xue, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Multi-task learning (MTL) has been widely used in representation learning.

Multi-Task Learning Representation Learning

195

Paper
Code

Attention Cube Network for Image Restoration

1 code implementation • 13 Sep 2020 • Yucheng Hang, Qingmin Liao, Wenming Yang, Yupeng Chen, Jie zhou

The adaptive spatial attention branch (ASAB) and the adaptive channel attention branch (ACAB) constitute the adaptive dual attention module (ADAM), which can capture the long-range spatial and channel-wise contextual information to expand the receptive field and distinguish different types of information for more effective feature representations.

Feature Correlation Image Restoration

Paper
Code

Real-MFF: A Large Realistic Multi-focus Image Dataset with Ground Truth

no code implementations • 28 Mar 2020 • Juncheng Zhang, Qingmin Liao, Shaojun Liu, Haoyu Ma, Wenming Yang, Jing-Hao Xue

In this letter, we introduce a large and realistic multi-focus dataset called Real-MFF, which contains 710 pairs of source images with corresponding ground truth images.

Paper
Add Code

LCSCNet: Linear Compressing Based Skip-Connecting Network for Image Super-Resolution

1 code implementation • 9 Sep 2019 • Wenming Yang, Xuechen Zhang, Yapeng Tian, Wei Wang, Jing-Hao Xue, Qingmin Liao

In this paper, we develop a concise but efficient network architecture called linear compressing based skip-connecting network (LCSCNet) for image super-resolution.

Ranked #14 on Image Super-Resolution on Set14 - 3x upscaling

Image Super-Resolution

Paper
Code

CFSNet: Toward a Controllable Feature Space for Image Restoration

1 code implementation • ICCV 2019 • Wei Wang, Ruiming Guo, Yapeng Tian, Wenming Yang

Deep learning methods have witnessed the great progress in image restoration with specific metrics (e. g., PSNR, SSIM).

Image Restoration Image Super-Resolution +1

Paper
Code

Lightweight Feature Fusion Network for Single Image Super-Resolution

2 code implementations • 15 Feb 2019 • Wenming Yang, Wei Wang, Xuechen Zhang, Shuifa Sun, Qingmin Liao

Specifically, a spindle block is composed of a dimension extension unit, a feature exploration unit and a feature refinement unit.

Ranked #11 on Image Super-Resolution on Manga109 - 3x upscaling

Image Super-Resolution

Paper
Code

Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax

1 code implementation • 11 Dec 2018 • Peng Lu, Gao Huang, Hangyu Lin, Wenming Yang, Guodong Guo, Yanwei Fu

This paper proposes a novel approach for Sketch-Based Image Retrieval (SBIR), for which the key is to bridge the gap between sketches and photos in terms of the data representation.

Retrieval Sketch-Based Image Retrieval

Paper
Code

Deep Learning for Single Image Super-Resolution: A Brief Review

1 code implementation • 9 Aug 2018 • Wenming Yang, Xuechen Zhang, Yapeng Tian, Wei Wang, Jing-Hao Xue

Single image super-resolution (SISR) is a notoriously challenging ill-posed problem, which aims to obtain a high-resolution (HR) output from one of its low-resolution (LR) versions.

Efficient Neural Network Image Super-Resolution

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.