Search Results for author: Yichen Zhu

Instead of a simple combination of pruning and SD, EPSD enables the pruned network to favor SD by keeping more distillable weights before training to ensure better distillation of the pruned network.

Knowledge Distillation Network Pruning +1

Paper
Add Code

Infinite-Horizon Graph Filters: Leveraging Power Series to Enhance Sparse Information Aggregation

1 code implementation • 18 Jan 2024 • Ruizhe Zhang, Xinke Jiang, Yuchen Fang, Jiayuan Luo, Yongxin Xu, Yichen Zhu, Xu Chu, Junfeng Zhao, Yasha Wang

Graph Neural Networks (GNNs) have shown considerable effectiveness in a variety of graph learning tasks, particularly those based on the message-passing approach in recent years.

Graph Learning Node Classification

Paper
Code

Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

no code implementations • 8 Jan 2024 • Minjie Zhu, Yichen Zhu, Jinming Li, Junjie Wen, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

The language-conditioned robotic manipulation aims to transfer natural language instructions into executable actions, from simple pick-and-place to tasks requiring intent recognition and visual reasoning.

Decision Making Intent Recognition +2

Paper
Add Code

Object-Centric Instruction Augmentation for Robotic Manipulation

no code implementations • 5 Jan 2024 • Junjie Wen, Yichen Zhu, Minjie Zhu, Jinming Li, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

Humans interpret scenes by recognizing both the identities and positions of objects in their observations.

Language Modelling Large Language Model +1

Paper
Add Code

LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model

1 code implementation • 4 Jan 2024 • Yichen Zhu, Minjie Zhu, Ning Liu, Zhicai Ou, Xiaofeng Mou, Jian Tang

In this paper, we introduce LLaVA-$\phi$ (LLaVA-Phi), an efficient multi-modal assistant that harnesses the power of the recently advanced small language model, Phi-2, to facilitate multi-modal dialogues.

Ranked #103 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

325

Paper
Code

Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective

no code implementations • 18 Dec 2023 • Wanying Wang, Yichen Zhu, Yirui Zhou, Chaomin Shen, Jian Tang, Zhiyuan Xu, Yaxin Peng, Yangchun Zhang

Generative Adversarial Imitation Learning (GAIL) stands as a cornerstone approach in imitation learning.

Imitation Learning

Paper
Add Code

MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models

1 code implementation • 29 Nov 2023 • Xin Liu, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao

The security concerns surrounding Large Language Models (LLMs) have been extensively explored, yet the safety of Multimodal Large Language Models (MLLMs) remains understudied.

Paper
Code

Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models

no code implementations • 15 Oct 2023 • Zijian Zhang, Luping Liu, Zhijie Lin, Yichen Zhu, Zhou Zhao

We propose the first unsupervised and learning-based method to identify interpretable directions in h-space of pre-trained diffusion models.

Paper
Add Code

3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding

no code implementations • 25 Jul 2023 • Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao

3D visual grounding aims to localize the target object in a 3D point cloud by a free-form language description.

3D visual grounding Object +3

Paper
Add Code

Revisiting Event-based Video Frame Interpolation

no code implementations • 24 Jul 2023 • Jiaben Chen, Yichen Zhu, Dongze Lian, Jiaqi Yang, Yifu Wang, Renrui Zhang, Xinhang Liu, Shenhan Qian, Laurent Kneip, Shenghua Gao

We therefore propose to incorporate RGB information in an event-guided optical flow refinement strategy.

Optical Flow Estimation Video Frame Interpolation

Paper
Add Code

Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding

1 code implementation • ICCV 2023 • Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao

To accomplish this, we design a novel semantic matching model that analyzes the semantic similarity between object proposals and sentences in a coarse-to-fine manner.

3D visual grounding Object +3

Paper
Code

Make A Long Image Short: Adaptive Token Length for Vision Transformers

no code implementations • 5 Jul 2023 • Qiqi Zhou, Yichen Zhu

The TLA enables ReViT to process images with the minimum sufficient number of tokens, reducing token numbers in the ViT model and improving inference speed.

Action Recognition Image Classification

Paper
Add Code

Prediction with Incomplete Data under Agnostic Mask Distribution Shift

no code implementations • 18 May 2023 • Yichen Zhu, Jian Yuan, Bo Jiang, Tao Lin, Haiming Jin, Xinbing Wang, Chenghu Zhou

We focus on the case where the underlying joint distribution of complete features and label is invariant, but the missing pattern, i. e., mask distribution may shift agnostically between training and testing.

Paper
Add Code

ScaleKD: Distilling Scale-Aware Knowledge in Small Object Detector

no code implementations • CVPR 2023 • Yichen Zhu, Qiqi Zhou, Ning Liu, Zhiyuan Xu, Zhicai Ou, Xiaofeng Mou, Jian Tang

Unlike existing works that struggle to balance the trade-off between inference speed and SOD performance, in this paper, we propose a novel Scale-aware Knowledge Distillation (ScaleKD), which transfers knowledge of a complex teacher model to a compact student model.

Knowledge Distillation object-detection +2

Paper
Add Code

Label-Guided Auxiliary Training Improves 3D Object Detector

1 code implementation • 24 Jul 2022 • Yaomin Huang, Xinmei Liu, Yichen Zhu, Zhiyuan Xu, Chaomin Shen, Zhengping Che, Guixu Zhang, Yaxin Peng, Feifei Feng, Jian Tang

Detecting 3D objects from point clouds is a practical yet challenging task that has attracted increasing attention recently.

3D Object Detection Object +1

Paper
Code

Optimal market completion through financial derivatives with applications to volatility risk

no code implementations • 16 Feb 2022 • Matt Davison, Marcos Escobar-Anel, Yichen Zhu

This paper investigates the optimal choices of financial derivatives to complete a financial market in the framework of stochastic volatility (SV) models.

Paper
Add Code

Derivatives-based portfolio decisions. An expected utility insight

no code implementations • 11 Jan 2022 • Marcos Escobar-Anel, Matt Davison, Yichen Zhu

This paper challenges the use of stocks in portfolio construction, instead we demonstrate that Asian derivatives, straddles, or baskets could be more convenient substitutes.

Management

Paper
Add Code

UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks

no code implementations • 6 Dec 2021 • Yichen Zhu, Weibin Meng, Ying Liu, Shenglin Zhang, Tao Han, Shimin Tao, Dan Pei

UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks

Paper
Add Code

Make A Long Image Short: Adaptive Token Length for Vision Transformers

no code implementations • 3 Dec 2021 • Yichen Zhu, Yuqin Zhu, Jie Du, Yi Wang, Zhicai Ou, Feifei Feng, Jian Tang

The TLA enables the ReViT to process the image with the minimum sufficient number of tokens during inference.

Action Recognition Image Classification

Paper
Add Code

Training BatchNorm Only in Neural Architecture Search and Beyond

no code implementations • 1 Dec 2021 • Yichen Zhu, Jie Du, Yuqin Zhu, Yi Wang, Zhicai Ou, Feifei Feng, Jian Tang

Critically, there is no effort to understand 1) why training BatchNorm only can find the perform-well architectures with the reduced supernet-training time, and 2) what is the difference between the train-BN-only supernet and the standard-train supernet.

Fairness Neural Architecture Search

Paper
Add Code

Networked Time Series Prediction with Incomplete Data via Generative Adversarial Network

no code implementations • 5 Oct 2021 • Yichen Zhu, Bo Jiang, Haiming Jin, Mengtian Zhang, Feng Gao, Jianqiang Huang, Tao Lin, Xinbing Wang

An important task in such applications is to predict the future values of a NETS based on its historical values and the underlying graph.

Generative Adversarial Network Management +2

Paper
Add Code

Student Customized Knowledge Distillation: Bridging the Gap Between Student and Teacher

no code implementations • ICCV 2021 • Yichen Zhu, Yi Wang

We formulate the knowledge distillation as a multi-task learning problem so that the teacher transfers knowledge to the student only if the student can benefit from learning such knowledge.

Image Classification Knowledge Distillation +4

Paper
Add Code

Classification Trees for Imbalanced and Sparse Data: Surface-to-Volume Regularization

no code implementations • 26 Apr 2020 • Yichen Zhu, Cheng Li, David B. Dunson

When data are limited in one or more of the classes, the estimated decision boundaries are often irregularly shaped due to the limited sample size, leading to poor generalization error.

General Classification

Paper
Add Code

Resizable Neural Networks

no code implementations • 25 Sep 2019 • Yichen Zhu, Xiangyu Zhang, Tong Yang, Jian Sun

We introduce the adaptive resizable networks as dynamic networks, which further improve the performance with less computational cost via data-dependent inference.

Data Augmentation Neural Architecture Search

Paper
Add Code

VAENAS: Sampling Matters in Neural Architecture Search

no code implementations • 25 Sep 2019 • Shizheng Qin, Yichen Zhu, Pengfei Hou, Xiangyu Zhang, Wenqiang Zhang, Jian Sun

In this paper, we propose a learnable sampling module based on variational auto-encoder (VAE) for neural architecture search (NAS), named as VAENAS, which can be easily embedded into existing weight sharing NAS framework, e. g., one-shot approach and gradient-based approach, and significantly improve the performance of searching results.

Neural Architecture Search

Paper
Add Code

CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

1 code implementation • 13 May 2019 • Huichu Zhang, Siyuan Feng, Chang Liu, Yaoyao Ding, Yichen Zhu, Zihan Zhou, Wei-Nan Zhang, Yong Yu, Haiming Jin, Zhenhui Li

The most commonly used open-source traffic simulator SUMO is, however, not scalable to large road network and large traffic flow, which hinders the study of reinforcement learning on traffic scenarios.

Multi-agent Reinforcement Learning reinforcement-learning +1

753

Paper
Code

Nonoverlap-Promoting Variable Selection

no code implementations • ICML 2018 • Pengtao Xie, Hongbao Zhang, Yichen Zhu, Eric Xing

Variable selection is a classic problem in machine learning (ML), widely used to find important explanatory factors, and improve generalization performance and interpretability of ML models.

Variable Selection

Paper
Add Code

Orthogonality-Promoting Distance Metric Learning: Convex Relaxation and Theoretical Analysis

no code implementations • ICML 2018 • Pengtao Xie, Wei Wu, Yichen Zhu, Eric P. Xing

In this paper, we address these three issues by (1) seeking convex relaxations of the original nonconvex problems so that the global optimal is guaranteed to be achievable; (2) providing a formal analysis on OPR's capability of promoting balancedness; (3) providing a theoretical analysis that directly reveals the relationship between OPR and generalization performance.

Metric Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.