no code implementations • 4 Feb 2024 • Haochen Liu, Zhiyu Huang, Wenhui Huang, Haohan Yang, Xiaoyu Mo, Chen Lv
First, we introduce marginal-conditioned occupancy prediction to align joint occupancy with agent-wise perceptions.
no code implementations • 1 Jan 2024 • Ruizhuo Xu, Ke Wang, Chao Deng, Mei Wang, Xi Chen, Wenhui Huang, Junlan Feng, Weihong Deng
With the increasing availability of consumer depth sensors, 3D face recognition (FR) has attracted more and more attention.
1 code implementation • IEEE Transactions on Neural Networks and Learning Systems 2023 • Wenhui Huang, Cong Zhang, Jingda Wu, Xiangkun He, Jie Zhang, Chen Lv.
Stochastic exploration is the key to the success of the Deep Q-network (DQN) algorithm.
1 code implementation • 1 Jan 2023 • Wenhui Huang, Yanxin Zhou, Xiangkun He, Chen Lv
Despite some successful applications of goal-driven navigation, existing deep reinforcement learning (DRL)-based approaches notoriously suffers from poor data efficiency issue.
1 code implementation • 17 Sep 2022 • Yizeng Han, Yifan Pu, Zihang Lai, Chaofei Wang, Shiji Song, Junfen Cao, Wenhui Huang, Chao Deng, Gao Huang
Intuitively, easy samples, which generally exit early in the network during inference, should contribute more to training early classifiers.
no code implementations • 1 Jul 2022 • Jingda Wu, Wenhui Huang, Niels de Boer, Yanghui Mo, Xiangkun He, Chen Lv
Decisions made by human subjects in a driving simulator are treated as safe demonstrations, which are stored into the replay buffer and then utilized to enhance the training process of RL.
1 code implementation • 20 Jun 2022 • Wenhui Huang, Cong Zhang, Jingda Wu, Xiangkun He, Jie Zhang, Chen Lv
We theoretically prove that the policy improvement theorem holds for the preference-guided $\epsilon$-greedy policy and experimentally show that the inferred action preference distribution aligns with the landscape of corresponding Q-values.
1 code implementation • 9 Jan 2022 • Gao Huang, Yulin Wang, Kangchen Lv, Haojun Jiang, Wenhui Huang, Pengfei Qi, Shiji Song
Spatial redundancy widely exists in visual recognition tasks, i. e., discriminative features in an image or video frame usually correspond to only a subset of pixels, while the remaining regions are irrelevant to the task at hand.
1 code implementation • 26 Sep 2021 • Jingda Wu, Zhiyu Huang, Wenhui Huang, Chen Lv
A novel prioritized experience replay mechanism that adapts to human guidance in the reinforcement learning process is proposed to boost the efficiency and performance of the reinforcement learning algorithm.
no code implementations • 24 Mar 2021 • Zhongxu Hu, Chen Lv, Yanxin Zhou, Yiran Zhang, Wenhui Huang
To handle the error of the head pose estimation model, this paper proposes an adaptive Kalman Filter.
no code implementations • 12 Jan 2020 • Wenhui Huang, Francesco Braghin, Zhuo Wang
Therefore, we propose an apprenticeship learning in combination with deep reinforcement learning approach that allows the agent to learn the driving and stopping behaviors with continuous actions.
Robotics