Search Results for author: Ke Gong

Found 14 papers, 7 papers with code

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

no code implementations • 18 Dec 2023 • Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Moreover, to facilitate disentangled representation learning, we introduce four well-designed constraints: an auxiliary style classifier, an auxiliary inverse classifier, a content contrastive loss, and a pair of latent cycle losses, which can effectively contribute to the construction of the identity-related style space and semantic-related content space.

Disentanglement

Paper
Add Code

Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition

no code implementations • Findings (EMNLP) 2021 • Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin

Specifically, we unify a pre-trained acoustic model (wav2vec 2. 0) and a language model (BERT) into an end-to-end trainable framework.

Language Modelling Representation Learning +2

Paper
Add Code

Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer

2 code implementations • 26 Jan 2021 • Liang Lin, Yiming Gao, Ke Gong, Meng Wang, Xiaodan Liang

Prior highly-tuned image parsing models are usually studied in a certain domain with a specific set of semantic labels and can hardly be adapted into other scenarios (e. g., sharing discrepant label granularity) without extensive re-training.

Graph Representation Learning Human Parsing +2

291

Paper
Code

Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition

no code implementations • 22 Dec 2020 • Yubei Xiao, Ke Gong, Pan Zhou, Guolin Zheng, Xiaodan Liang, Liang Lin

When sampling tasks in MML-ASR, AMS adaptively determines the task sampling probability for each source language.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Photometric Multi-View Mesh Refinement for High-Resolution Satellite Images

no code implementations • 10 May 2020 • Mathias Rothermel, Ke Gong, Dieter Fritsch, Konrad Schindler, Norbert Haala

Modern high-resolution satellite sensors collect optical imagery with ground sampling distances (GSDs) of 30-50cm, which has sparked a renewed interest in photogrammetric 3D surface reconstruction from satellite data.

Surface Reconstruction Vocal Bursts Intensity Prediction

Paper
Add Code

Bidirectional Graph Reasoning Network for Panoptic Segmentation

no code implementations • CVPR 2020 • Yangxin Wu, Gengwei Zhang, Yiming Gao, Xiajun Deng, Ke Gong, Xiaodan Liang, Liang Lin

We introduce a Bidirectional Graph Reasoning Network (BGRNet), which incorporates graph structure into the conventional panoptic segmentation network to mine the intra-modular and intermodular relations within and between foreground things and background stuff classes.

Instance Segmentation Panoptic Segmentation +1

Paper
Add Code

Layout-Graph Reasoning for Fashion Landmark Detection

no code implementations • CVPR 2019 • Weijiang Yu, Xiaodan Liang, Ke Gong, Chenhan Jiang, Nong Xiao, Liang Lin

Each Layout-Graph Reasoning(LGR) layer aims to map feature representations into structural graph nodes via a Map-to-Node module, performs reasoning over structural graph nodes to achieve global layout coherency via a layout-graph reasoning module, and then maps graph nodes back to enhance feature representations via a Node-to-Map module.

Attribute Clustering +1

Paper
Add Code

Graphonomy: Universal Human Parsing via Graph Transfer Learning

1 code implementation • CVPR 2019 • Ke Gong, Yiming Gao, Xiaodan Liang, Xiaohui Shen, Meng Wang, Liang Lin

By distilling universal semantic graph representation to each specific task, Graphonomy is able to predict all levels of parsing labels in one system without piling up the complexity.

Human Parsing Transfer Learning

291

Paper
Code

End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis

1 code implementation • 30 Jan 2019 • Lin Xu, Qixian Zhou, Ke Gong, Xiaodan Liang, Jianheng Tang, Liang Lin

Besides the challenges for conversational dialogue systems (e. g. topic transition coherency and question understanding), automatic medical diagnosis further poses more critical requirements for the dialogue rationality in the context of medical knowledge and symptom-disease relations.

Decision Making Dialogue Management +5

Paper
Code

Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis

no code implementations • NeurIPS 2018 • Haoye Dong, Xiaodan Liang, Ke Gong, Hanjiang Lai, Jia Zhu, Jian Yin

Despite remarkable advances in image synthesis research, existing works often fail in manipulating images under the context of large geometric transformations.

Generative Adversarial Network Image Generation

Paper
Add Code

Adaptive Temporal Encoding Network for Video Instance-level Human Parsing

1 code implementation • 2 Aug 2018 • Qixian Zhou, Xiaodan Liang, Ke Gong, Liang Lin

Beyond the existing single-person and multiple-person human parsing tasks in static images, this paper makes the first attempt to investigate a more realistic video instance-level human parsing that simultaneously segments out each person instance and parses each instance into more fine-grained parts (e. g., head, leg, dress).

Human Parsing Segmentation +4

Paper
Code

Instance-level Human Parsing via Part Grouping Network

1 code implementation • ECCV 2018 • Ke Gong, Xiaodan Liang, Yicheng Li, Yimin Chen, Ming Yang, Liang Lin

Instance-level human parsing towards real-world human analysis scenarios is still under-explored due to the absence of sufficient data resources and technical difficulty in parsing multiple instances in a single pass.

Ranked #6 on Human Part Segmentation on CIHP

Edge Detection Human Parsing +2

412

Paper
Code

Look into Person: Joint Body Parsing & Pose Estimation Network and A New Benchmark

3 code implementations • 5 Apr 2018 • Xiaodan Liang, Ke Gong, Xiaohui Shen, Liang Lin

To further explore and take advantage of the semantic correlation of these two tasks, we propose a novel joint human parsing and pose estimation network to explore efficient context modeling, which can simultaneously predict parsing and pose with extremely high quality.

Ranked #10 on Semantic Segmentation on LIP val

Human Parsing Pose Estimation +1

372

Paper
Code

Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing

1 code implementation • CVPR 2017 • Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, Liang Lin

Human parsing has recently attracted a lot of research interests due to its huge application potentials.

Ranked #13 on Semantic Segmentation on LIP val

Human Parsing Self-Supervised Learning +1

228

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.