no code implementations • 20 Dec 2023 • Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu, Fang-Lue Zhang, Song-Hai Zhang
Self-supervised monocular depth estimation is important for applications spanning autonomous driving and robotics.
1 code implementation • 14 Dec 2023 • Zi-Xin Zou, Zhipeng Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Yan-Pei Cao, Song-Hai Zhang
Recent advancements in 3D reconstruction from single images have been driven by the evolution of generative models.
no code implementations • 14 Dec 2023 • Ying-Tian Liu, Yuan-Chen Guo, Guan Luo, Heyi Sun, Wei Yin, Song-Hai Zhang
However, the generation quality and generalization ability of 3D diffusion models are hindered by the scarcity of high-quality, large-scale 3D datasets.
no code implementations • 6 Dec 2023 • Yunhan Yang, Yukun Huang, Xiaoyang Wu, Yuan-Chen Guo, Song-Hai Zhang, Hengshuang Zhao, Tong He, Xihui Liu
However, due to the lack of information from multiple views, these works encounter difficulties in generating controllable novel views.
no code implementations • 30 Oct 2023 • Xin Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Song-Hai Zhang, Xiaojuan Qi
In this paper, we re-evaluate the role of classifier-free guidance in score distillation and discover a surprising finding: the guidance alone is enough for effective text-to-3D generation tasks.
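For readers unfamiliar with the term, the sketch below shows the widely used classifier-free guidance formula and isolates the guidance term that the abstract refers to. It is an illustration only, not the authors' implementation: the function names `guidance_direction` and `cfg_prediction` are hypothetical, and noise predictions are stood in for by plain lists of floats.

```python
from typing import List


def guidance_direction(eps_cond: List[float], eps_uncond: List[float]) -> List[float]:
    """Guidance term: conditional minus unconditional noise prediction."""
    return [c - u for c, u in zip(eps_cond, eps_uncond)]


def cfg_prediction(eps_cond: List[float], eps_uncond: List[float], scale: float) -> List[float]:
    """Standard classifier-free guidance.

    Returns eps_uncond + scale * (eps_cond - eps_uncond).
    scale = 0 gives the unconditional prediction, scale = 1 the conditional one.
    """
    delta = guidance_direction(eps_cond, eps_uncond)
    return [u + scale * d for u, d in zip(eps_uncond, delta)]


if __name__ == "__main__":
    # Toy values standing in for the outputs of a diffusion model (assumed).
    cond, uncond = [1.0, 2.0], [0.0, 1.0]
    print(guidance_direction(cond, uncond))
    print(cfg_prediction(cond, uncond, scale=3.0))
```

In this notation, the abstract's claim concerns using the output of `guidance_direction` on its own as the update signal, rather than the full guided prediction.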
1 code implementation • 23 Oct 2023 • Xiaoxiao Long, Yuan-Chen Guo, Cheng Lin, YuAn Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang
In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images. Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry from 2D diffusion priors, but they typically suffer from time-consuming per-shape optimization and inconsistent geometry.
no code implementations • NeurIPS 2023 • Zheng Chen, Yan-Pei Cao, Yuan-Chen Guo, Chen Wang, Ying Shan, Song-Hai Zhang
Unlike generalizable radiance fields trained on perspective images, PanoGRF avoids the information loss from panorama-to-perspective conversion and directly aggregates geometry and appearance features of 3D sample points from each panoramic view based on spherical projection.
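As a rough illustration of spherical projection, the sketch below maps a 3D point in a panorama camera's frame to equirectangular pixel coordinates. The axis convention (x right, y down, z forward), the image layout, and the function name `project_to_panorama` are assumptions made for this example; the paper may use a different convention.

```python
import math
from typing import Tuple


def project_to_panorama(point: Tuple[float, float, float],
                        width: int, height: int) -> Tuple[float, float]:
    """Project a camera-frame 3D point onto an equirectangular image.

    Assumed convention: x right, y down, z forward; the image centre
    corresponds to the forward direction (0, 0, 1).
    """
    x, y, z = point
    r = math.sqrt(x * x + y * y + z * z)
    if r == 0.0:
        raise ValueError("cannot project the camera centre")
    lon = math.atan2(x, z)      # longitude in [-pi, pi]
    lat = math.asin(y / r)      # latitude in [-pi/2, pi/2]
    u = (lon / (2.0 * math.pi) + 0.5) * width
    v = (lat / math.pi + 0.5) * height
    return u, v


if __name__ == "__main__":
    # A point straight ahead lands at the image centre.
    print(project_to_panorama((0.0, 0.0, 1.0), 1024, 512))
```

Features for a 3D sample point could then be looked up at `(u, v)` in each panoramic view, with no intermediate conversion to perspective images.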
1 code implementation • CVPR 2023 • Ying-Tian Liu, Zhifei Zhang, Yuan-Chen Guo, Matthew Fisher, Zhaowen Wang, Song-Hai Zhang
Automatic generation of fonts can be an important aid to typeface design.
no code implementations • 28 Mar 2023 • Yuan-Chen Guo, Yan-Pei Cao, Chen Wang, Yu He, Ying Shan, XiaoHu Qie, Song-Hai Zhang
With the emergence of neural radiance fields (NeRFs), view synthesis quality has reached an unprecedented level.
no code implementations • ICCV 2023 • Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang
To address these issues, we present MBPTrack, which adopts a Memory mechanism to utilize past information and formulates localization in a coarse-to-fine scheme using Box Priors given in the first frame.
no code implementations • ICCV 2023 • Chia-Hao Chen, Ying-Tian Liu, Zhifei Zhang, Yuan-Chen Guo, Song-Hai Zhang
Existing vector font generation approaches either struggle to preserve high-frequency corner details of the glyph or produce vector shapes that have redundant segments, which hinders their applications in practical scenarios.
no code implementations • CVPR 2023 • Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang
Therefore, contextual information across two consecutive frames is crucial for effective object tracking.
no code implementations • 12 Sep 2022 • Zheng Chen, Chen Wang, Yuan-Chen Guo, Song-Hai Zhang
Neural Radiance Fields (NeRF) achieve photo-realistic view synthesis with densely captured input images.
no code implementations • 21 Jul 2022 • Tian-Xing Xu, Yuan-Chen Guo, Yong-Liang Yang, Song-Hai Zhang
Point clouds captured by depth sensors are often contaminated by noise, obstructing further analysis and applications.
no code implementations • 10 Dec 2021 • Ying-Tian Liu, Yuan-Chen Guo, Song-Hai Zhang
Is the center position fully capable of representing a pixel?
1 code implementation • 3 Dec 2021 • Chen Wang, Xian Wu, Yuan-Chen Guo, Song-Hai Zhang, Yu-Wing Tai, Shi-Min Hu
We present NeRF-SR, a solution for high-resolution (HR) novel view synthesis with mostly low-resolution (LR) inputs.
no code implementations • CVPR 2022 • Yuan-Chen Guo, Di Kang, Linchao Bao, Yu He, Song-Hai Zhang
Specifically, we propose to split a scene into transmitted and reflected components, and model the two components with separate neural radiance fields.
no code implementations • 9 Jul 2021 • Yuan Xue, Yuan-Chen Guo, Han Zhang, Tao Xu, Song-Hai Zhang, Xiaolei Huang
In many applications of computer graphics, art and design, it is desirable for a user to provide intuitive non-image input, such as text, sketch, stroke, graph or layout, and have a computer system automatically generate photo-realistic images that adhere to the input content.
no code implementations • 16 Jun 2021 • Ying-Tian Liu, Yuan-Chen Guo, Yi-Xiao Li, Chen Wang, Song-Hai Zhang
In this paper, we present a novel implicit glyph shape representation, which models glyphs as shape primitives enclosed by quadratic curves, and naturally enables generating glyph images at arbitrary high resolutions.
1 code implementation • 25 May 2021 • Tian-Xing Xu, Yuan-Chen Guo, Zhiqiang Li, Ge Yu, Yu-Kun Lai, Song-Hai Zhang
Place recognition plays an essential role in the field of autonomous driving and robot navigation.
Ranked #4 on 3D Place Recognition on CS-Campus3D
1 code implementation • CVPR 2021 • Song-Hai Zhang, Yuan-Chen Guo, Qing-Wen Gu
We investigate the problem of generating 3D meshes from single free-hand sketches, aiming at fast 3D modeling for novice users.