About Me
I am presently a postdoctoral researcher at Harvard University. My research interests include artificial intelligence, with a focus on multi-modal learning, generative models, 3D vision, and their applications in biomedical fields.
Education
- Ph.D in Computer Science, University of Technology Sydney, Australia, 2021. Supervised by Prof. Yi Yang.
- B.S. in Electronic and Information Engineering, University of Science and Technology of China, China, 2018.
Selected Publications
CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation
Kexin Li, Zongxin Yang✉, Lei Chen, Yi Yang, Jun Xiao
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
Zongxin Yang, Guikun Chen, Xiaodi Li, Wenguan Wang, Yi Yang
ICML 2024. [Proj. page] [PDF] [Code]
Scalable Video Object Segmentation with Identification Mechanism
Zongxin Yang, Jiaxu Miao, Yunchao Wei, Wenguan Wang, Xiaohan Wang, Yi Yang
Controllable 3D Face Generation with Conditional Style Code Diffusion
Xiaolong Shen, Jianxin Ma, Chang Zhou, Zongxin Yang✉
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Zongxin Yang, Yi Yang
Associating Objects with Transformers for Video Object Segmentation
Zongxin Yang, Yunchao Wei, Yi Yang
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration
Zongxin Yang, Yunchao Wei, Yi Yang
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency
Zongxin Yang, Xin Yu, Yi Yang
CVPR 2021. [PDF]
Collaborative Video Object Segmentation by Foreground-Background Integration
Zongxin Yang, Yunchao Wei, Yi Yang
Gated Channel Transformation for Visual Recognition
Zongxin Yang, Linchao Zhu, Yu Wu, Yi Yang
Very Long Natural Scenery Image Prediction by Outpainting
Zongxin Yang, Jian Dong, Ping Liu, Yi Yang, Shuicheng Yan
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
Zechuan Zhang, Zongxin Yang✉, Yi Yang
SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance
Yuanyou Xu, Zongxin Yang, Yi Yang
Human101: Training 100+ FPS Human Gaussians in 100s from 1 View
Mingwei Li, Jiachen Tao, Zongxin Yang, Yi Yang
GD2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields
Xiao Pan, Zongxin Yang✉, Shuai Bai, Yi Yang
Preprint. [PDF]
AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion
Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia
Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction
Zechuan Zhang, Li Sun, Zongxin Yang, Lin Chen, Yi Yang
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
Yuanyou Xu, Zongxin Yang, Yi Yang
Efficient Emotional Adaptation for Audio-driven Talking-Head Generation
Yuan Gan, Zongxin Yang, Xihang Yue, Lingyun Sun, Yi Yang
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering
Xiao Pan, Zongxin Yang, Jianxin Ma, Chang Zhou, Yi Yang
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery
Jiahao Li, Zongxin Yang, Xiaohan Wang, Jianxin Ma, Chang Zhou, Yi Yang
Segment and Track Anything
Yangming Cheng, Liulei Li, Yuanyou Xu, Xiaodi Li, Zongxin Yang*, Wenguan Wang, Yi Yang
Video Object Segmentation in Panoptic Wild Scenes
Yuanyou Xu, Zongxin Yang, Yi Yang
Pyramid Diffusion Models For Low-light Image Enhancement
Dewei Zhou, Zongxin Yang, Yi Yang
Co-Learning Meets Stitch-Up for Noisy Multi-Label Visual Recognition
Chao Liang, Zongxin Yang, Linchao Zhu, Yi Yang
TIP 2023. [PDF]
FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation
Jiaxu Miao, Zongxin Yang, Leilei Fan, Yi Yang
CVPR 2023. [PDF]
Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation
Xiaolong Shen, Zongxin Yang, Xiaohan Wang, Jianxin Ma, Chang Zhou, Yi Yang
ProD: Prompting-to-disentangle Domain Knowledge for Cross-domain Few-shot Image Classification
Tianyi Ma, Yifan Sun, Zongxin Yang, Yi Yang
CVPR 2023. [PDF]
Decompose to Generalize: Species-Generalized Animal Pose Estimation
Guangrui Li, Yifan Sun, Zongxin Yang, Yi Yang
ICLR 2023. [PDF]
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation
Feng Zhu, Zongxin Yang, Yunchao Wei, Xin Yu, Yi Yang
In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
Xiao Pan, Peike Li, Zongxin Yang, Huiling Zhou, Chang Zhou, Hongxia Yang, Jingren Zhou, Yi Yang
ACM MM 2022. [PDF]
H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-domain Weakly Supervised Object Detection
Yunqiu Xu, Yifan Sun, Zongxin Yang, Jiaxu Miao, Yi Yang
Selected Awards
1st in the VOTS 2023 challenge. ICCV 2023. [Report]
1st in Semi-Supervised Video Object Segmentation of EPIC-Kitchens Dataset Challenges. CVPR 2023. [Report]
1st in TREK-150 Object Tracking of EPIC-Kitchens Dataset Challenges. CVPR 2023. [Report]
1st in the VOT 2022 real-time segmentation tracking challenge. ECCV 2022. [Report]
1st in the VOT 2022 short-term segmentation tracking challenge. ECCV 2022. [Report]
1st in eBay eProduct Visual Search Challenge. CVPR 2022. [Report]
1st (Track 1) in the 3rd Large-scale Video Object Segmentation Challenge. CVPR 2021. [Report]
1st (Track 3) in the 3rd Large-scale Video Object Segmentation Challenge. CVPR 2021. [Report]
Guo Moruo Scholarship. From USTC, 2018.