|
OpenVid-1M: A Large-Scale Dataset for High-Quality Text-to-Video Generation
K. Nan*, R. Xie*, P. Zhou*, T. Fan, Z. Yang, Z. Chen, X. Li, J. Yang and Y. Tai*.
arXiv:2407.02371v1, 2024
arXiv
/
Website
/
Demo (High-res)
/
Dataset
/
Models
/
Code
|
|
AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation
R. Xie, Y. Tai*, C. Zhao, K. Zhang, Z. Zhang, J. Zhou, X. Ye, Q. Wang* and J. Yang.
arXiv:2404.01717, 2024
arXiv
/
Website
/
Code
|
|
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
X. Ji*, C. Lin, Z. Ding, Y. Tai, J. Yang, J. Zhu, X. Hu, J. Zhang, D. Luo and C. Wang.
arXiv:2406.18284, 2024
arXiv
|
|
DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation
Q. Wang, J. Zhang, C. Xu, W. Cao, Y. Tai, Y. Han, Y. Ge, H. Gu, C. Wang and Y. Fu.
arXiv:2403.17664, 2024
arXiv
|
|
HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation
S. Guan*, Y. Ge*, Y. Tai*, J. Yang, W. Li and M. You*.
European Conference on Computer Vision (ECCV), 2024
arXiv (Coming soon)
|
|
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
C. Xu, Y. Liu, J. Xing, W. Wang, M. Sun, J. Dan, T. Huang, S Li, Z. Cheng, Y. Tai, B. Sun
Computer Vision and Pattern Recognition (CVPR), 2024
Paper
/
Code
|
|
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
X. Peng, J. Zhu, B. Jiang, Y. Tai, D. Luo, J. Zhang, W. Lin, T. Jin, C. Wang, and R. Ji.
Computer Vision and Pattern Recognition (CVPR), 2024
arXiv
/
Website
|
|
Dynamic Frame Interpolation in Wavelet Domain
L. Kong, B. Jiang, D. Luo, W. Chu, Y. Tai, C. Wang, and J. Yang.
IEEE Trans. on Image Processing, 2023
arXiv
/
Evaluation code
|
|
Learning Versatile 3D Shape Generation with Improved AR Models
S. Luo, X. Qian, Y. Fu, Y. Zhang, Y. Tai, Z. Zhang, C. Wang, and X. Xue.
International Conference on Computer Vision (ICCV), 2023
arXiv
|
|
Learning Neural Proto-face Field for Disentangled 3D Face Modeling In the Wild
Z. Zhang, R. Chen, W. Cao, Y. Tai*, and C. Wang*
Computer Vision and Pattern Recognition (CVPR), 2023
Paper
|
|
Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space
T. Huang, Z. Ding, J. Zhang, Y. Tai, Z. Zhang, M. Chen, C. Wang, and Y. Liu
Computer Vision and Pattern Recognition (CVPR), 2023
Paper
|
|
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning
C. Xu, J. Zhang, J. Zhu, W. Chu, Y. Tai, C. Wang, and Y. Liu
Computer Vision and Pattern Recognition (CVPR), 2023
Paper (arXiv)
|
|
High-resolution Iterative Feedback Network for Camouflaged Object Detection
X. Hu, S. Wang, X. Qian, H. Dai, W. Ren, D. Luo, Y. Tai, and L. Shao
AAAI Conference on Artificial Intellige (AAAI), 2023
Paper (arXiv)
|
|
High-Resolution GAN Inversion for Degraded Images in Large Diverse Datasets
Y. Wang, C. Lin, D. Luo, Y. Tai, Z. Zhang and Y. Xie
AAAI Conference on Artificial Intellige (AAAI), 2023 [Oral]
Paper (arXiv)
|
|
3QNet: 3D Point Cloud Geometry Quantization Compression Network
T. Huang, J. Zhang, J. Chen, Z. Ding, Y. Tai, Z. Zhang, C. Wang, and Y. Liu
ACM Transactions on Graphics, 2022
Paper
|
|
ColorFormer: Image Colorization via Color Memory assisted Hybrid-attention Transformer
X. Ji*, B. Jiang*, D. Luo, G. Tao, W. Chu, Y. Tai*, Z. Xie, and C. Wang*
European Conference on Computer Vision (ECCV), 2022
Paper
/
Supp
|
|
StyleFace: Towards Identity-Disentangled Face Generation on Megapixels
Y. Luo, J. Yan, J. Zhu, K. He, W. Chu, Y. Tai, and C. Wang
European Conference on Computer Vision (ECCV), 2022
Paper
/
Supp
|
|
Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping
C. Xu*, J. Zhang*, Y. Han, G. Tian, X. Zeng, Y. Tai, Y. Wang, C. Wang, and Y. Liu
European Conference on Computer Vision (ECCV), 2022
Paper
/
Supp
/
Code
|
|
SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer
H. Zhou, Y. Cao, W. Chu, J. Zhu, L. Tong, Y. Tai, and C. Wang
European Conference on Computer Vision (ECCV), 2022
Paper (arXiv)
/
Supp
/
Code
|
|
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation
Z. Jiang, Y. Li, C. Yang, P. Gao, Y. Wang, Y. Tai, and C. Wang
European Conference on Computer Vision (ECCV), 2022
Paper (arXiv)
/
Code
|
|
Joint Learning Content and Degradation Aware Feature for Blind Super-Resolution
Y. Zhou*, C. Lin*, D. Luo, Y. Liu, Mingang Chen, Y. Tai*, and C. Wang*
ACM International Conference on Multimedia (ACM MM), 2022
Paper (arXiv)
|
|
AutoGAN-Synthesizer: Neural Architecture Searchfor Cross-Modality MRI Synthesis
X. Hu, R. Shen, D. Luo, Y. Tai, C. Wang, and B. Menze
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2022
Paper (official link)
/
Code
|
|
HifiHead: One-Shot High Fidelity Neural Head Synthesis with 3D Control
F. Zhu, J. Zhu, W. Chu, Y. Tai*, Z. Xie, X. Huang, and C. Wang*.
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Paper
|
|
Blind Face Restoration via Integrating Face Shape and Generative Priors
F. Zhu, J. Zhu, W. Chu, X. Zhang, X. Ji, C. Wang*, and Y. Tai*.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper
/
Supp
/
Code
|
|
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
L. Kong*, B. Jiang*, D. Luo, W. Chu, X. Huang, Y. Tai, C. Wang, and J. Yang.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper
/
Supp
/
Code
|
|
Physically-guided Disentangled Implicit Rendering for 3D Face Modeling
Z. Zhang, Y. Ge, Y. Tai, W. Cao, R. Chen, K. Liu, H. Tang, X. Huang, C. Wang, Z. Xie, and D. Huang.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper
/
Supp
|
|
Learning to Restore 3D Face from In-the-Wild Degraded Images
Z. Zhang, Y. Ge, Y. Tai, X. Huang, C. Wang, H. Tang, D. Huang, and Z. Xie.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper
/
Supp
|
|
Learning to Memorize Feature Hallucination for One-Shot Image Generation
Y. Xie, Y. Fu, J. Zhu, Y. Tai, Y. Cao, and C. Wang.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper
|
|
DIRL: Domain-invariant Representation Learning for Generalizable Semantic Segmentation
Q. Xu, L. Yao, Z. Jiang, G. Jiang, W. Chu, W. Han, W. Zhang, C. Wang, and Y. Tai*.
AAAI Conference on Artificial Intellige (AAAI), 2022 [Oral]
Paper
|
|
SCSNet: Simultaneously Image Colorization and Super-Resolution
J. Zhang, C. Xu, Y. Han, J. Li, Y. Wang, Y. Tai, C. Wang, F. Huang, Z. Xie, and Y. Liu.
AAAI Conference on Artificial Intellige (AAAI), 2022
Paper
|
|
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
Z. Chen, C. Wang, Y. Wang, G. Jiang, Y. Shen, Y. Tai, C. Wang, W. Zhang, and L. Cao.
AAAI Conference on Artificial Intellige (AAAI), 2022
Paper
|
|
Spectrum-to-Kernel Translation for Accurate Blind Image Super-Resolution
G. Tao, X. Ji, W. Wang, S. Chen, C. Lin, Y. Cao, T. Lu, D. Luo, and Y. Tai.
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021
arXiv
A novel framework S2K that predicts the kernel from spectrum in frequency domain
|
|
Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model
J. Zhang, C. Xu, J. Li, W. Chen, Y. Wang, Y. Tai, S. Chen, C. Wang, F. Huang and R. Liu.
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021
arXiv
/
Code
|
|
Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework
Q. Song*, C. Wang*, Z. Jiang, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang and Y. Wu.
International Conference on Computer Vision (ICCV), 2021 [Oral]
arXiv
/
Code
|
|
Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting
C. Wang*, Q. Song*, B. Zhang, Y. Wang, Y. Tai, X. Hu, C. Wang, J. Li, J. Ma, and Y. Wu.
International Conference on Computer Vision (ICCV), 2021
arXiv
/
Code
|
|
ASFD: Automatic and Scalable Face Detector
J. Li*, B. Zhang*, Y. Wang, Y. Tai, Z. Zhang, C. Wang, J. Li, X. Huang and Y. Xia.
ACM International Conference on Multimedia (ACM MM), 2021
arXiv
Ranked No. 1 on WIDER FACE
|
|
HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping
Y. Wang*, X. Chen*, J. Zhu, W. Chu, Y. Tai*, C. Wang, J. Li, Y. Wu, F. Huang and R. Ji.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv
/
Project
/
Poster
/
Video (1min)
|
|
Context-Aware Image Inpainting with Learned Semantic Priors
W. Zhang, J. Zhu, Y. Tai, Y. Wang, W. Chu, B. Ni, C. Wang and X. Yang.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv
/
Extended journal version
/
Code (Official)
|
|
Dual Reweighting Domain Generalization for Face Presentation Attack Detection
S. Liu, K. Zhang, T. Yao, K. Sheng, S. Ding, Y. Tai, J. Li, Y. Xie and L. Ma.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv
|
|
SiamRCR: Reciprocal Classification and Regression for Visual Object Tracking
J. Peng*, Z. Jiang*, Y. Gu*, Y. Wu, Y. Wang, Y. Tai, C. Wang and W. Lin.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv
|
|
Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection
Z. Zhang, Y. Ge, R. Chen, Y. Tai, Y. Yan, J. Yang, C. Wang, J. Li, and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2021 [Oral]
Paper
/
Code (Official)
|
|
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
C. Lin*, C. Xu*, D. Luo, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang and Y. Fu.
Computer Vision and Pattern Recognition (CVPR), 2021
arXiv
/
Paper
/
Code (Official)
|
|
Learning to Restore Hazy Video: A New Real-World Dataset and A New Method
X. Zhang*, H. Dong*, J. Pan, C. Zhu, Y. Tai, C. Wang, J. Li, F. Huang and F. Wang.
Computer Vision and Pattern Recognition (CVPR), 2021
Paper
|
|
Frequency Consistent Adaptation for Real World Super Resolution
X. Ji*, G. Tao*, Y. Cao, Y. Tai, T. Lu, C. Wang, J. Li, and F. Huang.
AAAI Conference on Artificial Intelligence (AAAI), 2021
arXiv
Improved version of our prior work RealSR
|
|
Learning Comprehensive Motion Representation for Action Recognition
M. Wu*, B. Jiang*, D. Luo, J. Yan, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang, and X. Yang.
AAAI Conference on Artificial Intelligence (AAAI), 2021
arXiv
/
Code (Official)
Extented version of our prior works TEINet and TDRL
|
|
Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing
Z. Chen, T. Yao, K. Sheng, S. Ding, Y. Tai, J. Li, F. Huang, and X. Jin.
AAAI Conference on Artificial Intelligence (AAAI), 2021
arXiv
|
|
To Choose or to Fuse? Scale Selection for Crowd Counting
Q. Song*, C. Wang*, Y. Wang, Y. Tai, C. Wang, J. Li, J. Wu, and J. Ma.
AAAI Conference on Artificial Intelligence (AAAI), 2021
Paper
/
Code (Official)
|
|
FAN: Feature Adaptation Network for Surveillance Face Recognition and Normalization
X. Yin, Y. Tai, Y. Huang and X. Liu.
Asian Conference on Computer Vision (ACCV), 2020
Paper
Novel framework to improve surveillance face recognition & normalization from unpaired data
|
|
Improving Face Recognition from Hard Samples via Distribution Distillation Loss
Y. Huang*, P. Shen*, Y. Tai*, S. Li*, X. Liu, J. Li, F. Huang, and R. Ji.
European Conference on Computer Vision (ECCV), 2020
arXiv
/
Paper (Official)
/
Code (Official)
|
|
SSCGAN: Facial Attribute Editing via Style Skip Connections
W. Chu, Y. Tai*, C. Wang, J. Li, F. Huang, and R. Ji.
European Conference on Computer Vision (ECCV), 2020
Paper (Official)
|
|
Face Anti-Spoofing via Disentangled Representation Learning
K. Zhang, T. Yao, J. Zhang, Y. Tai*, S. Ding, J. Li, F. Huang, H. Song and L. Ma.
European Conference on Computer Vision (ECCV), 2020
Paper (Official)
|
|
Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking
J. Peng, C. Wang, F. Wan, Y. Wu, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang and Y. Fu.
European Conference on Computer Vision (ECCV), 2020 [Spotlight]
arXiv
/
Paper (Official)
/
Code (Official)
|
|
Temporal Distinct Representation Learning for 2D-CNN-based Action Recognition
J. Weng, D. Luo, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang, X. Jiang and J. Yuan.
European Conference on Computer Vision (ECCV), 2020
Paper (Official)
|
|
Adversarial Semantic Data Augmentation for Human Pose Estimation
Y. Bin, X. Cao, X. Chen, Y. Ge, Y. Tai, C. Wang, J. Li, F. Huang, C. Gao and N. Sang.
European Conference on Computer Vision (ECCV), 2020
arXiv
/
Paper (Official)
/
Code (Official)
State-of-the-art performance on MPII and LSP
|
|
Real-World Super-Resolution via Kernel Estimation and Noise Injection
X. Ji, Y. Cao, Y. Tai*, C. Wang, J. Li, and F. Huang.
Computer Vision and Pattern Recognition Workshop (CVPRW), 2020
Paper
/
Code (Tencent)
/
Code (Personal)
/
Code (NCNN-vulkan)
Winner of CVPR NTIRE 2020 Challenge on Real-World Super-Resolution
|
|
CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition
Y. Huang, Y. Wang, Y. Tai*, X. Liu, P. Shen, S. Li*, J. Li, and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2020
Paper (Official)
/
Code
|
|
Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation
L. Liu, J. Zhang, Y. Liu, Y. Wang, Y. Tai, D. Luo, C. Wang, J. Li, and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2020
Paper (Official)
/
Code
|
|
Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
Y. Yan, J. Qin, J. Chen, L. Liu, F. Zhu, Y. Tai, and L. Shao.
Computer Vision and Pattern Recognition (CVPR), 2020
Paper (Official)
/
Code
|
|
Fast Learning of Temporal Action Proposal via Dense Boundary Generator
C. Lin*, J. Li*, Y. Wang, Y. Tai, D. Luo, Z. Cui, C. Wang, J. Li, F. Huang and R. Ji.
AAAI Conference on Artificial Intelligence (AAAI), 2020
arXiv
/
Code
Ranked No. 1 on ActivityNet Challenge 2019 on Temporal Action Proposals
|
|
TEINet: Towards an Efficient Architecture for Video Recognition
Z. Liu*, D. Luo*, Y. Wang, L. Wang, Y. Tai, C. Wang, J. Li, F. Huang and T. Lu.
AAAI Conference on Artificial Intelligence (AAAI), 2020
arXiv
|
|
DSFD: Dual Shot Face Detector
J. Li, Y. Wang, C. Wang, Y. Tai, J. Qian, J. Yang, C.e Wang, J. Li and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2019
arXiv
/
Paper (Official)
/
Code
Ranked No. 1 on WIDER FACE and FDDB (Until 2019.01)
|
|
Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos
Y. Tai*, Y. Liang*, X. Liu, L. Duan, J. Li, C. Wang, F. Huang and Y. Chen.
AAAI Conference on Artificial Intelligence (AAAI), 2019
arXiv
/
Paper (Official)
/
Supp
/
Code
|
|
Data-Adaptive Metric Learning with Scale Alignment
S. Chen, C. Gong, J. Yang, Y. Tai, L. Hui and J. Li.
AAAI Conference on Artificial Intelligence (AAAI), 2019
Paper (Official)
|
|
Person Search via A Mask-Guided Two-Stream CNN Model
D. Chen, S. Zhang, W. Ouyang, J. Yang and Y. Tai.
European Conference on Computer Vision (ECCV), 2018
Paper
/
Poster
|
|
FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors
Y. Tai*, Y. Chen*, X. Liu, C. Shen, J. Yang.
Computer Vision and Pattern Recognition (CVPR), 2018 [Spotlight]
Paper (Official)
/
arXiv
/
Code
/
Demo
/
Slides
/
Poster
|
|
MemNet: A Persistent Memory Network for Image Restoration
Y. Tai, J. Yang, X. Liu, C. Xu.
International Conference on Computer Vision (ICCV), 2017 [Spotlight]
Paper
/
Code
/
Poster
/
'15 most influential papers' in ICCV 2017 by PaperDigest
My second paper that achieves over 1,500 google scholar citations
|
|
Image Super-Resolution via Deep Recursive Residual Network
Y. Tai, J. Yang, X. Liu.
Computer Vision and Pattern Recognition (CVPR), 2017
Paper
/
Code
/
Project
/
Poster
My first paper that achieves over 2,000 google scholar citations
|
|
Nuclear Norm based Matrix Regression with Applications to Face Recognition with Occlusion and Illumination Changes
J. Yang, L. Luo, J. Qian, Y. Tai, F. Zhang and Y. Xu.
IEEE Trans. on Pattern Analysis and Machine Intelligence, 2017
Paper
|
|
Structural Orthogonal Procrustes Regression for Face Recognition with Pose Variations and Misalignment
Y. Tai, J. Yang, F. Zhang, Y. Zhang, L. Luo, J. Qian.
SIAM Conference on Data Mining (SDM), 2016 [Oral]
Paper
|
|
Face Recognition with Pose Variations and Misalignment via Orthogonal Procrustes Regression
Y. Tai, J. Yang, Y. Zhang, L. Luo, J. Qian and Y. Chen
IEEE Trans. on Image Processing, 2016
Paper
|
|
Learning Discriminative Singular Value Decomposition Representation for Face Recognition
Y. Tai, J. Yang, L. Luo, F. Zhang and J. Qian
Pattern Recognition, 2016
Paper
|