NJU

Selected Pulications

Publications (* equal contribution, * corresponding author. All publications on [Google Scholar])

OpenVid-1M: A Large-Scale Dataset for High-Quality Text-to-Video Generation
K. Nan*, R. Xie*, P. Zhou*, T. Fan, Z. Yang, Z. Chen, X. Li, J. Yang and Y. Tai*.
arXiv:2407.02371v1, 2024
arXiv / Website / Demo (High-res) / Dataset / Models / Code GitHub stars

AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation
R. Xie, Y. Tai*, C. Zhao, K. Zhang, Z. Zhang, J. Zhou, X. Ye, Q. Wang* and J. Yang.
arXiv:2404.01717, 2024
arXiv / Website / Code GitHub stars

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
X. Ji*, C. Lin, Z. Ding, Y. Tai, J. Yang, J. Zhu, X. Hu, J. Zhang, D. Luo and C. Wang.
arXiv:2406.18284, 2024
arXiv

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation
Q. Wang, J. Zhang, C. Xu, W. Cao, Y. Tai, Y. Han, Y. Ge, H. Gu, C. Wang and Y. Fu.
arXiv:2403.17664, 2024
arXiv

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation
S. Guan*, Y. Ge*, Y. Tai*, J. Yang, W. Li and M. You*.
European Conference on Computer Vision (ECCV), 2024
arXiv (Coming soon)

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
C. Xu, Y. Liu, J. Xing, W. Wang, M. Sun, J. Dan, T. Huang, S Li, Z. Cheng, Y. Tai, B. Sun
Computer Vision and Pattern Recognition (CVPR), 2024
Paper / Code

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
X. Peng, J. Zhu, B. Jiang, Y. Tai, D. Luo, J. Zhang, W. Lin, T. Jin, C. Wang, and R. Ji.
Computer Vision and Pattern Recognition (CVPR), 2024
arXiv / Website

Dynamic Frame Interpolation in Wavelet Domain
L. Kong, B. Jiang, D. Luo, W. Chu, Y. Tai, C. Wang, and J. Yang.
IEEE Trans. on Image Processing, 2023
arXiv / Evaluation code GitHub stars

Learning Versatile 3D Shape Generation with Improved AR Models
S. Luo, X. Qian, Y. Fu, Y. Zhang, Y. Tai, Z. Zhang, C. Wang, and X. Xue.
International Conference on Computer Vision (ICCV), 2023
arXiv

Learning Neural Proto-face Field for Disentangled 3D Face Modeling In the Wild
Z. Zhang, R. Chen, W. Cao, Y. Tai*, and C. Wang*
Computer Vision and Pattern Recognition (CVPR), 2023
Paper
Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space
T. Huang, Z. Ding, J. Zhang, Y. Tai, Z. Zhang, M. Chen, C. Wang, and Y. Liu
Computer Vision and Pattern Recognition (CVPR), 2023
Paper
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning
C. Xu, J. Zhang, J. Zhu, W. Chu, Y. Tai, C. Wang, and Y. Liu
Computer Vision and Pattern Recognition (CVPR), 2023
Paper (arXiv)
High-resolution Iterative Feedback Network for Camouflaged Object Detection
X. Hu, S. Wang, X. Qian, H. Dai, W. Ren, D. Luo, Y. Tai, and L. Shao
AAAI Conference on Artificial Intellige (AAAI), 2023
Paper (arXiv)
High-Resolution GAN Inversion for Degraded Images in Large Diverse Datasets
Y. Wang, C. Lin, D. Luo, Y. Tai, Z. Zhang and Y. Xie
AAAI Conference on Artificial Intellige (AAAI), 2023 [Oral]
Paper (arXiv)
3QNet: 3D Point Cloud Geometry Quantization Compression Network
T. Huang, J. Zhang, J. Chen, Z. Ding, Y. Tai, Z. Zhang, C. Wang, and Y. Liu
ACM Transactions on Graphics, 2022
Paper
ColorFormer: Image Colorization via Color Memory assisted Hybrid-attention Transformer
X. Ji*, B. Jiang*, D. Luo, G. Tao, W. Chu, Y. Tai*, Z. Xie, and C. Wang*
European Conference on Computer Vision (ECCV), 2022
Paper / Supp
StyleFace: Towards Identity-Disentangled Face Generation on Megapixels
Y. Luo, J. Yan, J. Zhu, K. He, W. Chu, Y. Tai, and C. Wang
European Conference on Computer Vision (ECCV), 2022
Paper / Supp
Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping
C. Xu*, J. Zhang*, Y. Han, G. Tian, X. Zeng, Y. Tai, Y. Wang, C. Wang, and Y. Liu
European Conference on Computer Vision (ECCV), 2022
Paper / Supp / Code GitHub stars
SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer
H. Zhou, Y. Cao, W. Chu, J. Zhu, L. Tong, Y. Tai, and C. Wang
European Conference on Computer Vision (ECCV), 2022
Paper (arXiv) / Supp / Code GitHub stars
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation
Z. Jiang, Y. Li, C. Yang, P. Gao, Y. Wang, Y. Tai, and C. Wang
European Conference on Computer Vision (ECCV), 2022
Paper (arXiv) / Code GitHub stars
Joint Learning Content and Degradation Aware Feature for Blind Super-Resolution
Y. Zhou*, C. Lin*, D. Luo, Y. Liu, Mingang Chen, Y. Tai*, and C. Wang*
ACM International Conference on Multimedia (ACM MM), 2022
Paper (arXiv)
AutoGAN-Synthesizer: Neural Architecture Searchfor Cross-Modality MRI Synthesis
X. Hu, R. Shen, D. Luo, Y. Tai, C. Wang, and B. Menze
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2022
Paper (official link) / Code GitHub stars
HifiHead: One-Shot High Fidelity Neural Head Synthesis with 3D Control
F. Zhu, J. Zhu, W. Chu, Y. Tai*, Z. Xie, X. Huang, and C. Wang*.
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Paper
Blind Face Restoration via Integrating Face Shape and Generative Priors
F. Zhu, J. Zhu, W. Chu, X. Zhang, X. Ji, C. Wang*, and Y. Tai*.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper / Supp / Code GitHub stars
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
L. Kong*, B. Jiang*, D. Luo, W. Chu, X. Huang, Y. Tai, C. Wang, and J. Yang.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper / Supp / Code GitHub stars
Physically-guided Disentangled Implicit Rendering for 3D Face Modeling
Z. Zhang, Y. Ge, Y. Tai, W. Cao, R. Chen, K. Liu, H. Tang, X. Huang, C. Wang, Z. Xie, and D. Huang.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper / Supp
Learning to Restore 3D Face from In-the-Wild Degraded Images
Z. Zhang, Y. Ge, Y. Tai, X. Huang, C. Wang, H. Tang, D. Huang, and Z. Xie.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper / Supp
Learning to Memorize Feature Hallucination for One-Shot Image Generation
Y. Xie, Y. Fu, J. Zhu, Y. Tai, Y. Cao, and C. Wang.
Computer Vision and Pattern Recognition (CVPR), 2022
Paper
DIRL: Domain-invariant Representation Learning for Generalizable Semantic Segmentation
Q. Xu, L. Yao, Z. Jiang, G. Jiang, W. Chu, W. Han, W. Zhang, C. Wang, and Y. Tai*.
AAAI Conference on Artificial Intellige (AAAI), 2022 [Oral]
Paper
SCSNet: Simultaneously Image Colorization and Super-Resolution
J. Zhang, C. Xu, Y. Han, J. Li, Y. Wang, Y. Tai, C. Wang, F. Huang, Z. Xie, and Y. Liu.
AAAI Conference on Artificial Intellige (AAAI), 2022
Paper
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
Z. Chen, C. Wang, Y. Wang, G. Jiang, Y. Shen, Y. Tai, C. Wang, W. Zhang, and L. Cao.
AAAI Conference on Artificial Intellige (AAAI), 2022
Paper
Spectrum-to-Kernel Translation for Accurate Blind Image Super-Resolution
G. Tao, X. Ji, W. Wang, S. Chen, C. Lin, Y. Cao, T. Lu, D. Luo, and Y. Tai.
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021
arXiv

A novel framework S2K that predicts the kernel from spectrum in frequency domain

Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model
J. Zhang, C. Xu, J. Li, W. Chen, Y. Wang, Y. Tai, S. Chen, C. Wang, F. Huang and R. Liu.
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021
arXiv / Code GitHub stars
Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework
Q. Song*, C. Wang*, Z. Jiang, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang and Y. Wu.
International Conference on Computer Vision (ICCV), 2021 [Oral]
arXiv / Code GitHub stars
Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting
C. Wang*, Q. Song*, B. Zhang, Y. Wang, Y. Tai, X. Hu, C. Wang, J. Li, J. Ma, and Y. Wu.
International Conference on Computer Vision (ICCV), 2021
arXiv / Code GitHub stars
ASFD: Automatic and Scalable Face Detector
J. Li*, B. Zhang*, Y. Wang, Y. Tai, Z. Zhang, C. Wang, J. Li, X. Huang and Y. Xia.
ACM International Conference on Multimedia (ACM MM), 2021
arXiv

Ranked No. 1 on WIDER FACE

HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping
Y. Wang*, X. Chen*, J. Zhu, W. Chu, Y. Tai*, C. Wang, J. Li, Y. Wu, F. Huang and R. Ji.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv / Project / Poster / Video (1min)

Context-Aware Image Inpainting with Learned Semantic Priors
W. Zhang, J. Zhu, Y. Tai, Y. Wang, W. Chu, B. Ni, C. Wang and X. Yang.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv / Extended journal version / Code (Official) GitHub stars

Dual Reweighting Domain Generalization for Face Presentation Attack Detection
S. Liu, K. Zhang, T. Yao, K. Sheng, S. Ding, Y. Tai, J. Li, Y. Xie and L. Ma.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv

SiamRCR: Reciprocal Classification and Regression for Visual Object Tracking
J. Peng*, Z. Jiang*, Y. Gu*, Y. Wu, Y. Wang, Y. Tai, C. Wang and W. Lin.
International Joint Conference on Artificial Intelligence (IJCAI), 2021
arXiv

Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection
Z. Zhang, Y. Ge, R. Chen, Y. Tai, Y. Yan, J. Yang, C. Wang, J. Li, and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2021 [Oral]
Paper / Code (Official) GitHub stars

Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
C. Lin*, C. Xu*, D. Luo, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang and Y. Fu.
Computer Vision and Pattern Recognition (CVPR), 2021
arXiv / Paper / Code (Official) GitHub stars

Learning to Restore Hazy Video: A New Real-World Dataset and A New Method
X. Zhang*, H. Dong*, J. Pan, C. Zhu, Y. Tai, C. Wang, J. Li, F. Huang and F. Wang.
Computer Vision and Pattern Recognition (CVPR), 2021
Paper
Frequency Consistent Adaptation for Real World Super Resolution
X. Ji*, G. Tao*, Y. Cao, Y. Tai, T. Lu, C. Wang, J. Li, and F. Huang.
AAAI Conference on Artificial Intelligence (AAAI), 2021
arXiv

Improved version of our prior work RealSR

Learning Comprehensive Motion Representation for Action Recognition
M. Wu*, B. Jiang*, D. Luo, J. Yan, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang, and X. Yang.
AAAI Conference on Artificial Intelligence (AAAI), 2021
arXiv / Code (Official) GitHub stars

Extented version of our prior works TEINet and TDRL

Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing
Z. Chen, T. Yao, K. Sheng, S. Ding, Y. Tai, J. Li, F. Huang, and X. Jin.
AAAI Conference on Artificial Intelligence (AAAI), 2021
arXiv

To Choose or to Fuse? Scale Selection for Crowd Counting
Q. Song*, C. Wang*, Y. Wang, Y. Tai, C. Wang, J. Li, J. Wu, and J. Ma.
AAAI Conference on Artificial Intelligence (AAAI), 2021
Paper / Code (Official) GitHub stars

FAN: Feature Adaptation Network for Surveillance Face Recognition and Normalization
X. Yin, Y. Tai, Y. Huang and X. Liu.
Asian Conference on Computer Vision (ACCV), 2020
Paper

Novel framework to improve surveillance face recognition & normalization from unpaired data

Improving Face Recognition from Hard Samples via Distribution Distillation Loss
Y. Huang*, P. Shen*, Y. Tai*, S. Li*, X. Liu, J. Li, F. Huang, and R. Ji.
European Conference on Computer Vision (ECCV), 2020
arXiv / Paper (Official) / Code (Official) GitHub stars

SSCGAN: Facial Attribute Editing via Style Skip Connections
W. Chu, Y. Tai*, C. Wang, J. Li, F. Huang, and R. Ji.
European Conference on Computer Vision (ECCV), 2020
Paper (Official)

Face Anti-Spoofing via Disentangled Representation Learning
K. Zhang, T. Yao, J. Zhang, Y. Tai*, S. Ding, J. Li, F. Huang, H. Song and L. Ma.
European Conference on Computer Vision (ECCV), 2020
Paper (Official)

Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End
Joint Multiple-Object Detection and Tracking

J. Peng, C. Wang, F. Wan, Y. Wu, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang and Y. Fu.
European Conference on Computer Vision (ECCV), 2020 [Spotlight]
arXiv / Paper (Official) / Code (Official) GitHub stars

Temporal Distinct Representation Learning for 2D-CNN-based Action Recognition
J. Weng, D. Luo, Y. Wang, Y. Tai, C. Wang, J. Li, F. Huang, X. Jiang and J. Yuan.
European Conference on Computer Vision (ECCV), 2020
Paper (Official)

Adversarial Semantic Data Augmentation for Human Pose Estimation
Y. Bin, X. Cao, X. Chen, Y. Ge, Y. Tai, C. Wang, J. Li, F. Huang, C. Gao and N. Sang.
European Conference on Computer Vision (ECCV), 2020
arXiv / Paper (Official) / Code (Official) GitHub stars

State-of-the-art performance on MPII and LSP

Real-World Super-Resolution via Kernel Estimation and Noise Injection
X. Ji, Y. Cao, Y. Tai*, C. Wang, J. Li, and F. Huang.
Computer Vision and Pattern Recognition Workshop (CVPRW), 2020
Paper / Code (Tencent) GitHub stars / Code (Personal) GitHub stars / Code (NCNN-vulkan) GitHub stars

Winner of CVPR NTIRE 2020 Challenge on Real-World Super-Resolution

CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition
Y. Huang, Y. Wang, Y. Tai*, X. Liu, P. Shen, S. Li*, J. Li, and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2020
Paper (Official) / Code GitHub stars

Learning by Analogy: Reliable Supervision from Transformations for
Unsupervised Optical Flow Estimation

L. Liu, J. Zhang, Y. Liu, Y. Wang, Y. Tai, D. Luo, C. Wang, J. Li, and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2020
Paper (Official) / Code GitHub stars

Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
Y. Yan, J. Qin, J. Chen, L. Liu, F. Zhu, Y. Tai, and L. Shao.
Computer Vision and Pattern Recognition (CVPR), 2020
Paper (Official) / Code GitHub stars

Fast Learning of Temporal Action Proposal via Dense Boundary Generator
C. Lin*, J. Li*, Y. Wang, Y. Tai, D. Luo, Z. Cui, C. Wang, J. Li, F. Huang and R. Ji.
AAAI Conference on Artificial Intelligence (AAAI), 2020
arXiv / Code GitHub stars

Ranked No. 1 on ActivityNet Challenge 2019 on Temporal Action Proposals

TEINet: Towards an Efficient Architecture for Video Recognition
Z. Liu*, D. Luo*, Y. Wang, L. Wang, Y. Tai, C. Wang, J. Li, F. Huang and T. Lu.
AAAI Conference on Artificial Intelligence (AAAI), 2020
arXiv

DSFD: Dual Shot Face Detector
J. Li, Y. Wang, C. Wang, Y. Tai, J. Qian, J. Yang, C.e Wang, J. Li and F. Huang.
Computer Vision and Pattern Recognition (CVPR), 2019
arXiv / Paper (Official) / Code GitHub stars

Ranked No. 1 on WIDER FACE and FDDB (Until 2019.01)

Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos
Y. Tai*, Y. Liang*, X. Liu, L. Duan, J. Li, C. Wang, F. Huang and Y. Chen.
AAAI Conference on Artificial Intelligence (AAAI), 2019
arXiv / Paper (Official) / Supp / Code GitHub stars

Data-Adaptive Metric Learning with Scale Alignment
S. Chen, C. Gong, J. Yang, Y. Tai, L. Hui and J. Li.
AAAI Conference on Artificial Intelligence (AAAI), 2019
Paper (Official)

Person Search via A Mask-Guided Two-Stream CNN Model
D. Chen, S. Zhang, W. Ouyang, J. Yang and Y. Tai.
European Conference on Computer Vision (ECCV), 2018
Paper / Poster

FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors
Y. Tai*, Y. Chen*, X. Liu, C. Shen, J. Yang.
Computer Vision and Pattern Recognition (CVPR), 2018 [Spotlight]
Paper (Official) / arXiv / Code GitHub stars / Demo / Slides / Poster

MemNet: A Persistent Memory Network for Image Restoration
Y. Tai, J. Yang, X. Liu, C. Xu.
International Conference on Computer Vision (ICCV), 2017 [Spotlight]
Paper / Code GitHub stars / Poster / '15 most influential papers' in ICCV 2017 by PaperDigest

My second paper that achieves over 1,500 google scholar citations

Image Super-Resolution via Deep Recursive Residual Network
Y. Tai, J. Yang, X. Liu.
Computer Vision and Pattern Recognition (CVPR), 2017
Paper / Code GitHub stars / Project / Poster

My first paper that achieves over 2,000 google scholar citations

Nuclear Norm based Matrix Regression with Applications to Face Recognition with Occlusion and Illumination Changes
J. Yang, L. Luo, J. Qian, Y. Tai, F. Zhang and Y. Xu.
IEEE Trans. on Pattern Analysis and Machine Intelligence, 2017
Paper

Structural Orthogonal Procrustes Regression for Face Recognition with Pose Variations and Misalignment
Y. Tai, J. Yang, F. Zhang, Y. Zhang, L. Luo, J. Qian.
SIAM Conference on Data Mining (SDM), 2016 [Oral]
Paper

Face Recognition with Pose Variations and Misalignment via Orthogonal Procrustes Regression
Y. Tai, J. Yang, Y. Zhang, L. Luo, J. Qian and Y. Chen
IEEE Trans. on Image Processing, 2016
Paper

Learning Discriminative Singular Value Decomposition Representation for Face Recognition
Y. Tai, J. Yang, L. Luo, F. Zhang and J. Qian
Pattern Recognition, 2016
Paper