Publications
Publications by categories in reversed chronological order.
2022
- CVPR
Unified Multivariate Gaussian Mixture for Efficient Neural Image CompressionIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Jun 2022 - IJCAI
S2 transformer for image captioningIn Proceedings of the International Joint Conferences on Artificial Intelligence Jun 2022 - TCSVT
KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D CorrespondencesIEEE Transactions on Circuits and Systems for Video Technology Jun 2022 - TCSVTProgressive Meta-learning with CurriculumIEEE Transactions on Circuits and Systems for Video Technology Jun 2022
- TMMAgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANsIEEE Transactions on Multimedia Jun 2022
- Pattern RecognitionText-instance graph: Exploring the relational semantics for text-based visual question answeringPR Jun 2022
- TMMPush & Pull: Transferable Adversarial Examples With Attentive AttackIEEE Transactions on Multimedia 2022
2021
- TIPHierarchical representation network with auxiliary tasks for video captioning and video question answeringIEEE Transactions on Image Processing 2021
- ICCV
From general to specific: Informative scene graph generation via balance adjustmentIn Proceedings of the IEEE/CVF International Conference on Computer Vision 2021 - ICCV
Exploiting scene graphs for human-object interaction detectionIn Proceedings of the IEEE/CVF International Conference on Computer Vision 2021 - TOC
- IJCAI

- IJCAI
Towards Unsupervised Deformable-Instances Image-to-Image TranslationIn Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21 Aug 2021 - TMMAMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D CorrespondencesIEEE Transactions on Multimedia Aug 2021
- ACM MMSemantic-Aware Transfer with Instance-Adaptive Parsing for Crowded Scenes Pose EstimationIn Proceedings of the 29th ACM International Conference on Multimedia Aug 2021
- ACM MMFully Functional Image Manipulation Using Scene Graphs in A Bounding-Box Free WayIn Proceedings of the 29th ACM International Conference on Multimedia Aug 2021
- ACM MMCurriculum-based meta-learningIn Proceedings of the 29th ACM International Conference on Multimedia Aug 2021
- ACM MMCamera-Agnostic Person Re-Identification via Adversarial Disentangling LearningIn Proceedings of the 29th ACM International Conference on Multimedia Aug 2021
- ACM MMConceptual and syntactical cross-modal alignment with cross-level consistency for image-text matchingIn Proceedings of the 29th ACM International Conference on Multimedia Aug 2021
- ACM MMExploring Contextual-Aware Representation and Linguistic-Diverse Expression for Visual DialogIn Proceedings of the 29th ACM International Conference on Multimedia Aug 2021
- PRLightweight dynamic conditional GAN with pyramid attention for text-to-image synthesisPattern Recognition Aug 2021
- PR
- PRGeneralized pyramid co-attention with learnable aggregation net for video question answeringPattern Recognition Aug 2021
2020
- TNNLS
Rich visual knowledge-based augmentation network for visual question answeringIEEE Transactions on Neural Networks and Learning Systems Aug 2020 - ECCV

- ACM MMKTN: Knowledge Transfer Network for Multi-Person DensePose EstimationIn Proceedings of the 28th ACM International Conference on Multimedia Aug 2020
- ACM MMLab2Pix: Label-Adaptive Generative Adversarial Network for Unsupervised Image SynthesisIn Proceedings of the 28th ACM International Conference on Multimedia Aug 2020
- ACM MMOne-shot scene graph generationIn Proceedings of the 28th ACM International Conference on Multimedia Aug 2020
- PR
- Neurocomputing
- IJCVUnified binary generative adversarial network for image retrieval and compressionInternational Journal of Computer Vision Aug 2020
- Neurocomputing
- Neurocomputing
2019
- TPAMI
Hierarchical LSTMs with adaptive attention for visual captioningIEEE transactions on pattern analysis and machine intelligence Aug 2019 - AAAI
Perceptual Pyramid Adversarial Networks for Text-to-Image SynthesisProceedings of the AAAI Conference on Artificial Intelligence Jul 2019 - AAAI
Deliberate attention networks for image captioningIn Proceedings of the AAAI conference on artificial intelligence Jul 2019 - AAAI
Structured two-stream attention network for video question answeringIn Proceedings of the AAAI Conference on Artificial Intelligence Jul 2019 - AAAI
Beyond rnns: Positional self-attention with co-attention for video question answeringIn Proceedings of the AAAI Conference on Artificial Intelligence Jul 2019 - ACM MMLearnable Aggregating Net with Diversity Learning for Video Question AnsweringIn Proceedings of the 27th ACM International Conference on Multimedia Jul 2019
- IJCAI
Beyond product quantization: Deep progressive quantization for image retrievalarXiv preprint arXiv:1906.06698 Jul 2019 - IJCAI
Deep recurrent quantization for generating sequential binary codesarXiv preprint arXiv:1906.06699 Jul 2019
2018
- IJCAI

- ACM MMExamine before You Answer: Multi-Task Learning with Adaptive-Attentions for Multiple-Choice VQAIn Proceedings of the 26th ACM International Conference on Multimedia Jul 2018
- IJCAI
From Pixels to Objects: Cubic Visual Attention for Visual Question AnsweringIn Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18 Jul 2018
2017
-
Video captioning with attention-based LSTM and semantic consistencyIEEE Transactions on Multimedia Jul 2017 - IJCAI
Hierarchical LSTM with adjusted temporal attention for video captioningarXiv preprint arXiv:1706.01231 Jul 2017 - TPAMIA survey on learning to hashIEEE transactions on pattern analysis and machine intelligence Jul 2017
2016
- ACM MMAttention-based LSTM with semantic consistency for videos captioningIn Proceedings of the 24th ACM international conference on Multimedia Jul 2016





