Publications
Google Scholar (*equal contribution, †corresponding author)
-
2025
-
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention [PDF]International Conference on Computer Vision (ICCV), 2025
-
AlignGuard: Scalable Safety Alignment for Text-to-Image Generation [PDF] [Homepage & CODE]International Conference on Computer Vision (ICCV), 2025
-
True Multimodal In-Context Learning Needs Attention to the Visual Context [PDF] [Homepage & CODE]Conference On Language Modeling (COLM) , 2025
-
Multimodal Pragmatic Jailbreak on Text-to-image Models [PDF] [Homepage & CODE]the Annual Meeting of the Association for Computational Linguistics (ACL), 2025
-
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings [PDF]the Annual Meeting of the Association for Computational Linguistics (ACL), 2025
-
the Annual Meeting of the Association for Computational Linguistics (ACL), 2025
-
Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation [PDF]the Annual Meeting of the Association for Computational Linguistics (ACL), 2025
-
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos [PDF] [CODE]IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2025
-
Localizing events in videos with multimodal queries [PDF] [Homepage & CODE]IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2025
-
Fedbip: Heterogeneous one-shot federated learning with personalized latent diffusion models [PDF] [CODE]IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2025
-
International Conference on Learning Representations (ICLR) , 2025
-
Primitive Vision: Improving Diagram Understanding in MLLMs [PDF]International Conference on Machine Learning (ICML) , 2025
-
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) , 2025
-
CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering [PDF]The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025
-
Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning? [PDF] [Homepage & CODE]The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025
2024-
Visual Question Decomposition on Multimodal Large Language Models [PDF]Findings of Conference on Empirical Methods in Natural Language Processing (EMNLP) , 2024
-
Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images [PDF]Conference On Language Modeling (COLM) , 2024
-
Model-agnostic Origin Attribution of Generated Images with Few-shot Examples [PDF]European Conference on Computer Vision (ECCV) , 2024
-
Improving Adversarial Transferability via Model Alignment [PDF]European Conference on Computer Vision (ECCV) , 2024
-
Latent Guard: a Safety Framework for Text-to-image Generation [PDF]European Conference on Computer Vision (ECCV) , 2024
-
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models [PDF]European Conference on Computer Vision (ECCV) , 2024
-
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model [PDF]European Conference on Computer Vision (ECCV) , 2024
-
Dataset Distillation by Automatic Training TrajectoriesEuropean Conference on Computer Vision (ECCV) , 2024
-
As Firm As Their Foundations: Creating Transferable Adversarial Examples Across Downstream Tasks with CLIP [PDF]The British Machine Vision Conference (BMVC) , 2024
-
Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging [PDF]IEEE Transactions on Information Forensics & Security (TIFS) , 2024
-
Provably Better Explanations with Optimized Aggregation of Feature AttributionsInternational Conference on Machine Learning (ICML) , 2024
-
Transactions on Machine Learning Research (TMLR) , 2024
-
ACM Transactions on Knowledge Discovery from Data (TKDD) , 2024
-
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation [PDF][CODE]IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2024
-
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2024
-
Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds [PDF][CODE]IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2024
-
An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models [PDF][CODE]International Conference on Learning Representations (ICLR) , 2024
-
International Conference on Learning Representations (ICLR) , 2024
-
International Conference on Learning Representations (ICLR) , 2024
-
Minimalism is King! High-Frequency Energy-based Screening for Data-Efficient Backdoor Attacks [PDF]IEEE Transactions on Information Forensics & Security (TIFS) , 2024
-
Does Few-shot Learning Suffer from Backdoor Attacks? [PDF]Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) , 2024
-
FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning [PDF]Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) , 2024
-
Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression [PDF]Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) , 2024
-
Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks [PDF]IEEE Transactions on Information Forensics & Security (TIFS) , 2024
2023-
Dataset and Benchmark Track in (NeurIPS), 2023
-
International Conference on Computer Vision (ICCV), 2023
-
International Conference on Computer Vision (ICCV), 2023
-
FRAug: Tackling Federated Learning with Non-IID Features via Representation Augmentation [PDF]International Conference on Computer Vision (ICCV), 2023
-
Exploring Non-additive Randomness on ViT against Query-Based Black-Box Attacks [PDF]British Machine Vision Conference (BMVC), 2023
-
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2023
-
ECOLA: Enhancing Temporal Knowledge Embeddings with Contextualized Language Representations [PDF][CODE]Findings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2023
2022-
European Conference on Computer Vision (ECCV) , 2022
-
SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness [PDF]European Conference on Computer Vision (ECCV) , 2022
-
Towards Efficient Adversarial Training on Vision Transformers [PDF]European Conference on Computer Vision (ECCV) , 2022
-
European Conference on Computer Vision (ECCV) , 2022
2021-
Workshop in ICCV , 2021
-
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2021
-
International Conference on Learning Representations (ICLR) , 2021
-
AAAI Conference on Artificial Intelligence (AAAI) , 2021
2020-
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2020
-
European Conference on Artificial Intelligence (ECAI) , 2020
2019-
Neural network memorization dissection [PDF]Workshop on Machine Learning with Guarantees, NeurIPS , 2019
-
Saliency Methods for Explaining Adversarial Attacks [PDF]Human-Centric Machine Learning Workshop, NeurIPS , 2019
2018-
Asian Conference on Computer Vision (ACCV), 2018
-
Patents and Inventions
-
-
Verification of classification decisions in Convolutional Neural Networks [PDF]
Jindong Gu
US Patent: US 2022/0019870 A1 -
Method and processing unit for computer-implemented analysis of a classification model [PDF]
Jindong Gu
US Patent: US 2020/0334489 A1 - Siemens Inventions: 8 AI Inventions in Siemens Technology, Germany [Link]
-
Verification of classification decisions in Convolutional Neural Networks [PDF]