Jinhao Duan (段金昊)
I am a PhD student at Drexel University, advised by Prof. Kaidi Xu.
I am interested in Trustworthy Machine Learning, including Adversarial Robustness, Uncertainty Quantification, and Security & Privacy of Large Foundation Models, with potential applications in healthcare.
Email / Google Scholar / Github
- [10/2024] Workshop: We are organizing GenAI4Health@AAAI 2025 in Philadelphia, PA. More details will be released soon!
- [09/2024] Three papers were accepted:
  ReMiND (missing MRI imputation via diffusion models) was accepted by Imaging Neuroscience;
  ConU (LLM conformal prediction) was accepted by Findings of EMNLP 2024;
  GTBench (LLM game-theoretic benchmark) was accepted by NeurIPS 2024.
- [07/2024] One paper (typographic vulnerability of VLMs) was accepted by ECCV 2024
- [05/2024] Received the PhD Research Excellence Award from CCI, Drexel University
- [05/2024] SAR was accepted by ACL 2024
- [05/2024] One paper was accepted by ICML 2024
- [03/2024] Two papers were accepted by CVPR 2024 and one paper was accepted by NAACL 2024
- [01/2024] One paper was accepted by ICLR 2024
- [09/2023] Two papers were accepted by BMVC 2023
- [05/2023] One paper was accepted by ICML 2023
- [05/2023] One paper was accepted by IJCAI 2023
Selected Publications
|
(* indicates equal contribution)
|
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
Jinhao Duan*, Renming Zhang*, James Diffenderfer, Bhavya Kailkhura, Lichao Sun, Elias Stengel-Eskin, Mohit Bansal, Tianlong Chen, Kaidi Xu
Paper / Github / GTBench HF Leaderboard
NeurIPS, 2024
|
ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees
Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu
Findings of EMNLP, 2024
|
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao Cheng, Erjia Xiao, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu
ECCV, 2024
|
Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
Jinhao Duan, Hao Cheng, Shiqi Wang, Alex Zavalny, Chenan Wang, Renjing Xu, Bhavya Kailkhura, Kaidi Xu
Github
ACL, 2024
|
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
Junyuan Hong*, Jinhao Duan*, Chenhui Zhang*, Zhangheng Li*, Chulin Xie, Kelsey Lieberman, James Diffenderfer, Brian Bartoldson, Ajay Jaiswal, Kaidi Xu, Bhavya Kailkhura, Dan Hendrycks, Dawn Song, Zhangyang Wang, Bo Li
Paper / Project / Github / Leaderboard / Models
SeT@ICLR, 2024
ICML, 2024
|
ACT-Diffusion: Efficient Adversarial Consistency Training for One-step Diffusion Models
Fei Kong, Jinhao Duan, Lichao Sun, Hao Cheng, Renjing Xu, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu
CVPR, 2024
|
Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion?
Zhengyue Zhao, Jinhao Duan, Kaidi Xu, Chenan Wang, Rui Zhang, Zidong Du, Qi Guo, Xing Hu
CVPR, 2024
|
ReTA: Recursively Thinking Ahead to Improve the Strategic Reasoning of Large Language Models
Jinhao Duan, Shiqi Wang, James Diffenderfer, Lichao Sun, Tianlong Chen, Bhavya Kailkhura, Kaidi Xu
Paper
NAACL, 2024
|
An efficient membership inference attack for the diffusion model by proximal initialization
Fei Kong, Jinhao Duan, RuiPeng Ma, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu
ICLR, 2024
|
ReMiND: Recovery of Missing Neuroimaging using Diffusion Models with Application to Alzheimer's Disease
Chenxi Yuan*, Jinhao Duan*, Nicholas J Tustison, Kaidi Xu, Rebecca A Hubbard, Kristin A Linn
Github
Imaging Neuroscience, 2024
|
RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias
Hao Cheng, Jinhao Duan, Hui Li, Lyutianyang Zhang, Jiahang Cao, Ping Wang, Jize Zhang, Kaidi Xu, Renjing Xu
BMVC, 2023
|
Semantic adversarial attacks via diffusion models
Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm, Kaidi Xu
BMVC, 2023
|
Are Diffusion Models Vulnerable to Membership Inference Attacks?
Jinhao Duan, Fei Kong, Shiqi Wang, Xiaoshuang Shi, Kaidi Xu
Paper / Github (SecMI) / Github (SecMI-LDM)
ICML, 2023
|
Improve Video Representation with Temporal Adversarial Augmentation
Jinhao Duan, Quanfu Fan, Hao Cheng, Xiaoshuang Shi, Kaidi Xu
IJCAI, 2023
|
Services
Program Committee (PC) member: EMNLP (2023), AAAI (2024), CVPR (2024), NeurIPS (2024), ICLR (2025), AISTATS (2025)
Journal Reviewer: IEEE Security & Privacy, Machine Learning
|
This website template is borrowed from Jon Barron.