Eddy Luo
Click to switch avatar

Weidi(Eddy) Luo

CS PhD student @ The University of Georgia

燃えろ! 戦う! To be the rising star!

Left Anime Character

"It's not that life has dreams, but that dreams create life."

「人生に夢があるのではなく、夢が人生をつくるのです」

— Taeko Uzuki
Right Anime Character

🚀 About the Protagonist of 「青春との戦い」

Eddy Luo (罗威迪 & 一ノ瀬 エイジ), an incoming Ph.D. student at University of Georgia, where I will be advised by Prof.Xiang Zhen. I am also fortunate to be co-advised by Prof. Chaowei Xiao at the University of Wisconsin–Madison, a mentor I deeply respect and am sincerely grateful to. Previously, I served as a research assistant at the OSU NLP Group and the ICICLE Institute advised by Prof. Yu Su.

Eddy warmly welcomes collaboration opportunities and supports undergraduates who want to apply for a PhD program. He hopes we can conduct significant research together. Please feel free to contact him at Email: luo.1455[shift+2]uga[dot]edu. どうぞよろしくお願いします!

Eddy's research interests:

Trustworthy AI & AI Safety

Using interpretability methods, discover security vulnerabilities in AI systems, including foundation models and AI agents, and develop corresponding defense and detection algorithms, including safety alignment strategies.

AI in Security

Leverage AI to drive defense and attack strategies on systems, including web system and operating system.

Lifelong AI Algorithms

Develop lifelong learning AI frameworks and defense systems by utilizing reinforcement learning, cognitive science, bio-inspired algorithms, active learning, and so on.

📰 Eddy's News

2025.05.15
🎉

Two of our works, AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection and Disentangling Memory and Reasoning Ability in Large Language Models have been accepted by ACL'2025 main conference. Thanks to my collaborators.

2025.04.15
🎉

Our work JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks wins $20,000 SafeBench Prize for Advancing MultiModal Large Language Model Security Benchmarking from Center for AI Safety.

2025.04.09
🎉

I will join the University of Georgia as a PhD student in August 2025.

📝 Eddy's Selected Pre-print

Arxiv
Doxing via the Lens

Weidi Luo*, Tianyu Lu*, Qiming Zhang*, Xiaogeng Liu, Bin Hu, Yue Zhao, Jieyu Zhao, Song Gao, Patrick McDaniel, Zhen Xiang, Chaowei Xiao

Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models

23,000+ views, 1,300+ shares
Arxiv
Visual-RolePlay

Siyuan Ma*, Weidi Luo*, Yu Wang, Xiaogeng Liu, Muhao Chen, Bo Li, Chaowei Xiao

Visual-RolePlay: Universal Jailbreak Attack on MultiModal Large Language Models via Role-playing Image Character

📸 Eddy's Selected Publication

ACL'2025
AGrail

Weidi Luo, Shenghong Dai, Xiaogeng Liu, Suman Banerjee, Huan Sun, Muhao Chen, Chaowei Xiao

AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection

NAACL'2025
Dynamic Safeguards

Weidi Luo*, He Cao*, Yu Wang, Zijing Liu, Aidan Wong, Bin Feng, Yuan Yao, Yu Li

Dynamic Guided and Domain Applicable Safeguards for Enhanced Security in Large Language Models

COLM'2024
JailBreakV-28K

Weidi Luo*, Siyuan Ma*, Xiaogeng Liu*, Xiaoyu Guo, Chaowei Xiao

JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks

$20,000 SafeBench Award from Center for AI Safety
ACL'2025
Memory and Reasoning

Mingyu Jin, Weidi Luo, Sitao Cheng, Xinyi Wang, Wenyue Hua, Ruixiang Tang, William Yang Wang, Yongfeng Zhang

Disentangling Memory and Reasoning Ability in Large Language Models

🌍 Visitor Statistics