About me
I am currently a third-year PhD student at Pennsylvania State University, where I am fortunate to be advised by Dr. Jinyuan Jia. My research focuses on AI security and trustworthy ML, most recently on LLM robustness and transparency. Previously, I earned dual bachelor's degrees in Mechanical Engineering and Computer Science from RPI, followed by a master's in Computer Science from Duke University.
Current research focus areas
- Transparency of LLM-empowered AI systems for security purposes
- Enhancing the efficiency of optimization-based LLM red-teaming
- Provably secure/robust machine learning systems
Publications
* Equal contribution
- *Yanting Wang, *Wei Zou, Runpeng Geng, and Jinyuan Jia. TracLLM: A Generic Framework for Attributing Long Context LLMs, In USENIX Security, 2025. Code
- Yupei Liu, Yanting Wang, and Jinyuan Jia. TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning, In AAAI, 2025.
- Yanting Wang, Wei Zou, Jinyuan Jia. FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models, In IEEE S&P, 2024. Code
- Yanting Wang, Hongye Fu, Wei Zou, and Jinyuan Jia. MMCert: Provable Defense against Adversarial Attacks to Multi-modal Models, In CVPR, 2024. Code
Preprints
- Yanting Wang, Runpeng Geng, Jinghui Chen, Minhao Cheng, Jinyuan Jia. TASO: Jailbreak LLMs via Alternative Template and Suffix Optimization, In arXiv, 2025.
- Runpeng Geng, Yanting Wang, Chenlong Yin, Minhao Cheng, Ying Chen, Jinyuan Jia. PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization, In arXiv, 2025. Code
- Wei Zou, Yupei Liu, Yanting Wang, Ying Chen, Neil Gong, and Jinyuan Jia. PIShield: Detecting Prompt Injection Attacks via Intrinsic LLM Features, In arXiv, 2025. Code
- Runpeng Geng, Yanting Wang, Ying Chen, and Jinyuan Jia. UniC-RAG: Universal Knowledge Corruption Attacks to Retrieval-Augmented Generation, In arXiv, 2025.
- Yupei Liu, Yanting Wang, Yuqi Jia, Jinyuan Jia, and Neil Zhenqiang Gong. SecInfer: Preventing Prompt Injection via Inference-Time Scaling, In arXiv, 2025.
- Yanting Wang, Runpeng Geng, Ying Chen, and Jinyuan Jia. AttnTrace: Attention-based Context Traceback for Long-Context LLMs, In arXiv, 2025. Code | Demo (HF Spaces)
- Yuzhou Nie, Yanting Wang, Jinyuan Jia, Michael J. De Lucia, Nathaniel D. Bastian, Wenbo Guo, and Dawn Song. TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models, In arXiv, 2024.
Professional Service
Program Committee member:
AAAI, 2026
Reviewer:
Pattern Recognition, 2025
TNNLS, 2024
Personal
I'm always interested in trying new things. Outside of work, I enjoy playing all kinds of ball sports, hiking, and snowboarding.
