Follow
Weixin Chen
Title
Cited by
Cited by
Year
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
1132023
Effective Backdoor Defense by Exploiting Sensitivity of Poisoned Samples
W Chen, B Wu, H Wang
Advances in Neural Information Processing Systems (NeurIPS), 2022
412022
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
W Chen, D Song, B Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
332023
GRATH: Gradual Self-Truthifying for Large Language Models
W Chen, D Song, B Li
arXiv preprint arXiv:2401.12292, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–4