I am an undergraduate student at the School of Computer Science and Technology, University of Science and Technology of China (USTC). My research interests lie in the robustness and reliability of vision-language models, multimodal large language models, and autonomous agents.