Sifan Li

I am a Ph.D. candidate at the Institute of Computing Technology, Chinese Academy of Sciences, supervised by Prof. Shenghua Liu and Prof. Yiwei Wang. I am also an assistant researcher in AI4Science at Shanghai AI Lab and a research intern at the University of California, Merced, where I work under the supervision of Prof. Yiwei Wang.

My research interests include NLP, multimodal LLMs, vision-language models, AI interpretability, agentic systems, reinforcement learning, AI4Science, efficient training, and computer vision.

news

May 31, 2026	The preprint GeoSVG-RL: Geometry-Aware Reinforcement Learning for Layout-Constrained Text-to-SVG Diagram Generation is available on arXiv.
Apr 01, 2026	I started as an assistant researcher in AI4Science at Shanghai AI Lab.
Jan 22, 2026	The preprint OptiSQL: Executable SQL Generation from Optical Tokens is available on arXiv.
Nov 07, 2025	SemVink: Advancing VLMs’ Semantic Understanding of Optical Illusions via Visual Global Thinking was presented as an oral paper at the EMNLP 2025 Main Conference in Suzhou.
Oct 14, 2025	The preprint Vision Language Models Map Logos to Text via Semantic Entanglement in the Visual Projector is available on arXiv.

selected publications

GeoSVG-RL: Geometry-Aware Reinforcement Learning for Layout-Constrained Text-to-SVG Diagram Generation

Sifan Li, Yujun Cai, Hongkai Chen, and 1 more author

2026

arXiv

OptiSQL: Executable SQL Generation from Optical Tokens

Sifan Li, Hongkai Chen, Yujun Cai, and 3 more authors

2026

arXiv

SemVink: Advancing VLMs’ Semantic Understanding of Optical Illusions via Visual Global Thinking

Sifan Li, Yujun Cai, and Yiwei Wang

In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Oral presentation in the Main Conference of EMNLP 2025.

Vision Language Models Map Logos to Text via Semantic Entanglement in the Visual Projector

Sifan Li, Hongkai Chen, Yujun Cai, and 4 more authors

2025

arXiv

Do "New Snow Tablets" Contain Snow? Large Language Models Over-Rely on Names to Identify Ingredients of Chinese Drugs

Sifan Li, Yujun Cai, Bryan Hooi, and 2 more authors

2025

arXiv

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image

Sifan Li, Ming Tao, Hao Zhao, and 2 more authors

2025

arXiv