Sifan Li

Ph.D. Candidate in Computer Science

aportrait.jpg

I am currently a Ph.D. candidate at the Institute of Computing Technology, Chinese Academy of Sciences. I am also an assistant researcher in AI4Science at Shanghai AI Lab and a research intern at the University of California, Merced, working under the supervision of Dr. Yiwei Wang.

My research interests include NLP, multimodal LLMs, vision-language models, AI interpretability, agentic systems, reinforcement learning, AI4Science, efficient training, and computer vision.

news

May 31, 2026 The preprint GeoSVG-RL: Geometry-Aware Reinforcement Learning for Layout-Constrained Text-to-SVG Diagram Generation is available on arXiv.
Apr 01, 2026 I started as an assistant researcher in AI4Science at Shanghai AI Lab.
Jan 22, 2026 The preprint OptiSQL: Executable SQL Generation from Optical Tokens is available on arXiv.
Nov 07, 2025 SemVink: Advancing VLMs’ Semantic Understanding of Optical Illusions via Visual Global Thinking was presented as an oral paper at the EMNLP 2025 Main Conference in Suzhou.
Oct 14, 2025 The preprint Vision Language Models Map Logos to Text via Semantic Entanglement in the Visual Projector is available on arXiv.

selected publications

  1. geosvg-rl.png
    GeoSVG-RL: Geometry-Aware Reinforcement Learning for Layout-Constrained Text-to-SVG Diagram Generation
    Sifan Li, Yujun Cai, Hongkai Chen, and 1 more author
    2026
  2. optisql.png
    OptiSQL: Executable SQL Generation from Optical Tokens
    Sifan Li, Hongkai Chen, Yujun Cai, and 3 more authors
    2026
  3. semvink.png
    SemVink: Advancing VLMs’ Semantic Understanding of Optical Illusions via Visual Global Thinking
    Sifan Li, Yujun Cai, and Yiwei Wang
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
    Oral presentation in the Main Conference of EMNLP 2025.
  4. logos.png
    Vision Language Models Map Logos to Text via Semantic Entanglement in the Visual Projector
    Sifan Li, Hongkai Chen, Yujun Cai, and 4 more authors
    2025
  5. cpm.png
    Do "New Snow Tablets" Contain Snow? Large Language Models Over-Rely on Names to Identify Ingredients of Chinese Drugs
    Sifan Li, Yujun Cai, Bryan Hooi, and 2 more authors
    2025
  6. replace.png
    Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
    Sifan Li, Ming Tao, Hao Zhao, and 2 more authors
    2025