SemVink

Visual global thinking for VLM semantic understanding of optical illusions.

SemVink investigates why vision-language models struggle with hidden semantic content in optical illusions and how visual global thinking can bridge this gap.

References