V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs

Penghao Wu, Saining Xie

Research output: Contribution to journalConference articlepeer-review

Fingerprint

Dive into the research topics of 'V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs'. Together they form a unique fingerprint.

Computer Science

Keyphrases