Begin typing your search...

Showing results for "#Visual cues"

MS working on AI model that takes images as cues

The multi-modal large language model (MLLM) can help in an array of new tasks, including image captioning, visual question answering and more.