Event details
Feb
25
Multimodal AI - Visual tool calling
Multimodal AI models can include visual tools that enable them to manipulate images or retrieve external information. A zoom tool can be used to focus on a specific section of a painting. A reverse image search tool can find similar images across the Web. This visual search can retrieve metadata that improves the recognition and interpretation of visual information. We will begin with existing visual tools and consider which additional tools could aid research.
Image: Elise Racine & Digit / Woven Dialogues / Licenced by CC-BY 4.0
Image: Elise Racine & Digit / Woven Dialogues / Licenced by CC-BY 4.0
University programs and activities are open to all eligible participants without regard to identity or other protected characteristics. Sponsorship of an event does not constitute institutional endorsement of external speakers or views presented.
View physical accessibility information for campus buildings and find accessible routes using the Princeton Campus Map app.