🤩 Multimodal Large Language Models (MLLMs) excel in language, vision, and vision-language tasks. Microsoft Research introduces KOSMOS-2, a powerful multimodal model with grounding capabilities. Exciting times ahead! ✨ #MLLMs #Grounding #KOSMOS2 go.digitalengineer.io/GR