xAI|Grok-1.5 Vision Preview (April 12, 2024)
文章来源: 蓝调2024-04-13 21:52:37

 

 

April 12, 2024

Grok-1.5 Vision Preview

Connecting the digital and physical worlds with our first multimodal model.

Introducing Grok-1.5V, our first-generation multimodal model. In addition to its strong text capabilities, Grok can now process a wide variety of visual information, including documents, diagrams, charts, screenshots, and photographs. Grok-1.5V will be available soon to our early testers and existing Grok users.

https://x.ai/blog/grok-1.5v

An abstract visualization of two transparent spheres reminiscent of glasses over a galaxy filled with stars and nebulae.