Natural language processing

xAI adds real-time vision, multilingual audio and real-time search to Grok

xAI adds real-time vision, multilingual audio and real-time search to Grok


xAI has rolled out three new features for its Grok voice assistant: Grok Vision, multilingual audio output, and real-time search in voice mode. According to the company, all three features are now available to iOS users. Android users with a SuperGrok subscription also get access to multilingual audio and real-time search. Grok Vision allows the assistant to provide live commentary on whatever appears on the smartphone screen. Google and OpenAI have been offering similar features for some time, using language models to interpret on-screen content in real time. The update is part of a broader push by xAI—Elon Musk’s artificial intelligence start-up—to compete with companies like Google and OpenAI. xAI recently introduced a new reasoning model called Grok 3 mini.

Ad

Join our community

Join the DECODER community on Discord, Reddit or Twitter – we can’t wait to meet you.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

xAI adds real-time vision, multilingual audio and real-time search to Grok

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Join our community

Join the DECODER community on Discord, Reddit or Twitter – we can’t wait to meet you.

xAI adds real-time vision, multilingual audio and real-time search to Grok

Source link