About this episode

The race towards multimodal LLMs is heating up! With rumors of a big impending launch of Google Gemini, OpenAI is racing to push out their multimodal features. Today they launched the ability for ChatGPT to carry on audio conversations, as well as to use images as inputs. Before that on the Brief, Amazon to invest up to $4B in Anthropic.

ABOUT THE AI BREAKDOWN

The AI Breakdown helps you understand the most important news and discussions in AI.

Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe
Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown
Join the community: bit.ly/aibreakdown
Learn more: http://breakdown.network/

Listen to this episode in English to learn English

Podcast episodes are one of the highest-density ways to absorb English at native pace. "ChatGPT Can Now See and Hear," from The AI Daily Brief: Artificial Intelligence News and Analysis, gives you natural dialogue, unscripted speech, and vocabulary that actually appears in real conversations.

In the Clue app, every word in the transcript is tappable. Tap an unknown word, see the translation in your language instantly, and keep listening without breaking flow.
