Log in

goodpods headphones icon

To access all our features

Open the Goodpods app
Close icon
This Day in AI Podcast - EP96: Gemini Native Image Generation & Editing, OpenAI's Agent SDK & Will Manus AI Invade USA?

EP96: Gemini Native Image Generation & Editing, OpenAI's Agent SDK & Will Manus AI Invade USA?

03/14/25 • 72 min

1 Listener

This Day in AI Podcast

Join Simtheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Gemini Flash 2.0 Experimental Native Image Generation & Editing
27:55 - Thoughts on OpenAI's "New tools for building agents" announcement
43:31 - Why is everyone talking about MCP all of a sudden?
56:31 - Manus AI: Will Manus Invade the USA and Defeat it With Powerful AGI? (jokes)
----
Thanks for all of your support and listening!

plus icon
bookmark

Join Simtheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Gemini Flash 2.0 Experimental Native Image Generation & Editing
27:55 - Thoughts on OpenAI's "New tools for building agents" announcement
43:31 - Why is everyone talking about MCP all of a sudden?
56:31 - Manus AI: Will Manus Invade the USA and Defeat it With Powerful AGI? (jokes)
----
Thanks for all of your support and listening!

Previous Episode

undefined - EP95: Why does GPT4.5 exist? Claude 3.7 Sonnet Has Arrived & Working with Claude Code Agent

EP95: Why does GPT4.5 exist? Claude 3.7 Sonnet Has Arrived & Working with Claude Code Agent

Join Simtheory to try GPT-4.5: https://simtheory.ai
Dis Track: https://simulationtheory.ai/5714654f-0fbe-496f-8428-20018457c4c7
===
CHAPTERS:
00:00 - Reaction to GPT4.5 Live Stream + Release
12:45 - Claude 3.7 Sonnet Release: Reactions and First Week Impressions
45:58 - Claude 3.7 Sonnet Dis Track Test
56:10 - Claude Code First Impressions + Future Agent Workflows
1:15:45 - Chris's Veo2 Film Clip
1:24:49 - Alexa+ AI Assistant
1:34:05 - Claude 3.7 Sonnet BOOM FACTOR

Next Episode

undefined - EP97: Moore’s Law for AI agents, OpenAI's new audio models, o1-pro API & When Will AI Replace Us?

EP97: Moore’s Law for AI agents, OpenAI's new audio models, o1-pro API & When Will AI Replace Us?

Create an AI workspace on Simtheory: https://simtheory.ai
---
Song: https://simulationtheory.ai/f6d643e4-4201-475c-aa82-8a96b6b3b215
---
CHAPTERS:
00:00 - OpenAI's audio model updates: gpt-4o-transcribe, gpt-4o-mini-tts
18:39 - Strategy of AI Labs with Agent SDKs and Model "stacks" and limitations of voice
25:28 - Cost of models, GPT-4.5, o1-pro api release thoughts
31:57 - o1-pro "I am rich" track & Chris's o1-pro PR stunt realization, more thoughts on o1 family
48:39 - Moore’s Law for AI agents, current AI workflows and future enterprise agent workflows & AI agent job losses
1:24:09 - Can we control agents?
1:29:21 - Final thoughts for the week
1:35:15 - Full "I am rich" o1-pro track
---
See you next week and thanks for your support.

CORRECTION: Kosciusko is obviously not an aboriginal name I misspoke. Wagga Wagga and others in the voice clip are and are great ways to test AI text to speech models!

Episode Comments

Generate a badge

Get a badge for your website that links back to this episode

Select type & size
Open dropdown icon
share badge image

<a href="https://goodpods.com/podcasts/this-day-in-ai-podcast-247037/ep96-gemini-native-image-generation-and-editing-openais-agent-sdk-and-87371691"> <img src="https://storage.googleapis.com/goodpods-images-bucket/badges/generic-badge-1.svg" alt="listen to ep96: gemini native image generation & editing, openai's agent sdk & will manus ai invade usa? on goodpods" style="width: 225px" /> </a>

Copy