
EP96: Gemini Native Image Generation & Editing, OpenAI's Agent SDK & Will Manus AI Invade USA?
03/14/25 • 72 min
Join Simtheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Gemini Flash 2.0 Experimental Native Image Generation & Editing
27:55 - Thoughts on OpenAI's "New tools for building agents" announcement
43:31 - Why is everyone talking about MCP all of a sudden?
56:31 - Manus AI: Will Manus Invade the USA and Defeat it With Powerful AGI? (jokes)
----
Thanks for all of your support and listening!
Previous Episode

EP95: Why does GPT4.5 exist? Claude 3.7 Sonnet Has Arrived & Working with Claude Code Agent
Join Simtheory to try GPT-4.5: https://simtheory.ai
Dis Track: https://simulationtheory.ai/5714654f-0fbe-496f-8428-20018457c4c7
===
CHAPTERS:
00:00 - Reaction to GPT4.5 Live Stream + Release
12:45 - Claude 3.7 Sonnet Release: Reactions and First Week Impressions
45:58 - Claude 3.7 Sonnet Dis Track Test
56:10 - Claude Code First Impressions + Future Agent Workflows
1:15:45 - Chris's Veo2 Film Clip
1:24:49 - Alexa+ AI Assistant
1:34:05 - Claude 3.7 Sonnet BOOM FACTOR
Next Episode

EP97: Moore’s Law for AI agents, OpenAI's new audio models, o1-pro API & When Will AI Replace Us?
Create an AI workspace on Simtheory: https://simtheory.ai
---
Song: https://simulationtheory.ai/f6d643e4-4201-475c-aa82-8a96b6b3b215
---
CHAPTERS:
00:00 - OpenAI's audio model updates: gpt-4o-transcribe, gpt-4o-mini-tts
18:39 - Strategy of AI Labs with Agent SDKs and Model "stacks" and limitations of voice
25:28 - Cost of models, GPT-4.5, o1-pro API release thoughts
31:57 - o1-pro "I am rich" track & Chris's o1-pro PR stunt realization, more thoughts on o1 family
48:39 - Moore’s Law for AI agents, current AI workflows and future enterprise agent workflows & AI agent job losses
1:24:09 - Can we control agents?
1:29:21 - Final thoughts for the week
1:35:15 - Full "I am rich" o1-pro track
---
See you next week and thanks for your support.
CORRECTION: Kosciusko is obviously not an Aboriginal name; I misspoke. Wagga Wagga and the other names in the voice clip are, and they are great ways to test AI text-to-speech models!