Log in

goodpods headphones icon

To access all our features

Open the Goodpods app
Close icon
Chain of Thought - AI in 2025: Agents & The Rise of Evaluation Driven Development

AI in 2025: Agents & The Rise of Evaluation Driven Development

01/15/25 • 33 min

Chain of Thought

"In the next three to five years, every piece of software that is built on this planet will have some sort of AI baked into it." - Atin Sanyal

Chain of Thought is back for its second season, and this episode dives headfirst into the possibilities AI holds for 2025 and beyond. Join Conor Bronson as he chats with Galileo co-founders Yash Sheth (COO) and Atindriyo Sanyal (CTO) about major trends to look for this year. These include AI finding its product "tool stack" fit, generation latency decreasing, AI agents, their potential to revolutionize code generation and other industries, and the crucial role of robust evaluation tools in ensuring the responsible and effective deployment of these agents.

Yash and Atin also highlight Galileo's focus on building trust and security in AI applications through scalable evaluation intelligence. They emphasize the importance of quantifying application behavior, enforcing metrics in production, and adapting to the evolving needs of AI development.

Finally, they discuss Galileo's vision for the future and their active pursuit of partnerships in 2025 to contribute to a more reliable and trustworthy AI ecosystem.

Chapters: 00:00 AI Trends and Predictions for 2025

02:55 Advancements in LLMs and Code Generation

05:16 Challenges and Opportunities in AI Development

10:40 Evaluating AI Agents and Applications

16:07 Building Evaluation Intelligence

23:41 Research Opportunities

29:50 Advice for Leveraging AI in 2025

32:00 Closing Remarks

Show Notes:

plus icon
bookmark

"In the next three to five years, every piece of software that is built on this planet will have some sort of AI baked into it." - Atin Sanyal

Chain of Thought is back for its second season, and this episode dives headfirst into the possibilities AI holds for 2025 and beyond. Join Conor Bronson as he chats with Galileo co-founders Yash Sheth (COO) and Atindriyo Sanyal (CTO) about major trends to look for this year. These include AI finding its product "tool stack" fit, generation latency decreasing, AI agents, their potential to revolutionize code generation and other industries, and the crucial role of robust evaluation tools in ensuring the responsible and effective deployment of these agents.

Yash and Atin also highlight Galileo's focus on building trust and security in AI applications through scalable evaluation intelligence. They emphasize the importance of quantifying application behavior, enforcing metrics in production, and adapting to the evolving needs of AI development.

Finally, they discuss Galileo's vision for the future and their active pursuit of partnerships in 2025 to contribute to a more reliable and trustworthy AI ecosystem.

Chapters: 00:00 AI Trends and Predictions for 2025

02:55 Advancements in LLMs and Code Generation

05:16 Challenges and Opportunities in AI Development

10:40 Evaluating AI Agents and Applications

16:07 Building Evaluation Intelligence

23:41 Research Opportunities

29:50 Advice for Leveraging AI in 2025

32:00 Closing Remarks

Show Notes:

Previous Episode

undefined - Now is the Time to Build | Weaviate’s Bob van Luijt

Now is the Time to Build | Weaviate’s Bob van Luijt

"This is the time. This is the time to start building... I can't say that often enough. This is the time." - Bob van Luijt

Join Bob van Luijt, CEO and co-founder of Weaviate as he sits down with our host Conor Bronson for the Season 2 premiere of Chain of Thought. Together, they explore the ever-evolving world of AI infrastructure and the evolution of Retrieval-Augmented Generation (RAG) architecture.

Bob's journey with Weaviate offers a compelling example of how to adapt to rapid changes in the AI landscape. He discusses the importance of understanding developer needs and building AI-native solutions, emphasizing the potential of generative feedback loops and agent architectures to revolutionize data management.

Chapters: 00:00 Welcome to Season 2

1:43 The Evolution of AI Infrastructure

04:13 Navigating Rapid Changes in AI

07:39 Generative Feedback Loops and AI Native Databases

13:26 Challenges and Opportunities in AI Production

19:03 The Importance of Documentation and Developer Experience

27:13 Future Predictions and Paradigm Shifts in AI

31:17 Final Thoughts and Encouragement to Build

Follow:

Conor Bronsdon: ⁠https://www.linkedin.com/in/conorbronsdon/⁠

Bob van Luijt: ⁠https://www.linkedin.com/in/bobvanluijt/

Weaviate: https://www.linkedin.com/company/weaviate-io/

Show notes: Learn more about Weaviate: https://weaviate.io/

Next Episode

undefined - AI, Open Source & Developer Safety | Block’s Rizel Scarlett

AI, Open Source & Developer Safety | Block’s Rizel Scarlett

As DeepSeek so aptly demonstrated, AI doesn’t need to be closed source to be successful.

This week, Rizel Scarlett, a Staff Developer Advocate at Block, joins Conor Bronsdon to discuss the intersections between AI, open source, and developer advocacy. Rizel shares her journey into the world of AI, her passion for empowering developers, and her work on Block's new AI initiative, Goose, an on-machine developer agent designed to automate engineering tasks and enhance productivity.

Conor and Rizel also explore how AI can enable psychological safety, especially for junior developers. Building on this theme of community, they also dive into topics such as responsible AI development, ethical considerations in AI, and the impact of community involvement when building open source developer tools.

Chapters: 00:00 Rizel's Role at Block 02:41 Introducing Goose: Block's AI Agent 06:30 Psychological Safety and AI for Developers 11:24 AI Tools and Team Dynamics 17:28 Open Source AI and Community Involvement 25:29 Future of AI in Developer Communities 27:47 Responsible and Ethical Use of AI 31:34 Conclusion Follow Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/

Rizel Scarlett

LinkedIn: https://www.linkedin.com/in/rizel-bobb-semple/ Website: https://blackgirlbytes.dev/

Show Notes

Learn more about Goose: https://block.github.io/goose/

Episode Comments

Generate a badge

Get a badge for your website that links back to this episode

Select type & size
Open dropdown icon
share badge image

<a href="https://goodpods.com/podcasts/chain-of-thought-601485/ai-in-2025-agents-and-the-rise-of-evaluation-driven-development-81954148"> <img src="https://storage.googleapis.com/goodpods-images-bucket/badges/generic-badge-1.svg" alt="listen to ai in 2025: agents & the rise of evaluation driven development on goodpods" style="width: 225px" /> </a>

Copy