
#116: AI Agents, MCP and the problems with AI benchmarks | ft. Matt Carey
About this listen
In this episode, I spoke with Matt Carey, founding AI engineer at StackOne, founder of AI Demo Days and member of the OpenUK AI Advisory Board.
Everyone needs a friend who works in AI to help them filter the AI news and separate the signal from the noise. Matt is that friend for me!
We discussed AI agents, MCP, and the challenges of AI benchmarks. Those challenges help explain the disconnect between benchmark results and the anecdotal experiences of AI users such as myself.
Links from the episode:
- Google's whitepaper on AI agents
- Anthropic: Building Effective AI Agents
- Simon Willison on X
- Thorsten Ball's Joy & Curiosity newsletter
- AI Demo Days
- MCP has a prompt injection problem
Opening theme song:
Cheery Monday by Kevin MacLeod
Link: https://incompetech.filmmusic.io/song/3495-cheery-monday
License: http://creativecommons.org/licenses/by/4.0