PodXiv: The latest AI papers, decoded in 20 minutes.

By: AI Podcast

About this listen

This podcast delivers sharp, daily breakdowns of cutting-edge research in AI, perfect for researchers, engineers, and AI enthusiasts. Each episode cuts through the jargon to unpack key insights, real-world impact, and what’s next. This podcast is purely for learning purposes and will never be monetized; it's run by research volunteers like you! Questions? Write me at: airesearchpodcasts@gmail.com
Episodes
  • (LLM Code-Salesforce) CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
    Jul 5 2025

    Welcome to our podcast! Today, we're exploring CodeTree, a groundbreaking framework developed by researchers at The University of Texas at Austin and Salesforce Research. CodeTree revolutionises code generation by enabling Large Language Models (LLMs) to efficiently navigate the vast coding search space through an agent-guided tree search. This innovative approach employs a unified tree structure for explicitly exploring coding strategies, generating solutions, and refining them.

    At its core, CodeTree leverages dedicated LLM agents: the Thinker for strategy generation, the Solver for initial code implementation, and the Debugger for solution improvement. A Critic agent dynamically guides the exploration by evaluating nodes, verifying solutions, and deciding whether to refine, abort, or accept a solution. This multi-agent collaboration, combined with environmental and AI-generated feedback, has led to significant performance gains across diverse coding benchmarks, including HumanEval, MBPP, CodeContests, and SWEBench.

    However, CodeTree's effectiveness hinges on LLMs with strong reasoning abilities; smaller models may struggle with its complex instruction-following roles, potentially leading to misleading feedback. The framework currently prioritises functional correctness, leaving aspects like code readability or efficiency for future enhancements. Despite these limitations, CodeTree offers a powerful paradigm for automated code generation, demonstrating remarkable search efficiency, even with limited generation budgets.
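The agent loop described above can be sketched in a few lines. This is a hypothetical, heavily simplified illustration: the `thinker`, `solver`, `debugger`, and `critic` functions below are stubs standing in for the LLM agent calls the paper describes, and the search logic is a minimal breadth-first expansion, not the paper's actual tree policy.

```python
def thinker(task):
    # Propose candidate strategies for the task (stub for an LLM call).
    return [f"strategy-{i} for {task}" for i in range(2)]

def solver(strategy):
    # Generate an initial solution from a strategy (stub for an LLM call).
    return f"solution({strategy})"

def debugger(solution):
    # Refine a failing solution using feedback (stub for an LLM call).
    return solution + "+fix"

def critic(solution, tests_pass):
    # Decide whether to accept, refine, or abort this node.
    if tests_pass(solution):
        return "accept"
    return "refine" if "+fix" not in solution else "abort"

def codetree_search(task, tests_pass):
    # Explore strategies as tree branches; each node is solved, evaluated,
    # and either accepted, refined (child node), or abandoned.
    frontier = [solver(s) for s in thinker(task)]
    while frontier:
        node = frontier.pop(0)
        verdict = critic(node, tests_pass)
        if verdict == "accept":
            return node
        if verdict == "refine":
            frontier.append(debugger(node))
    return None
```

In the real framework, `tests_pass` corresponds to execution feedback from the coding environment, and the Critic's verdict is itself model-generated rather than rule-based.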

    Paper link: https://arxiv.org/pdf/2411.04329

    19 mins
  • (FM-NVIDIA) Fugatto: Foundational Generative Audio Transformer Opus 1
    Jul 3 2025

    This episode covers Fugatto, a new generalist audio synthesis and transformation model developed by NVIDIA, and ComposableART, an inference-time technique designed to enhance its capabilities. Fugatto distinguishes itself by its ability to follow free-form text instructions, often with optional audio inputs, addressing the challenge that audio data, unlike text, typically lacks inherent instructional information. The paper details a comprehensive data and instruction generation strategy that leverages large language models (LLMs) and audio understanding models to create diverse and rich datasets, enabling Fugatto to handle a wide array of tasks including text-to-speech, text-to-audio, and audio transformations. Furthermore, ComposableART enables compositional abilities, such as combining, interpolating, or negating instructions, providing fine-grained control over audio outputs beyond the training distribution. The paper presents experimental evaluations demonstrating Fugatto's competitive performance against specialised models and highlights its emergent capabilities, such as synthesising novel sounds or performing tasks not explicitly trained for.
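The composition idea can be illustrated with a toy sketch. This is an assumption-laden simplification: real ComposableART combines model guidance (score) functions during generation, not fixed vectors, and the instruction names and weights below are invented for illustration.

```python
def compose_guidance(guidances, weights):
    # Weighted sum of per-instruction guidance vectors: positive weights
    # combine instructions, fractional weights interpolate between them,
    # and negative weights approximate negating an instruction.
    dim = len(guidances[0])
    return [sum(w * g[i] for g, w in zip(guidances, weights)) for i in range(dim)]

# Hypothetical guidance vectors for two instructions, with "rain" negated:
cat_meowing = [1.0, 0.0, 0.5]
rain = [0.0, 1.0, 0.2]
mix = compose_guidance([cat_meowing, rain], [1.0, -0.5])
```

The appeal of doing this at inference time is that novel instruction combinations need no additional training data.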

    Paper link: https://d1qx31qr3h6wln.cloudfront.net/publications/FUGATTO.pdf

    18 mins
  • (LLM Application-NVIDIA) Small Language Models: The Future of Agentic AI
    Jul 3 2025

    The provided text argues that small language models (SLMs) are the future of agentic AI, positioning them as more economical and operationally suitable than large language models (LLMs) for the majority of tasks within AI agents. While LLMs excel at general conversations, agentic systems frequently involve repetitive, specialised tasks where SLMs offer advantages like lower latency, reduced computational requirements, and significant cost savings. The authors propose a shift to heterogeneous systems, where SLMs handle routine functions and LLMs are used sparingly for complex reasoning. The document also addresses common barriers to SLM adoption, such as existing infrastructure investments and popular misconceptions, and outlines a conversion algorithm for migrating agentic applications from LLMs to SLMs.
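The heterogeneous design the authors advocate amounts to a routing layer in front of the models. The sketch below is hypothetical: the complexity heuristic, threshold, and model names are illustrative stand-ins, not details from the paper, which instead describes a principled conversion algorithm.

```python
def estimate_complexity(prompt):
    # Toy proxy for task complexity: longer, question-heavy prompts
    # are treated as requiring more reasoning.
    return len(prompt.split()) + 10 * prompt.count("?")

def route(prompt, threshold=20):
    # Send routine, repetitive agent calls to a cheap small model;
    # reserve the large model for complex reasoning.
    if estimate_complexity(prompt) < threshold:
        return "slm-small"   # low latency, low cost
    return "llm-large"       # used sparingly
```

In a production agent, the router would more likely use a learned classifier or task metadata than a word count, but the cost structure is the same: most calls never touch the large model.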

    Paper link: https://arxiv.org/pdf/2506.02153

    22 mins