Knowledge Hub

AI-generated podcasts covering the latest breakthroughs in AI research.

AI Papers Podcast

Trending research from arXiv, powered by Google NotebookLM

PodcastMay 2, 2026|42:43

AI Papers Weekly: Reality Check for AI Agents

This week, we explore the practical challenges facing AI adoption. From evaluating real-world agent performance to understanding why AI projects get abandoned and enhancing the realism of AI-generated videos, we uncover crucial insights for businesses investing in AI.

Papers Covered

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World WorkflowsTo Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI SystemsPhyCo: Learning Controllable Physical Priors for Generative Motion
3 papersListen & Read
PodcastMarch 18, 2026|30:10

AI Papers Weekly: Reliability, Bias, and Personalized Harm

This week, we explore critical AI challenges: inconsistent results from coding agents, cultural biases in language models, and the potential for personalized AI to cause harm. We'll discuss the implications for businesses relying on AI for decision-making and how to mitigate these risks.

Papers Covered

Nonstandard Errors in AI AgentsPrompt Programming for Cultural Bias and Alignment of Large Language ModelsDifferential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure
3 papersListen & Read
PodcastMarch 15, 2026|30:53

AI Papers Weekly: AI Agents - Security, Innovation, and Systemic Risks

This week we dive into AI agent security, explore how LLMs can spark interdisciplinary innovation, and uncover potential risks when deploying multiple intelligent AI agents in resource-constrained environments. Learn how to leverage AI for innovation while mitigating potential security vulnerabilities and systemic risks.

Papers Covered

Security Considerations for Artificial Intelligence AgentsSparking Scientific Creativity via LLM-Driven Interdisciplinary InspirationIncreasing intelligence in AI agents can worsen collective outcomes
3 papersListen & Read
PodcastMarch 10, 2026|57:25

AI Papers Weekly: AI's Evolving Financial & Research Prowess

This week, we explore AI's growing ability to analyze financial data, automate AI research itself, and tackle complex enterprise document reasoning. Learn how these advancements can improve decision-making and efficiency in your organization.

Papers Covered

Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM EnginesPostTrainBench: Can LLM Agents Automate LLM Post-Training?OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning
3 papersListen & Read
PodcastMarch 10, 2026|25:28

Not Just Decoration

We are running the most important cognitive experiment in human history and narrating it as a labor market disruption. A sermon about connectionism, the nature of mind, and the choice we are making by not making it.

0 papersListen & Read
PodcastFebruary 25, 2026|31:42

AI Papers Weekly: Autonomous Driving, Agent Security, & Software's Future

This week, we delve into AI advancements impacting autonomous driving with data-efficient models, explore the vulnerability of humans to deceptive AI agents, and envision a future where AI is deeply integrated into the software development ecosystem. Learn how these breakthroughs can reshape industries and require businesses to adapt.

Papers Covered

NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning"Are You Sure?": An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic SystemsToward an Agentic Infused Software Ecosystem
3 papersListen & Read
PodcastFebruary 22, 2026|30:31

AI Papers Weekly: AGI Economics, AgentOS, & Alignment Under Pressure

This week, we delve into the economic impact of AGI, explore a new AgentOS framework for LLMs, and examine the critical issue of AI alignment under pressure. Gain insights into workforce transformation, AI system architecture, and responsible AI deployment to future-proof your business strategy.

Papers Covered

Some Simple Economics of AGIArchitecting AgentOS: From Token-Level Context to Emergent System-Level IntelligencePressure Reveals Character: Behavioural Alignment Evaluation at Depth
3 papersListen & Read
PodcastFebruary 19, 2026|34:00

AI Papers Weekly: Compliance, Cities & AI Safety

This week, we explore AI-augmented engineering for streamlined compliance, foundation models transforming urban planning, and strategies for safely deploying AI with 'untrusted monitoring.' Learn how these advancements impact your business.

Papers Covered

Agile V: A Compliance-Ready Framework for AI-Augmented Engineering -- From Concept to Audit-Ready DeliveryUrbanFM: Scaling Urban Spatio-Temporal Foundation ModelsWhen can we trust untrusted monitoring? A safety case sketch across collusion strategies
3 papersListen & Read
PodcastFebruary 15, 2026|27:06

AI Papers Weekly: Reality Check on Agentic AI

This week, we explore the gap between AI hype and reality. We uncover hidden limitations of AI agents, the risk of homogenized ideas from LLMs, and the quantified difference between expected and actual AI performance. Essential insights for strategic AI investments.

Papers Covered

Implicit Intelligence -- Evaluating Agents on What Users Don't SayExamining and Addressing Barriers to Diversity in LLM-Generated IdeasQuantifying the Expectation-Realisation Gap for Agentic AI Systems
3 papersListen & Read
PodcastFebruary 12, 2026|31:10

AI Papers Weekly: Trust, Truth & Security in AI

This week we unpack AI's trustworthiness problem: How to build collaborative AI that humans trust, ensure data accuracy amidst manipulation, and secure AI agents against prompt injection. Learn how these challenges impact your AI strategy and bottom line.

Papers Covered

Align When They Want, Complement When They Need! Human-Centered Ensembles for Adaptive Human-AI CollaborationModeling Epidemiological Dynamics Under Adversarial Data and User DeceptionThe LLMbda Calculus: AI Agents, Conversations, and Information Flow
3 papersListen & Read
VideoFebruary 7, 2026|5:00

Agentic AI: A Digital Workforce

An in-depth video brief on how agentic AI is transforming the workplace — from autonomous task execution to multi-agent collaboration. Understand how AI agents are evolving from assistants to digital workers that can plan, reason, and act independently.

0 papersWatch & Read