Best AI papers explained

Un pódcast de Enoch H. Kang

550 Episodo

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Publicado: 23/5/2025
LLM In-Context Learning as Kernel Regression
Publicado: 23/5/2025
Personalizing LLMs via Decode-Time Human Preference Optimization
Publicado: 23/5/2025
Almost Surely Safe LLM Inference-Time Alignment
Publicado: 23/5/2025
Survey of In-Context Learning Interpretation and Analysis
Publicado: 23/5/2025
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Publicado: 23/5/2025
LLM In-Context Learning as Kernel Regression
Publicado: 23/5/2025
Where does In-context Learning Happen in Large Language Models?
Publicado: 23/5/2025
Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting
Publicado: 22/5/2025
metaTextGrad: Learning to learn with language models as optimizers
Publicado: 22/5/2025
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing
Publicado: 22/5/2025
Isolated Causal Effects of Language
Publicado: 22/5/2025
Sleep-time Compute: Beyond Inference Scaling at Test-time
Publicado: 22/5/2025
J1: Incentivizing Thinking in LLM-as-a-Judge
Publicado: 22/5/2025
ShiQ: Bringing back Bellman to LLMs
Publicado: 22/5/2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Publicado: 22/5/2025
Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models
Publicado: 22/5/2025
End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
Publicado: 21/5/2025
TEXTGRAD: Automatic Differentiation via Text
Publicado: 21/5/2025
Steering off Course: Reliability Challenges in Steering Language Models
Publicado: 20/5/2025

17 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

550 Episodo

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

LLM In-Context Learning as Kernel Regression

Personalizing LLMs via Decode-Time Human Preference Optimization

Almost Surely Safe LLM Inference-Time Alignment

Survey of In-Context Learning Interpretation and Analysis

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

LLM In-Context Learning as Kernel Regression

Where does In-context Learning Happen in Large Language Models?

Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting

metaTextGrad: Learning to learn with language models as optimizers

Semantic Operators: A Declarative Model for Rich, AI-based Data Processing

Isolated Causal Effects of Language

Sleep-time Compute: Beyond Inference Scaling at Test-time

J1: Incentivizing Thinking in LLM-as-a-Judge

ShiQ: Bringing back Bellman to LLMs

Policy Learning with a Natural Language Action Space: A Causal Approach

Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models

End-to-End Learning for Stochastic Optimization: A Bayesian Perspective

TEXTGRAD: Automatic Differentiation via Text

Steering off Course: Reliability Challenges in Steering Language Models