Best AI papers explained

Un pódcast de Enoch H. Kang

550 Episodo

Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT
Publicado: 2/5/2025
Self-Consuming Generative Models with Curated Data
Publicado: 2/5/2025
Bootstrapping Language Models with DPO Implicit Rewards
Publicado: 2/5/2025
DeepSeek-Prover-V2: Advancing Formal Reasoning
Publicado: 1/5/2025
THINKPRM: Data-Efficient Process Reward Models
Publicado: 1/5/2025
Societal Frameworks and LLM Alignment
Publicado: 29/4/2025
Risks from Multi-Agent Advanced AI
Publicado: 29/4/2025
Causality-Aware Alignment for Large Language Model Debiasing
Publicado: 29/4/2025
Reward Models Evaluate Consistency, Not Causality
Publicado: 28/4/2025
Causal Rewards for Large Language Model Alignment
Publicado: 28/4/2025
Sycophancy to subterfuge: Investigating reward-tampering in large language models
Publicado: 28/4/2025
Bidirectional AI Alignment
Publicado: 28/4/2025
Why Do Multi-Agent LLM Systems Fail?
Publicado: 27/4/2025
LLMs as Greedy Agents: RL Fine-tuning for Decision-Making
Publicado: 27/4/2025
LLM Feedback Loops and the Lock-in Hypothesis
Publicado: 27/4/2025
Representational Alignment Drives Effective Teaching and Learning
Publicado: 27/4/2025
Adaptive Parallel Reasoning with Language Models
Publicado: 27/4/2025
AI: Rewiring the Flow of Ideas and Human Knowledge
Publicado: 27/4/2025
Learning and Equilibrium with Ranking Feedback
Publicado: 27/4/2025
Designing Human-AI Collaboration: A Sufficient-Statistic Approach
Publicado: 27/4/2025

21 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.