This is a pile of garbage. Your job is to find the gold.
Series
Engineering LLM agents from problem framing to production monitoring. (Coming soon)
From REINFORCE to GRPO to agents — an eval researcher's map through RL. (Coming soon · 7 posts)
A summary of our preprint. We measure persona collapse across 10 LLMs and 1,144 personas, and show that better per-persona fidelity often makes population diversity worse.
A summary of our position paper. We argue AI welfare assessment fails for two structural reasons: indicators are co-engineered with the systems they evaluate, and there is no external validation channel that can falsify them.
The first design decision is not how smart the system should be, but where uncertainty, agency, and accountability sit. A framework for making that choice deliberately.
96 papers across algorithms, rewards, preferences, systems, and agents — organized into 5 categories with reading depth recommendations.
An NLP evaluation researcher's honest map through the algorithmic, reward, and systems landscape of reinforcement learning for language models.