Absortio

Email → Summary → Bookmark → Email

Excerpt

I used to write more

Summary

Main Summary

The article details an evolved, highly structured methodology for AI-assisted software development, built primarily around Claude Code. The approach stresses a disciplined workflow that combines project isolation via git worktree, an iterative design and planning process with the AI, and a distinctive implementation and code-review phase. The author has refined his interactions with Claude to avoid "walls of text" and keep the human continuously engaged, splitting the AI's work into specific roles: an "architect" for design and review, and an "implementer" for execution. A central theme is active management of the AI's context to keep it focused, along with critical evaluation of automated code-review suggestions, turning the AI into a more reliable collaborator that is less prone to blindly applied mistakes.

Key Elements

  • "Worktree" methodology and Claude sessions: The process starts by isolating each task in a git worktree so that several projects can run in parallel against a single codebase. The primary tool is Claude Code, used in separate sessions with explicit context management to keep its performance sharp.
  • Iterative design and planning process: Dedicated "brainstorming" prompts turn ideas into detailed designs and specs, asking Claude to pose questions one at a time and limiting its responses to 200-300 words so the author actually reads and engages. A second "planning" prompt then produces an exhaustive implementation plan written for an "engineer with zero context", emphasizing practices like DRY, YAGNI, and TDD.
  • Dual AI roles ("architect" and "implementer"): The execution phase runs two concurrent Claude sessions. One acts as the "architect", responsible for reviewing and validating the design and plan, while the other acts as the "implementer", executing tasks in small batches and documenting progress. The human plays a PM-style role, copying and pasting reviews and questions between the sessions.
  • AI-assisted code review and meta-evaluation: For code review, CodeRabbit is integrated, and its suggestions are consolidated by a custom tool (coderabbit-review-helper) into a format the AI can digest. Crucially, a "role-play" prompt instructs Claude to critically evaluate the external reviewer (CodeRabbit), weighing the validity of each suggestion before applying it, which avoids blind changes.

Analysis and Implications

This methodology represents a significant step forward in orchestrating AI across the software development life cycle, promoting a highly structured, iterative workflow. Its emphasis on context management and on critically evaluating AI feedback has the potential to reshape best practices in AI-assisted coding, improving both quality and efficiency.

Additional Context

The author, writing in October 2025, presents this methodology as a recent personal evolution, underscoring the importance of continually adapting how AI coding tools are put to use.

Content

[Eagle-eyed readers will note that, as I write this, it's October 2025. This post documents what I was doing up to a couple weeks ago. It's still good and I still recommend it.]

Since I last wrote at the beginning of the summer, my methodology for using AI coding assistants has evolved a bit. This is a point-in-time writeup of a flow that's been pretty effective for me.

I'm still primarily using Claude Code.

First up, this is my CLAUDE.md as of this writing. It encodes a bunch of process documentation and rules that do a pretty good job keeping Claude on track.

When I want to start a new task on an existing project, I try to always use a git worktree to isolate that work from other tasks. This is increasingly important for me, because I find myself frequently running 3-4 parallel projects on a single codebase.

To set up a worktree:

cd the-project
mkdir .worktrees # the first time
cd .worktrees
git worktree add some-feature-description
cd some-feature-description
npm install # or whatever the setup task for the project is
npm run lint
npm test # to make sure I'm starting from a clean baseline
claude
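
Once a feature branch has been merged, git's built-in worktree subcommands can tear the checkout back down. A minimal sketch, using the hypothetical paths from the setup above:

cd the-project
git worktree list # show every checkout attached to this repo
git worktree remove .worktrees/some-feature-description
git worktree prune # drop any stale administrative entries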

Once I've got Claude Code running, I use my "brainstorming" prompt:

I've got an idea I want to talk through with you. I'd like you to help me turn it into a fully formed design and spec (and eventually an implementation plan)
Check out the current state of the project in our working directory to understand where we're starting off, then ask me questions, one at a time, to help refine the idea. 
Ideally, the questions would be multiple choice, but open-ended questions are OK, too. Don't forget: only one question per message.
Once you believe you understand what we're doing, stop and describe the design to me, in sections of maybe 200-300 words at a time, asking after each section whether it looks right so far.

That last bit is particularly critical. I find that AI models are especially prone to handing me walls of text when they think they're "done". And I'm prone to just tuning out a bit and thinking "it's probably fine" when confronted with a wall of text written by an agent. By telling Claude to limit its output to a couple hundred words at a time, I'm more likely to actually read and engage.

Once we've walked through the brainstorming process, I usually have a much clearer idea of what I'm doing, as does Claude. Claude will write the design out into docs/plans/ somewhere.

It often wants to leap right into an implementation, but that's not how I want it to work. Sometimes it tries to start writing code before I can stop it. If it does, I hit escape a couple times and rewind the conversation a bit to catch it. Recent updates to my CLAUDE.md reduce that tendency significantly.

The next step is the planning process. Here's the planning prompt I've been using:

Great. I need your help to write out a comprehensive implementation plan.

Assume that the engineer has zero context for our codebase and questionable taste. document everything they need to know. which files to touch for each task, code, testing, docs they might need to check. how to test it. give them the whole plan as bite-sized tasks. DRY. YAGNI. TDD. frequent commits.

Assume they are a skilled developer, but know almost nothing about our toolset or problem domain. assume they don't know good test design very well.

please write out this plan, in full detail, into docs/plans/

This results in a plan that breaks everything down into tiny little steps with clear instructions and tightly packed context for each step. That means that at execution time, I usually don't need to provide tight step-by-step oversight.

Next up, I open a new tab or window in the same working directory and fire up another copy of claude. I tell it something like Please read docs/plans/this-task-plan.md and <whatever we named the design doc>. Let me know if you have questions.
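
In shell terms, assuming the worktree layout from the setup above, that second session is just another claude instance launched from the same directory:

cd the-project/.worktrees/some-feature-description # second terminal tab, same worktree
claude # this copy becomes the "implementer"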

It will usually say that the plan is very well crafted. Sometimes it'll point out mistakes or inconsistencies. Putting on my PM hat, I'll then turn around and ask the "architect" session to clarify or update the planning doc.

Once we've sorted out issues with the plan, I'll tell the "implementer" Claude to Please execute the first 3-4 tasks. If you have questions, please stop and ask me. DO NOT DEVIATE FROM THE PLAN.

The implementer will chug along.

When it's done, I'll flip back to the "architect" session and tell it The implementer says it's done tasks 1-3. Please check the work carefully.

I'll play PM again, copying and pasting reviews and Q&A between the two sessions. Once the architect signs off, I'll tell the implementer to update the planning doc with its current state.

And then, I don't */compact*. Instead I */clear* the implementer and start the conversation over, telling it that it's starting with task 4.

When it's done with the next chunk of work, I flip back to the architect. I typically double-ESC to reset the architect to a previous checkpoint and tell it to review up to the now-current checkpoint. This reduces context bloat for the architect and gets it to look at the work again without any biases from the previous implementation.

(I have friends who, instead of using multiple sessions, swear that just asking the implementer to look at their most recent work with fresh eyes is good enough. And indeed, using that magic phrase seems to be pretty powerful. I still think that having two different actors is better.)

When the implementer is finally done with the work and the architect has signed off on the work, I ask the implementer to push up to GitHub and create a pull request.
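
The implementer runs the equivalent of this itself, but by hand it is roughly the following, reusing the hypothetical branch name from earlier and GitHub's gh CLI:

git push -u origin some-feature-description
gh pr create --fill # title and body filled in from the branch's commits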

That kicks off a CodeRabbit code review. I generally find that CodeRabbit's reviews are very good at catching nits and logic issues, but sometimes fall short on understanding the project's real design intent or constraints. That leads to CodeRabbit making bad suggestions.

CodeRabbit's reviews provide prompts for AI agents to fix issues, but actually getting all those prompts back to your coding agent can be a pain, because you need to copy them one by one and they only provide prompts for some types of issues. To help solve this, I built coderabbit-review-helper. It digs through all the different types of CodeRabbit review comments and formats them as a big wall of text for your coding agent to chew through.

The only problem with tools like this is that our robot buddies are quite credulous. If you paste in a list of instructions for how to update a codebase, Claude's just going to take you at your word and make the changes, even if what you're asking for is crazy and wrong.

My best current technique for avoiding this is a bit of role-play that gives the coding agent a reason not to blindly trust the code review. Every review gets prefixed with this chunk of text:

A reviewer did some analysis of this PR. They're external, so reading the codebase cold. This is their analysis of the changes and I'd like you to evaluate the analysis and the reviewer carefully.

1) should we hire this reviewer
2) which of the issues they've flagged should be fixed?
3) are the fixes they propose the correct ones?

Anything we *should* fix, put on your todo list.
Anything we should skip, tell me about now.

CodeRabbit "reviewers" typically get a 'Strong hire' review, but it's not unheard of for Claude to report that the reviewer "seems quite technically adept, but didn't take the time to understand our project and made a number of suggestions that are wrong. No hire."

If you decide to try out this methodology or have come up with something else that works even better for you, please drop me a line at jesse@fsck.com.