21 May 2026
Planet Python
The Python Coding Stack: How I Learn (2026 Version) • My Tutor Agent
I know how I like to learn new things. Over the years, I figured out what works for me and what doesn't. If you read my articles or attend my courses, then you know how I like to learn since I teach in the same way.
The challenge when learning something new is finding resources that are just right for me. And that's not easy. I know I can learn things better and quicker with resources that fit my style well, but you can't always find these resources.
I recently got particularly annoyed learning about the biomechanics of sprinting - I do have non-Python interests, yes - because all three textbooks I read, and lots of the online writing in this field, are just, let's say, not great.
But I now found the solution.
After many decades of learning in the same way, I have now upgraded how I learn thanks to my new tutor, Priya.
Yes, I gave her a name. No, she's not a real person. Priya is my personalised tutor agent. I'll tell you all about her below.
And you'll experience her teaching, too (not on the Python articles, though, I'll keep writing those the old-fashioned way.) I'll tell you more about this below, too, but let me first tell you why this works for me.
My Tutor, My Style
I've been thinking about the way I learn and teach for many years, from way back when I was a young University lecturer faced with 120 students in a lecture hall. I wasn't that much older than the students, but I learnt fast. And they liked my teaching (I even have awards to prove it!)
More recently, I've been writing a lot. I wrote articles here on The Python Coding Stack and elsewhere. I wrote a Python textbook. I even wrote about learning and technical writing in Breaking the Rules: the substack and the book.
All this meant that I could ask my freshly-spawned agent to spend a bit of time reading what I wrote to understand how I teach, which is how I like to learn. Priya analysed the techniques I use in my writing and understood my motivations for doing what I do through my technical writing texts.
Then, Priya and I had a good chat to refine ideas, to make sure she captured the essence of "my style".
And since Priya is an AI agent, "my style" became her knowledge base. This knowledge now lives in several lengthy markdown files and is summarised in shorter context packs and an index to ensure Priya's short-term memory (the context window) isn't overwhelmed.
Then I was ready to go. Any topic I wanted to learn, large or small, I could ask Priya to research it thoroughly, creating a new set of knowledge files, this time specific to the topic she needed to teach rather than my learning style. And then, she's ready to teach me.
And it worked. The stuff she prepared was exactly the way I like it.
The Tutor-Student Conversation Course
And here's the format I settled on (for now). Once the agent completes her research about the topic I want to learn, I ask her to plan a course spanning several modules.
But here's the refinement loop that makes the real difference:
-
I ask Priya to draft the first module. She writes this in a markdown file.
-
I read through her draft and leave comments and questions directly within the text.
-
Priya reads my questions and revises the text to address my questions. (But read on to find out more about the two categories of comments/questions I leave for her.)
-
Repeat steps 2 and 3 until I feel I understand the topic.
-
Move on to the next module and repeat steps 1 to 4.
This is a human-in-the-loop approach to creating the learning material. Yes, Priya is trained in my way of learning and teaching and in my writing style. But I'm actively having a conversation with her within the text.
This is equivalent to raising your hand in a lesson and asking the teacher a question. A good teacher will then revise how they present the material to address your question.
Priya's learning materials are just like that. In fact, I will take credit for her output. Sure, I'm not an expert in the subject matter she's teaching me - that's the whole point, right? But the output reflects my views and ideas about teaching and includes my questions and queries as I tried to understand and master the topic.
This is a collaboration. Priya and I are co-authors, even though Priya did most of the "writing".
I tried this approach on several topics, but there are two I want to share with you. I'm setting up two new sections here on The Python Coding Stack, which I'll use to learn these two topics in public. I'll publish the "transcripts" of the conversations Priya and I are having. It's mostly Priya doing the talking, but my questions are there, too.
The first topic I'm learning in public with Priya's help is Agentic AI. It's very meta to use agentic AI to learn about agentic AI! I'll publish an introduction and the first module in the coming days in the new section here on The Python Coding Stack called Agents Unpacked. You can already see this section in the menu on the homepage.
I'll set up another section to deal with the second topic in a week or so. No spoilers for now except to say it's directly related to programming but it's distinct from the articles I publish in the main section on The Python Coding Stack and in The Club.
By the way, you'll be able to select which sections you want to receive regularly by email. So if you're interested in my Python core content but not in these other topics, you can pick and choose what to opt out of. You can always go to The Python Coding Stack to read the other sections, of course.
How Priya and I Create These "Courses"
But let me expand on how Priya - my tutor agent - and I created these courses. [Incidentally, those are my em-dashes - I use them often and have always done. Commas would be ambiguous in that context!]
I provide two types of questions or comments to my agent as I read through the drafts: private and public.
Private Questions and Comments
When Priya reads the private questions or comments, she makes changes to the text, but then she deletes my input. So, you won't see my intervention explicitly in these cases. However, Priya's text reflects my thoughts. My interventions guide Priya. This type of intervention is similar to an editor's role, but I'm intervening as a learner more than as an editor.
Public Questions and Comments
However, when Priya comes across a comment or question I mark as public, she leaves it in the text, acknowledges the question, and answers it directly. So, you'll see my public questions in the text. Priya and I decided not to include too many of these public questions to keep the text flowing. However, I think it's beneficial to see some of my interventions. My questions may also be your questions.
More Learning. More Articles. More Fun
As with everything to do with AI, this is all very new. It's a work in progress. I may refine and revise how I interact with my agent. But it's been fun learning this way, and I hope you enjoy reading my interactions with Priya and you find it useful, too.
To state the obvious, the posts I'll publish in these two new sections are mostly AI-generated content. If you read this far, then you won't be surprised by that statement. A year ago, I would never have thought I'd publish anything written by AI. But a year is a long time in the AI world. And this AI content reflects me and my thinking. The agent is my mentee - someone I trained to teach the way I do, to write the way I do. But she's also my tutor, teaching me new stuff.
So there's a lot of "me" in what you read, even if it's mostly written by Priya!
The posts in the main section of The Python Coding Place and those in The Club (for premium subscribers) won't change. They're still my writing from beginning to end. Every word and letter you read in those posts is the result of nerve signals going from my brain to my fingers, which tap keys on a keyboard. In this era of AI doing a lot of work for us, I think it's more important than ever for me to keep using my pre-AI skills. Otherwise, my brain will atrophy, and I don't want that!
So, in summary, there will soon be four sections here on The Stack:
-
The main area in The Python Coding Stack - no change here, you'll get the same type of Python articles you've been reading for the past 3+ years
-
The Club - the extra Python posts for premium subscribers
-
Agents Unpacked - the Agentic AI course Priya and I are creating for me to learn all about this agentic stuff. Learn with me (and Priya) if you're interested.
-
Mystery Fourth Section - Stay tuned, you won't have to wait long. This is also a Priya-Stephen collaboration.
Next post will be the introduction and first section in Agents Unpacked. Soon after, I have another Python post I'm planning for you.
21 May 2026 9:23pm GMT
Kevin Renskers: uv is fantastic, but its package management UX is a mess
Astral's uv has taken the Python world by storm, and for good reason. It is blisteringly fast, handles Python versions with ease, and replaces a half-dozen tools with a single binary. I've written multiple articles about it before.
Getting started with a new Python project using uv and adding your first dependencies is very easy. But once you move past the initial setup and into the maintenance phase of a project, i.e. checking for outdated packages and performing routine upgrades, the CLI starts to feel surprisingly clunky compared to its peers like pnpm or Poetry.
Finding outdated packages
In my JavaScript projects, if I want to see what needs an update, I run:
$ pnpm outdated
This gives a clean, concise list of outdated packages, their current version, the latest version, and the version allowed by your constraints.
In uv, there is no uv outdated. Instead, you have to memorize the following mouthful:
$ uv tree --outdated --depth 1
The output is also a problem. It doesn't just show you what is outdated; it shows you your entire top-level dependency tree, with a small annotation next to the ones that have updates available. If you have 50 dependencies and only two are outdated, you still have to scan a 50-line list.
Poetry isn't much better with its command poetry show --outdated, but at least it only shows actual outdated packages.
Unsafe version constraints by default
This is the most significant philosophical departure uv takes from pnpm and Poetry, and it's a dangerous one for production stability.
How pnpm/Poetry handle it
When you add a package using pnpm add, it writes it to package.json using the caret requirement (^1.23.4). The caret at the beginning means that any 1.x.x version is allowed, but it will not update to 2.0.0.
Poetry does the same by default, using a format like >=1.23.4,<2.0.0. I find this less readable than ^1.23.4, but the effect is the same.
In both cases, updates are safe by default. You can run pnpm update or poetry update every morning and have high confidence that your build won't break due to a major API change (assuming the packages you depend on respect SemVer).
How uv handles it
When you run uv add pydantic, it inserts this into your pyproject.toml:
dependencies = [ "pydantic>=2.13.4", ]
Note the lack of an upper bound. In the eyes of uv, pydantic version 2, 3, and 100 are all perfectly acceptable.
This means uv updates are unsafe by default. If you run a bulk update, you aren't just getting bug fixes; you are opting into every breaking change published by every maintainer in your dependency graph.
The bad UX of the upgrade command
The commands to actually perform an update in uv feel like they were designed for machines rather than humans.
If you want to update everything in pnpm or Poetry, it's a simple pnpm update or poetry update command. In uv, you use:
$ uv lock --upgrade
THOUGHTS
Why isn't this simply uv update or uv upgrade? Who designed this command line interface? It's not uv lock --add or uv lock --remove either!
Because of the "no upper bounds" issue mentioned above, uv lock --upgrade is a nuclear option. It will upgrade every single package in your lockfile to their absolute latest versions, ignoring SemVer safety. And this includes deep, nested dependencies you've never heard of! Good luck, better hope there are no breaking changes anywhere.
Once you realize this is too risky, you'll want to upgrade only specific packages. After scouring the subpar output of uv tree --outdated --depth 1 to find them, the syntax becomes a repetitive chore.
How pnpm does it:
$ pnpm update pydantic httpx uvicorn
How uv does it:
$ uv lock --upgrade-package pydantic --upgrade-package httpx --upgrade-package uvicorn
Having to repeat the --upgrade-package flag for every single item is a huge hassle when you want to update a bunch of packages. I don't understand why the UX of uv's commands is so poor.
There is hope: the bounds flag
Luckily uv has recently introduced a --bounds option for uv add:
$ uv add pydantic --bounds major
This produces the safer pydantic>=2.13.4,<3.0.0 constraint we've come to expect. However, this is currently an opt-in feature. You have to remember to type it every time, and as of now, it is considered a preview feature.
Until --bounds major (or a similar configuration) becomes the default behavior, uv users are essentially forced to choose between two bad options:
- Manually edit
pyproject.tomlto add upper bounds for every single dependency. - Live in fear that
uv lock --upgradewill accidentally pull in a breaking major version change.
What I'd like to see
I love uv. Its speed is transformative, and the way it manages Python toolchains is second to none. But as a package manager, the developer experience for maintaining a project is currently a step backward from the tools that came before it.
We need a dedicated uv outdated command that filters noise, a more ergonomic update command that doesn't require repeating flags, and default version constraints that respect the sanity of Semantic Versioning.
Until then, I'll be double-checking every single line of my lockfile changes with a healthy dose of suspicion.
21 May 2026 6:08pm GMT
Real Python: Quiz: Context Managers and Using Python's with Statement
In this quiz, you'll test what you learned in the video course Context Managers and Using Python's with Statement.
By working through this quiz, you'll revisit how the with statement runs setup and teardown for you, how to use standard-library context managers like open(), and how to write your own context managers as classes or with the @contextmanager decorator.
[ Improve Your Python With 🐍 Python Tricks 💌 - Get a short & sweet Python Trick delivered to your inbox every couple of days. >> Click here to learn more and see examples ]
21 May 2026 12:00pm GMT
Django community aggregator: Community blog posts
Utrecht (NL) Python meetup summaries
I made summaries at the 4th PyUtrecht meetup (in Nieuwegein, at Qstars this time).
Qstars IT and open source - Derk Weijers
Qstars IT hosted the meeting. It is an infra/programming/consultancy/training company that uses lots of Python.
They also love open source and try to sponsor where possible.
One of the things they are going to open source (next week) is a "cable thermal model", a calculation method to determine the temperature of underground electricity cables. The Netherlands has a lot of net congestion... So if you can have a better grid usage by calculating the real temperature of cables instead of using an estimated temperature, you might be able to increase the load on the cable without hitting the max temperature. Coupled with "measurement tiles" that actually monitor the temperature.
They build it for one of the three big electricity companies in the Netherlands and got permission to open source it so that the other companies can also use it. They hope it will have real impact.
He explained an open source project he started personally: "the space devs". Integrating rocket launch data and providing an API. Now it has five core developers (and got an invitation to the biggest space conference, two years ago!)
Some benefits from writing open source:
- You build your own portfolio.
- You can try new technologies. Always nice to have the skill to learn new things.
- You improve your communication skills (both sending and receiving).
- You can make your own decisions.
- You write in the open.
- Perhaps you help others with your work.
- You could be part of a cummunity.
- It is your code.
How to start?
- Reach out to other communities.
- Read and improve documentation.
- Find good first issues.
- Be proactive.
- Don't be afraid to ask questions (and don't let negative comments discourage you).
When working on open source, make sure you take security serious. People nowadays like to use supply chain attacks via open source software. So use 2FA and look at your deployment procedure.
Learning Python with Karel - EiEi Tun H
What is Karel <https://github.com/alts/karel>)? A teaching tool/robot for learning programming. You need to steer a robot in an area and have it pick up or dump objects. And... in the meantime you learn how to use functions and loops.
Karel only has a turn_left() function. So if you want to have it turn right, it is handy to add a function for it:
def turn_right():
turn_left()
turn_left()
turn_left()
Simple, but you have to learn it sometime!
In her experience, AI can help a lot when learning to code: it explains stuff to you like you're a five-year-old, and that's perfect.
If you want to play with Karel: https://compedu.stanford.edu/karel-reader/docs/python/en/ide.html
JSON freedom or chaos; how to trust your data - Bart Dorlandt
For this talk, I'm pointing at the PyGrunn summary I made three weeks ago. I liked the talk!
Practical software architecture for Python developers - Henk-Jan van Hasselaar
There are several levels of architecture. Organization level. System level. Application, Code.
Cohesion: "the degree to which the elements inside a module belong together". What does it mean? Working towards the same goal or function. Together means something like distance. When two functions are in separate libraries, they're not together. It is also important for cognitive load.
Coupling: loose coupling versus high coupling. You want loose coupling, so that changes in one module don't affect another module.
You don't really have to worry about coupling and cohesion in existing systems that don't need to be changed. But when you start changing or build something new: take coupling/cohesion into account.
Software architecture is a tradeoff. Seperation of concerns is fine, but it creates layers and thus distance, for instance.
Python is one of the most difficult languages when it comes to clean coding and clean architecture. You're allowed to do so many dirty things! Typing isn't even mandatory...
He showed a simple REST API as an example. Database model + view. But when you change the database model, like a field name, that field name automatically changes in the API response. So your internal database structure is coupled to the function at the customer that consumes the API.
What you actually need to do is to have a better "contract". A domain model. In his example code, it was a Pydantic model with a fixed set of fields. A converter modifies the internal database model to the domain model.
You can also have services, generic pieces of code that work on domain models. And adapters to and from domain models, like converting domain models to csv.
Finding the balance is the software architect's job.
What is the least you should do as a software developer? At least to create a domain layer. Including a validator.
There was a question about how to do this with Django: it is hard. Django's models are everywhere. And you really need a clean domain layer...
21 May 2026 4:00am GMT
My PyCon US 2026
A timeline of my PyCon US 2026 journey, in Long Beach (US), told through the Mastodon posts I shared along the way.
21 May 2026 3:00am GMT
20 May 2026
Django community aggregator: Community blog posts
Weeknotes (2026 week 17)
Weeknotes (2026 week 17)
I published the last entry near the beginning of March. I'm really starting to see a theme in my Weeknotes publishing schedule.
Releases since the first weeks of March
I'm trying out a longer-form version of those notes here than in the past. I think it's worth going into some detail and not just listing releases with half a sentence each.
feincms3-sites and feincms3-language-sites
I released updates to feincms3-sites and feincms3-language-sites fixing the same issue in both projects: When an HTTP client didn't strip the default ports :80 (for HTTP) or :443 (for HTTPS) from a request, finding the correct site would fail. Browsers generally strip the port already, but some other HTTP clients do not.
django-tree-queries
As I wrote elsewhere I closed many issues in the repositories, mostly documentation issues but also some bugs. {% recursetree %} should now work properly and not cache old data anymore, using the primary key in .tree_fields() now raises an intelligible error, and I also fixed a bug with table quoting when using django-tree-queries with the not yet released Django 6.1+.
feincms3-cookiecontrol
feincms3-cookiecontrol not only offers a cookie consent banner (which actually supports only embedding tracking scripts when users give consent) but also a third-party content embedding functionality which allows allowlisting individual services.
The privacy policies of these services are now linked inline instead of with an ugly extra link. This reduces content inside the embed which helps on small screens.
Version 1.7 used a buggy trusted publishing workflow so I immediately published 1.7.1.
django-cabinet and django-prose-editor
django-cabinet can now be used as a media library directly inside django-prose-editor. I'm (ab)using the CKEditor 4 protocol for embedding, which uses window.opener.CKEDITOR.callFunction to send data back from the file manager popup into the editor. It feels icky but works nicely. This is only available if you're installing the alpha prereleases, but I'm already testing the functionality in production somewhere, so I feel quite good about it.
django-prose-editor now also ships brand new ClassLoom and StyleLoom extensions. Both extensions allow adding either classes or inline styles to text spans or nodes. In an ideal world we might not use something like this, but to make the editor more useful in the real world, editors need more flexibility. These two extensions provide that. I already mentioned ClassLoom in December, now it's actually available. I'm not completely sold on how they work yet, but both of them are already solving real issues.
Honorable mentions
django-debug-toolbar 6.3 has been released, I only contributed reviews during this cycle.
20 May 2026 5:00pm GMT
04 Apr 2026
Planet Twisted
Donovan Preston: Using osascript with terminal agents on macOS
Here is a useful trick that is unreasonably effective for simple computer use goals using modern terminal agents. On macOS, there has been a terminal osascript command since the original release of Mac OS X. All you have to do is suggest your agent use it and it can perform any application control action available in any AppleScript dictionary for any Mac app. No MCP set up or tools required at all. Agents are much more adapt at using rod terminal commands, especially ones that haven't changed in 30 years. Having a computer control interface that hasn't changed in 30 years and has extensive examples in the Internet corpus makes modern models understand how to use these tools basically Effortlessly. macOS locks down these permissions pretty heavily nowadays though, so you will have to grant the application control permission to terminal. But once you have done that, the range of possibilities for commanding applications using natural language is quite extensive. Also, for both Safari and chrome on Mac, you are going to want to turn on JavaScript over AppleScript permission. This basically allows claude or another agent to debug your web applications live for you as you are using them.In chrome, go to the view menu, developer submenu, and choose "Allow JavaScript from Apple events". In Safari, it's under the safari menu, settings, developer, "Allow JavaScript from Apple events". Then you can do something like "Hey Claude, would you Please use osascript to navigate the front chrome tab to hacker news". Once you suggest using OSA script in a session it will figure out pretty quickly what it can do with it. Of course you can ask it to do casual things like open your mail app or whatever. Then you can figure out what other things will work like please click around my web app or check the JavaScript Console for errors. Another very important tips for using modern agents is to try to practice using speech to text. I think speaking might be something like five times faster than typing. It takes a lot of time to get used to, especially after a lifetime of programming by typing, but it's a very interesting and a different experience and once you have a lot of practice It starts to to feel effortless.
04 Apr 2026 1:31pm GMT
16 Mar 2026
Planet Twisted
Donovan Preston: "Start Drag" and "Drop" to select text with macOS Voice Control
I have been using macOS voice control for about three years. First it was a way to reduce pain from excessive computer use. It has been a real struggle. Decades of computer use habits with typing and the mouse are hard to overcome! Text selection manipulation commands work quite well on macOS native apps like apps written in swift or safari with an accessibly tagged webpage. However, many webpages and electron apps (Visual Studio Code) have serious problems manipulating the selection, not working at all when using "select foo" where foo is a word in the text box to select, or off by one errors when manipulating the cursor position or extending the selection. I only recently expanded my repertoire with the "start drag" and "drop" commands, previously having used "Click and hold mouse", "move cursor to x", and "release mouse". Well, now I have discovered that using "start drag x" and "drop x" makes a fantastic text selection method! This is really going to improve my speed. In the long run, I believe computer voice control in general is going to end up being faster than WIMP, but for now the awkwardly rigid command phrasing and the amount of times it misses commands or misunderstands commands still really holds it back. I've been learning the macOS Voice Control specific command set for years now and I still reach for the keyboard and mouse way too often.
16 Mar 2026 11:04am GMT
04 Mar 2026
Planet Twisted
Glyph Lefkowitz: What Is Code Review For?
Humans Are Bad At Perceiving
Humans are not particularly good at catching bugs. For one thing, we get tired easily. There is some science on this, indicating that humans can't even maintain enough concentration to review more than about 400 lines of code at a time..
We have existing terms of art, in various fields, for the ways in which the human perceptual system fails to register stimuli. Perception fails when humans are distracted, tired, overloaded, or merely improperly engaged.
Each of these has implications for the fundamental limitations of code review as an engineering practice:
-
Inattentional Blindness: you won't be able to reliably find bugs that you're not looking for.
-
Repetition Blindness: you won't be able to reliably find bugs that you are looking for, if they keep occurring.
-
Vigilance Fatigue: you won't be able to reliably find either kind of bugs, if you have to keep being alert to the presence of bugs all the time.
-
and, of course, the distinct but related Alert Fatigue: you won't even be able to reliably evaluate reports of possible bugs, if there are too many false positives.
Never Send A Human To Do A Machine's Job
When you need to catch a category of error in your code reliably, you will need a deterministic tool to evaluate - and, thanks to our old friend "alert fatigue" above - ideally, to also remedy that type of error. These tools will relieve the need for a human to make the same repetitive checks over and over. None of them are perfect, but:
- to catch logical errors, use automated tests.
- to catch formatting errors, use autoformatters.
- to catch common mistakes, use linters.
- to catch common security problems, use a security scanner.
Don't blame reviewers for missing these things.
Code review should not be how you catch bugs.
What Is Code Review For, Then?
Code review is for three things.
First, code review is for catching process failures. If a reviewer has noticed a few bugs of the same type in code review, that's a sign that that type of bug is probably getting through review more often than it's getting caught. Which means it's time to figure out a way to deploy a tool or a test into CI that will reliably prevent that class of error, without requiring reviewers to be vigilant to it any more.
Second - and this is actually its more important purpose - code review is a tool for acculturation. Even if you already have good tools, good processes, and good documentation, new members of the team won't necessarily know about those things. Code review is an opportunity for older members of the team to introduce newer ones to existing tools, patterns, or areas of responsibility. If you're building an observer pattern, you might not realize that the codebase you're working in already has an existing idiom for doing that, so you wouldn't even think to search for it, but someone else who has worked more with the code might know about it and help you avoid repetition.
You will notice that I carefully avoided saying "junior" or "senior" in that paragraph. Sometimes the newer team member is actually more senior. But also, the acculturation goes both ways. This is the third thing that code review is for: disrupting your team's culture and avoiding stagnation. If you have new talent, a fresh perspective can also be an extremely valuable tool for building a healthy culture. If you're new to a team and trying to build something with an observer pattern, and this codebase has no tools for that, but your last job did, and it used one from an open source library, that is a good thing to point out in a review as well. It's an opportunity to spot areas for improvement to culture, as much as it is to spot areas for improvement to process.
Thus, code review should be as hierarchically flat as possible. If the goal of code review were to spot bugs, it would make sense to reserve the ability to review code to only the most senior, detail-oriented, rigorous engineers in the organization. But most teams already know that that's a recipe for brittleness, stagnation and bottlenecks. Thus, even though we know that not everyone on the team will be equally good at spotting bugs, it is very common in most teams to allow anyone past some fairly low minimum seniority bar to do reviews, often as low as "everyone on the team who has finished onboarding".
Oops, Surprise, This Post Is Actually About LLMs Again
Sigh. I'm as disappointed as you are, but there are no two ways about it: LLM code generators are everywhere now, and we need to talk about how to deal with them. Thus, an important corollary of this understanding that code review is a social activity, is that LLMs are not social actors, thus you cannot rely on code review to inspect their output.
My own personal preference would be to eschew their use entirely, but in the spirit of harm reduction, if you're going to use LLMs to generate code, you need to remember the ways in which LLMs are not like human beings.
When you relate to a human colleague, you will expect that:
- you can make decisions about what to focus on based on their level of experience and areas of expertise to know what problems to focus on; from a late-career colleague you might be looking for bad habits held over from legacy programming languages; from an earlier-career colleague you might be focused more on logical test-coverage gaps,
- and, they will learn from repeated interactions so that you can gradually focus less on a specific type of problem once you have seen that they've learned how to address it,
With an LLM, by contrast, while errors can certainly be biased a bit by the prompt from the engineer and pre-prompts that might exist in the repository, the types of errors that the LLM will make are somewhat more uniformly distributed across the experience range.
You will still find supposedly extremely sophisticated LLMs making extremely common mistakes, specifically because they are common, and thus appear frequently in the training data.
The LLM also can't really learn. An intuitive response to this problem is to simply continue adding more and more instructions to its pre-prompt, treating that text file as its "memory", but that just doesn't work, and probably never will. The problem - "context rot" is somewhat fundamental to the nature of the technology.
Thus, code-generators must be treated more adversarially than you would a human code review partner. When you notice it making errors, you always have to add tests to a mechanical, deterministic harness that will evaluates the code, because the LLM cannot meaningfully learn from its mistakes outside a very small context window in the way that a human would, so giving it feedback is unhelpful. Asking it to just generate the code again still requires you to review it all again, and as we have previously learned, you, a human, cannot review more than 400 lines at once.
To Sum Up
Code review is a social process, and you should treat it as such. When you're reviewing code from humans, share knowledge and encouragement as much as you share bugs or unmet technical requirements.
If you must reviewing code from an LLM, strengthen your automated code-quality verification tooling and make sure that its agentic loop will fail on its own when those quality checks fail immediately next time. Do not fall into the trap of appealing to its feelings, knowledge, or experience, because it doesn't have any of those things.
But for both humans and LLMs, do not fall into the trap of thinking that your code review process is catching your bugs. That's not its job.
Acknowledgments
Thank you to my patrons who are supporting my writing on this blog. If you like what you've read here and you'd like to read more of it, or you'd like to support my various open-source endeavors, you can support my work as a sponsor!
04 Mar 2026 5:24am GMT