Author here: To be honest, I know there are like a bajillion Claude Code posts out there these days.
But, there are a few nuggets we figured are worth sharing, like Anchor Comments [1], which have really made a difference:
——
# CLAUDE.md
### Anchor comments
Add specially formatted comments throughout the codebase, where appropriate, for yourself as inline knowledge that can be easily `grep`ped for.
- Use `AIDEV-NOTE:`, `AIDEV-TODO:`, or `AIDEV-QUESTION:` as a prefix, as appropriate.
- *Important:* Before scanning files, always first try to grep for existing `AIDEV-…`.
- Update relevant anchors after finishing any task.
- Make sure to add relevant anchor comments whenever a file or piece of code is:
  * too complex,
  * very important, or
  * potentially buggy.
——
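To make this concrete, here's a hypothetical example of an anchor in practice (the function and comments are made up for illustration, not from our codebase):

```python
# AIDEV-NOTE: perf-sensitive; runs on every request, avoid extra allocations here
def merge_session_context(base: dict, overrides: dict) -> dict:
    # AIDEV-TODO: handle nested keys once scoped overrides land
    merged = dict(base)       # shallow copy so the caller's dict stays untouched
    merged.update(overrides)  # overrides win on key collisions
    return merged
```

Because the prefix is uniform, a single `grep -rn "AIDEV-" src/` surfaces every anchor at once, which is what the "grep before scanning files" rule above relies on.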
Just to provide a contrast to some of the negative comments…
As a very experienced engineer who uses LLMs sporadically* and not in any systematic way, I really appreciated seeing how you use them in production on a real project. I don't know why people are being negative; you just mentioned your project in detail where it was appropriate to talk about its structure. It doesn't strike me as gratuitous self-promotion at all.
Your post is giving me motivation to empower the LLMs a little bit more in my workflows.
*: They absolutely don't get the keys to my projects, but I have had great success with having them complete specific tasks.
> Think of this post as your field guide to a new way of building software. By the time you finish reading, you'll understand not just the how but the why behind AI-assisted development that actually works.
Hi, AI skeptic with an open mind here. How much will this cost me to try? I don't see that mentioned in your writeup.
There are a bunch of different options, so it depends on the models you end up liking, but the simplest place to start is to get the Claude Max $100 tier and use Opus 4 (the others don't really give you the full experience).
I hear a lot of good stuff about Claude Code lately, but before that, it was all about Copilot or Cursor. And some options are a lot cheaper than others. Is Claude Code really that much better now?
I admit I have no idea what the real differences are. Everybody seems to claim to be the best and most comprehensive AI coding solution.
There are a lot of posts around, but this was very practical and gives me a system I can try to implement and perhaps improve. Much appreciated. Thanks for taking the time to write it.
One thing I would have liked to know is the difference between a workflow like this and the use of aider. If you have any perspective on that, it would be great.
Thank you! aider is actually a different beast; I found its memory/context handling best in class. Somehow, though, I ended up liking Claude Code the most because of its TUI, but it's really a matter of personal preference and workflow.
Thanks for the great article; this is much needed for understanding how to properly use LLMs at scale.
You mentioned that LLMs should never touch tests, then followed up with an example refactoring that changed 500+ endpoints and was completed in 4 hours. This is impressive! I wonder whether those 4 hours included test refactoring as well, or if that was just prompting time?
At one point you mentioned that if a test is updated by AI, you reject the PR. How do you know whether it was generated or updated by AI?
From the article, I only got that there's a git commit message convention for marking that, but that, too, is only at the commit level.
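I assume it's something like a trailer line in the commit message; a hypothetical example (the wording is my guess, not taken from the article):

```
fix: normalize session expiry handling

AI-assisted: initial diff drafted by Claude Code; tests reviewed by a human and left untouched.
```

But that still tells you nothing about which hunks inside the commit were machine-written.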
Great post. I'm fairly new to the AI pair programming thing (I've been using Aider), but with 20 years of coding behind me I can see where things are going. You're dead right in the conclusion about now being the time to adopt this stuff as part of your flow -- if you haven't already.
And regarding the HN post getting buried for a while there...[1] Somewhat ironic that an article about using AI to help write code would get canned for using an AI to help write it :D
Did you use Claude Code to write the post? I'm finding that I'm using it for 100% of my own writing because agentic editing of markdown files is so good (and miles better than what you get with claude.ai artifacts or chatgpt.com canvas). This is how you can do things like merge deep research or other files into the doc that you are writing.
Right. But you can copy-paste that into a separate doc and have Claude Code merge it in (not a literal merge, but a semantic merge: "integrate relevant parts of this research into this doc"). This is super powerful - try it!
The models are the same, but the actual prompts sent to the model are likely somewhat different because of the agentic loop, so I would imagine (without having done the experiments) there will be slight differences. It's unclear whether they will be larger or smaller than the variance you get from sending the same prompt multiple times to the same interface (e.g., Claude.ai variance vs. Claude Code variance vs. variance between Claude.ai and Claude Code). Would be an interesting controlled experiment to try!
I meant, though, in the wider context of the team: everyone uses it, but not everyone will work the same way or use the same underlying prompts as they work. So how do you ensure everyone keeps to that agreement?
> So how do you ensure everyone keeps to that agreement?
There's nothing specific to using Claude or any other automation tool here. You still use code reviews, linters, etc. to catch anything that isn't following the team norms and expectations. Either that or, as the article points out, someone will cause an incident and may be looking for a new role (or nothing bad happens and no one is the wiser).
[1]: https://diwank.space/field-notes-from-shipping-real-code-wit...