I use AI for most of those things. And I think it probably saves me a bit of time.
But in that study that came out a few weeks ago where they actually looked at time saved, every single developer overestimated their time saved. To the point where even the ones who lost time thought they saved time.
LLMs are very good at making you feel like you’re saving time even when you aren’t. That doesn’t mean they can’t be a net productivity benefit.
But I’d be very very very surprised if you have real hard data to back up your feelings about your work taking you half as long and being equal quality.
I’m not surprised by its findings. I had the same feeling; I made some attempts at using LLMs for coding prior to CC, and with rare exceptions it never saved me any time.
CC changed that situation hugely, at least in my subjective view. It’s of course possible that it’s not as good as I feel it is, but I would at least want a new study.
How are you tracking that? Are you keeping a log, or are you just guessing? Do you have a mostly objective definition of intense work or are you just basing it on how you feel? Is your situation at work otherwise exactly the same, or have you gotten into a better groove with your manager? Are you working on exactly the same thing? Have you leveled up with some more experience? Have you learned the domain better?
Is your work objectively the same quality? Is it possible that you are producing less but it’s still far above the minimum so no one has noticed? Is your work good enough for now, but a year from now when someone tries to change it, it will be a lot harder for them?
Based on the only real studies we have, humans grossly overestimate AI time savings. It’s highly likely you are too.
_sigh_. Really, dude? Just because people overestimate the savings on average doesn’t mean every person does. In fact, you should be well versed enough in statistics to understand that it will be a spectrum, highly dependent on both a person’s role and how they use the tool.
Any new tool has a range of usefulness that depends on many factors and affects people differently as individuals. Just because a carpenter doesn’t save much time from Microsoft Excel existing doesn’t mean it isn’t a hugely useful tool, or that it doesn’t save a lot of time for accountants, for example.
Instead of trying to tear apart my particular case, why not entertain the possibility that I’m reporting pretty accurately and am simply higher up that spectrum, with a good combination of having an ideal use case for the tool and using it skilfully?
I’ll admit it’s possible my estimates are off a bit. What isn’t up for debate though is that it’s made a huge difference in my life and saved me a ton of time.
The fact that people overestimate its usefulness is somewhat of a “shrug” for me. So long as it _is_ making big differences, that’s still great whether people overestimate it or not.
If people overestimate the time saved by huge margins, we don’t know whether it’s making big differences or not, or more specifically whether the boost is worth the cost (both monetary and otherwise).
Sure, and if we look at the data, the only independent studies we have show either small productivity gains or a reduction in productivity for everything but small greenfield projects.
Google for the Stanford study by Yegor Denisov-Blanch. You might have to pay to access the paper, but you can watch the author’s synopsis on YouTube.
For low-complexity greenfield projects (the best case) they found a 30% to 40% productivity boost.
For high-complexity brownfield projects (the worst case) they found anywhere from a 5% productivity loss to a 10% gain.
The METR study from a few weeks ago showed an average productivity drop of around 20%.
That study also found that the average developer believed AI had made them about 20% more productive. The gap between perception and reality was therefore around 40 percentage points on average.
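Just to spell out the arithmetic behind that gap (a back-of-the-envelope sketch using the rounded figures above, not anything taken from the paper itself):

```python
# Back-of-the-envelope: perceived vs. measured speedup, using the rounded METR figures above.
perceived_speedup = 0.20   # developers believed they were ~20% faster with AI
measured_speedup = -0.20   # the study measured them as ~20% slower with AI

gap_in_percentage_points = (perceived_speedup - measured_speedup) * 100
print(f"perception gap: {gap_in_percentage_points:.0f} percentage points")  # -> 40
```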
The devil is always in the details with these studies: what did they measure, how did they measure it, are they counting learning the new tool as unproductive time, and so on. I’ll have to read them myself. Regardless, if the scientific truth is that it makes most people less productive on average, I’ll be sad, but it won’t change the fact that for my specific use case there is a clear time saving.
Sure, you need to read them yourself to know what conclusions to draw.
In my specific case I felt like I was maybe 30% faster on greenfield projects with AI (and maybe 10% on brownfield). Then I read the study showing a 40 percentage point overestimate on average.
I started tracking things and it’s pretty clear I’m not actually saving anywhere near 30%, and I’d estimate that long term I might be in the negative productivity realm.
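For anyone curious, this is roughly the kind of tracking I mean; a minimal sketch assuming a simple CSV log with illustrative column names (not my exact setup):

```python
# Minimal sketch: compare average task duration with and without AI assistance.
# Assumes a CSV log with illustrative columns: task,ai_used,minutes
import csv
from statistics import mean


def summarize(path: str = "tasklog.csv") -> None:
    with_ai, without_ai = [], []
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            minutes = float(row["minutes"])
            if row["ai_used"].strip().lower() in ("yes", "true", "1"):
                with_ai.append(minutes)
            else:
                without_ai.append(minutes)

    if with_ai and without_ai:
        saved = 1 - mean(with_ai) / mean(without_ai)
        print(f"avg with AI: {mean(with_ai):.1f} min over {len(with_ai)} tasks")
        print(f"avg without: {mean(without_ai):.1f} min over {len(without_ai)} tasks")
        print(f"apparent time saved per task: {saved:+.0%}")
    else:
        print("need logged tasks in both categories to compare")


if __name__ == "__main__":
    summarize()
```

Obviously this only means something if the tasks in both buckets are comparable, which is exactly the kind of thing the studies control for far better than I can.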