Been using Claude Code (4 Opus) fairly successfully in a large Rust codebase, bu...

joshvm · 2025-06-25T18:24:12 1750875852

Gemini has some fun failure modes. It gets "frustrated" when changes it makes doesn't work, and replies with oddly human phrases like "Well, that was unexpected" and then happily declares that (I see the issue!) "the final tests will pass" when it's going down a blind alley. It's extremely overconfident by default and much more exclamatory without changing the system prompt. Maybe in training it was taught/figured out that manifesting produces better results?

jjice · 2025-06-25T18:50:47 1750877447

It also gets really down on itself, which is pretty funny (and a little scary). Aside from the number of people who've posted online about it wanting to uninstall itself after being filled with shame, I had it get confused on some Node module resolution stuff yesterday and it told me it was deeply sorry for wasting my time and that I didn't deserve to have such a useless assistant.

Out of curiosity, I told it that I was proud of it for trying and it had a burst of energy again and tried a few more (failing) solution, before going back to it's shameful state.

Then I just took care of the issue myself.

danielbln · 2025-06-25T19:08:40 1750878520

After a particular successful Claude Code task I praised it and told it to "let's fucking go!" to which it replied that loved the energy and proceeded to only output energetic caps lock with fire emojis. I know it's all smoke and mirrors (most likely), but I still get a chuckle out of this stuff.

edg5000 · 2025-07-03T13:05:51 1751547951

This really cracked me up, indeed mostly funny and maybe slightly scary or at least off-putting that it's so human-like.

noisy_boy · 2025-06-26T00:57:16 1750899436

I asked it to do a comparatively pedestrian task: write a script to show top 5 google searches.

First it did the search itself and then added "echo" for each of them - cute

Then it tried to use pytrends which didn't go anywhere

Then it tried some other paid service which also didn't go anywhere

Then it tried some other stuff which also didn't go anywhere

Finally it gave up and declared failure.

It will probably be useful as it can do the modify/run loop itself with all the power of Gemini but so far, underwhelming.

fcoury · 2025-06-26T12:42:43 1750941763

This was also my exact experience. I was pretty excited because I usually use Gemini Pro 2.5 when Claude Code gets stuck by pasting the whole code and asking questions and it was able to get me out of a few pickles a couple of times.

Unfortunately the CLI version wasn't able to create coherent code or fix some issues I had in my Rust codebase as well.

Here's hope that it eventually becomes great.

fpgaminer · 2025-06-25T19:10:30 1750878630

Claude will do the same start over if things get too bad. At least I've seen it when its edits went haywire and trashed everything.

eknkc · 2025-06-25T19:40:43 1750880443

Same here. Tried to implement a new feature on one of our apps to test it. It completely screwed things up. Used undefined functions and stuff. After a couple of iterations of error reporting and fixing I gave up.

Claude did it fine but I was not happy with the code. What Gemini came up with was much better but it could not tie things together at the end.

taberiand · 2025-06-25T21:56:09 1750888569

Sounds like you can use gemini to create the initial code, then have claude review and finalise what gemini comes up with

ZeroCool2u · 2025-06-25T17:18:43 1750871923

Personally my theory is that Gemini benefits from being able to train on Googles massive internal code base and because Rust has been very low on uptake internally at Google, especially since they have some really nice C++ tooling, Gemini is comparatively bad at Rust.

data-ottawa · 2025-06-26T02:19:16 1750904356

Tangental, but I worry that LLMs will cause a great stagnation in programming language evolution, and possibly a bunch of tech.

I've tried using a few new languages and the LLMs would all swap the code for syntactically similar languages, even after telling them to read the doc pages.

Whether that's for better or worse I don't know, but it does feel like new languages are genuinely solving hard problems as their raison d'etre.

breakingcups · 2025-06-26T09:43:51 1750931031

Not just that, I think this will happen on multiple levels too. Think de-facto ossified libraries, tools, etc.

LLMs thrive because they had a wealth of high-quality corpus in the form os Stack Overflow, Github, etc. and ironically their uptake is causing a strangulation of that source of training data.

sillystu04 · 2025-06-27T10:31:44 1751020304

Perhaps the next big programming language will be designed specifically for LLM friendliness. Some things which are human friendly like long keywords are just a waste of tokens for LLMs, and there could be other optimisations too.

leoh · 2025-06-25T23:58:58 1750895938

>Personally my theory is that Gemini benefits from being able to train on Googles massive internal code base and because Rust has been very low on uptake internally at Google, especially since they have some really nice C++ tooling, Gemini is comparatively bad at Rust.

Were they to train it on their C++ codebase, it would not be effective on account of the fact that they don't use boost or cmake or any major stuff that C++ in the wider world use. It would also suggest that the user make use of all kinds of non-available C++ libraries. So no, they are not training on their own C++ corpus nor would it be particularly useful.

leoh · 2025-06-26T08:25:44 1750926344

Excuse me why was this downvoted so aggressively??

simianwords · 2025-06-26T16:34:18 1750955658

How can they train on internal codebase without leaking specifics?

leoh · 2025-06-27T02:47:26 1750992446

They can’t, which is a good point. Also it would be basically useless for the reasons I mention.

thimabi · 2025-06-25T20:50:45 1750884645

> Personally my theory is that Gemini benefits from being able to train on Googles massive internal code base

But does Google actually train its models on its internal codebase? Considering that there’s always the risk of the models leaking proprietary information and security architecture details, I hardly believe they would run that risk.

kridsdale3 · 2025-06-25T20:56:50 1750885010

Googler here.

We have a second, isolated model that has trained on internal code. The public Gemini AFAIK has never seen that content. The lawyers would explode.

blurrybird · 2025-06-26T09:47:45 1750931265

What model do your lawyers run on?

thimabi · 2025-06-25T21:00:31 1750885231

Oh, you’re right, there are the legal issues as well.

Just out of curiosity, do you see much difference in quality between the isolated model and the public-facing ones?

kridsdale3 · 2025-06-25T21:03:18 1750885398

We actually only got the “2.5” version of the internal one a few days ago so I don’t have an opinion yet.

But when I had to choose between “2.0 with Google internal knowledge” and “2.5 that knows nothing” the latter was always superior.

The bitter lesson indeed.

dilap · 2025-06-25T18:26:38 1750875998

That's interesting. I've tried Gemini 2.5 Pro from time to time because of the rave reviews I've seen, on C# + Unity code, and I've always been disappointed (compared to ChatGPT o3 and o4-high-mini and even Grok). This would support that theory.

danielbln · 2025-06-25T19:10:20 1750878620

Interesting, Gemini must be a monster when it comes to Go code then. I gotta try it for that

jordanbeiber · 2025-06-26T06:56:03 1750920963

As go feels like a straight-jacket compared to many other popular languages, it’s probably very suitable for an LLM in general.

Thinking about it - was this not the idea of go from the start? Nothing fancy to keep non-rocket scientist away from foot-guns, and have everyone produce code that everyone else can understand.

Diving in to a go project you almost always know what to expect, which is a great thing for a business.

Unroasted6154 · 2025-06-25T20:18:16 1750882696

There is way more Java and C++ than Go at Google.

chewz · 2025-06-27T08:31:21 1751013081

Reasonsbly small Go codebase works well almost with any LLM

I had always designed very large projects as few medium sized independent Go tools and that strategy pays in times of AI assisted coding.

raincole · 2025-06-25T19:01:06 1750878066

So far I've found Gemini CLI is very good at explaining what existing code does.

I can't say much about writing new code though.

skerit · 2025-06-25T20:57:21 1750885041

I tried it too, it was so bad. I got the same "revert" behaviour after only 15 minutes.