More

gck1 · 2025-09-22T11:33:06 1758540786

O-1 requires yearly assessment of the exceptional status though. You can hardly plan your life around a visa that is quite subjective in itself and may depend on the mood of the USCIS officer reviewing your case on that particular day.

gck1 · 2025-09-09T21:32:14 1757453534

Ehh, the great firewall of China has been in place for a long time now, it's also illegal to evade it, and yet, it has sprouted many great tools that make it so simple to evade, that enforcement becomes impossible.

If anything, technology will always win over the legislation if it happens at scale. It may even lead to some new breakthroughs.

gck1 · 2025-09-05T22:42:32 1757112152

I was insta shadowbanned on Tinder after I went through the registration on my network which exits via VPN and blocks spyware domains via adguard.

gck1 · 2025-08-30T13:26:32 1756560392

> Not to mention the big cloud providers are unhinged with their egress pricing.

I always wonder why this status quo persisted even after Cloudflare. Their pricing is indeed so unhinged, that they're not even in consideration for me for things where egress is a variable.

Why is egress seemingly free for Cloudflare or Hetzner but feels like they launch spaceships at AWS and GCP every time you send a data packet to the outside world?

nromiun · 2025-08-30T13:39:11 1756561151

They are just greedy. And they know nobody can compete with them for availability in every country. Except for Cloudflare, which is why it is so popular.

gck1 · 2025-08-30T13:18:03 1756559883

And god forbid you live in an authoritarian country and must use VPN to protect your freedom. Internet becomes captcha hell run by 2-3 companies.

I've had far fewer issues with my own bots that access cloudflare protected websites, than during my regular browsing with privacy respecting browsers and a VPN.

As a side note: I'm at least thankful Microsoft isn't behind web gatekeeping. Try and solve any microsoft captcha behind a VPN - its like writing a thesis, you gotta dedicate like 5 minutes, full attention.

gck1 · 2025-08-30T13:01:11 1756558871

Internet Archive is missing enormous chunks of the internet though. And I don't mean weird parts of the internet, just regional stuff.

Not even news articles from top 10 news websites from my country are usually indexed there.

swed420 · 2025-08-30T21:52:08 1756590728

So then make a better one. I was only referencing it as a general concept that can be approved upon as desired.

gck1 · 2025-08-29T00:23:59 1756427039

Also fonts you have installed, the type of connection you're using, GPU parameters, keyboard languages on your system and so much more [1]

[1] https://abrahamjuliot.github.io/creepjs/

gck1 · 2025-08-29T00:09:08 1756426148

Curiously, I explicitly tell all my LLM agents to never touch Pydantic models or environment stuff - it’s even in big, uppercase, bold text in my custom instructions for Roo-Code. LLMs seem to trip a lot over Pydantic’s magic.

__MatrixMan__ · 2025-08-29T01:04:33 1756429473

pydantic-AI is a bit different than pydantic. The LLM isn't prompted to generate the pydantic model, instead it's encouraged to take input in the form of one model and produce output in the form of another.

gck1 · 2025-08-29T00:02:37 1756425757

LLMs (Sonnet, Gemini from what I tested) tend to “fix” failing tests by either removing them outright or tweaking the assertions just enough to make them pass. The opposite happens too - sometimes they change the actual logic when what really needs updating is the test.

In short, LLMs often get confused about where the problem lies: the code under test or the test itself. And no amount of context engineering seems to solve that.

Zerot · 2025-08-29T04:40:40 1756442440

I think in part the issue is that the LLM does not have enough context. The difference between a bug in the test or a bug in the implementation is purely based on the requirements which are often not in the source code and stored somewhere else(ticket system, documentation platform).

Without providing the actual feature requirements to the LLM(or the developer) it is impossible to determine which is wrong.

Which is why I think it is also sort of stupid by having the LLM generate tests by just giving it access to the implementation. That is at best testing the implementation as it is, but tests should be based on the requirements.

gck1 · 2025-08-29T05:29:10 1756445350

Oh, absolutely, context matters a lot. But the thing is, they still fail even with solid context.

Before I let an agent touch code, I spell out the issue/feature and have it write two markdown files - strategy.md and progress.md (with the execution order of changes) inside a feat_{id} directory. Once I’m happy with those, I wipe the context and start fresh: feed it the original feature definition + the docs, then tell it to implement by pulling in the right source code context. So by the time any code gets touched, there’s already ~80k tokens in play. And yet, the same confusion frequently happens.

Even if I flat out say “the issue is in the test/logic,”, even if I point out _exactly_ what the issue is, it just apologizes and loops.

At that point I stop it, make it record the failure in the markdown doc, reset context, and let it reload the feature plus the previous agent’s failure. Occasionally that works, but usually once it’s in that state, I have to step in and do it myself.

gck1 · 2025-08-28T23:53:48 1756425228

> Overconfident, forgetful, sloppy, easily distracted

And constantly microdosing, sometimes a bit too much.

DrewADesign · 2025-08-29T04:27:35 1756441655

And sometimes it seems like a macro-dose huffing xylene-based glue.