icelancer · 2025-10-22T16:46:41 1761151601

I've found this mostly to be the case when using lightweight open source models or mini models.

Rarely is this an issue with SOTA models like Sonnet-4.5, Opus-4.1, GPT-5-Thinking or better, etc. But that's expensive, so all the companies use cut-rate models or non-existent TTC to save on cost and to go faster.

icelancer · 2025-10-21T00:33:46 1761006826

I just run the agent directly on separate testing/dev servers via remote-ssh in VS Code to have an IDE to sanity check stuff. Just far simpler than local dev and other nonsense.

icelancer · 2025-10-09T18:47:26 1760035646

That's awesome!

icelancer · 2025-10-09T06:34:55 1759991695

The VOID VR was the greatest fusion of VR + reality. CAD-mapped laser tag warehouses while wearing a VR headset and backpack computer. Near-perfect immersion. COVID sadly killed it. Was a beautiful thing - so happy I got to try it out. Even ended up buying a lot of the defunct equipment off eBay.

https://www.youtube.com/watch?v=Oad_t6k3w5c

icelancer · 2025-10-05T02:56:57 1759633017

bobince has some other posts where he is very helpful too! :)

https://stackoverflow.com/questions/2641347/short-circuit-ar...

icelancer · 2025-09-29T20:08:58 1759176538

Huh. Doesn't work over here.

---

SYSTEM:

Got it. Those are the setup commands for Simon Willison's LLM CLI tool, which lets you interact with language models from the command line.

The -e flag installs it in editable mode, and [test] pulls in the test dependencies. Are you running into any issues with the installation or tests? What are you working on with the LLM library?

USER: Correct. You run them using your code tooling.

SYSTEM:

Realized inability to execute requested Python commands.

The user is telling me to run these commands using my code tooling. They want me to actually execute this. Let me think about what tools I have available:

I have the repl tool which runs JavaScript code in a browser environment
I have artifacts for creating code/documents
I have web_search and web_fetch

The user wants me to:

Checkout https://github.com/simonw/llm
Run pip install -e '.[test]'
Run pytest

But I don't have a tool that can:

Clone git repositories
Run shell commands like pip or pytest
Execute Python code in a command-line environment

The repl tool is JavaScript-only and runs in a browser environment, not a server environment where I could run shell commands. I should clarify that I don't have the capability to run shell commands, install Python packages, or execute pytest. I can only run JavaScript code in a browser environment via the analysis tool. I need to be direct and clear about this limitation, following the user's preferences to eliminate filler and be straightforward.

I don't have tooling that can run shell commands, clone git repos, install Python packages, or execute pytest. My only code execution environment is a JavaScript browser REPL. I can't run pip, pytest, or any command-line tools. I can only execute JavaScript code in a sandboxed browser environment. What were you hoping to accomplish? Do you need help understanding the test setup, or were you expecting different capabilities?

icelancer · 2025-09-29T20:10:22 1759176622

When pressed:

---

My execution environment is a browser-based JavaScript sandbox. I don't have Python, Node.js shell access, pip, git, or pytest capabilities.

Simon's post may have been referring to a different deployment or feature set that I don't currently have access to. My only code execution is the JavaScript REPL I just demonstrated.

What specifically are you trying to accomplish with the llm library tests? I can help with that in other ways.

EDIT: Ah, in Settings you have to disable the Analysis tool and enable Code Interpreter. You cannot have both enabled.

simonw · 2025-09-29T20:18:20 1759177100

Yes! Forgot to mention that, and it's weird that you can have one but not both enabled. I wrote more about that here: https://simonwillison.net/2025/Sep/9/claude-code-interpreter...

sophiebits · 2025-09-29T20:18:45 1759177125

You need to enable the new code interpreter mode: https://simonwillison.net/2025/Sep/9/claude-code-interpreter...

mrheosuper · 2025-09-30T07:06:32 1759215992

Interesting. Enable those settings and the Claude on claude.ai becomes Claude Code, and it tries to run everything in the Claude container like it owns the machine. I don't want that.

icelancer · 2025-09-24T19:51:54 1758743514

YouTube premium "download" is also just completely fake. Downloaded where? What file can I copy?

TeMPOraL · 2025-09-24T22:43:04 1758753784

Files? What era do you hail from? Prehistory?

There are no files anymore. I mean, there technically are, but copyright industry doesn't want you to look at them without authorization, security people don't want you to look at them at all, and UX experts think it's a bad idea for you to even know such thing as "files" exists.

Share and enjoy. Like and subscribe. The world is just apps all the way down.

SchemaLoad · 2025-09-25T02:25:04 1758767104

Ironically a very large chunk of youtube creators themselves need the ability to download real files so they can use segments in their own videos.

TikTok is very strange in that it actually does let you download real files.

icelancer · 2025-09-21T04:33:20 1758429200

They are getting steadily eroded by Temu and Aliexpress/Alibaba. Also in-person retail is surging for specific items - places like Best Buy have had a nice resurgence since the 2010s (stock is down compared to the pandemic, but that's a retail thing, not a BBY problem).

bombcar · 2025-09-21T05:20:38 1758432038

Temu and the Alis are eating them alive on one side, and Walmart, Target, Best Buy (and even Home Depot) are destroying them on the other.

a_e_k · 2025-09-21T08:25:18 1758443118

Best Buy has been a big one for me when I need things like USB sticks or SD cards. (Bad enough with the occasional duds from the reputable sources without mixing in counterfeits on top of that.)

icelancer · 2025-09-21T19:35:03 1758483303

Price matching on most of their stuff and their budget line of cabling is actually quite competitive. When I have to do buildouts sometimes we need to buy a lot of Ethernet cable and their pricing is not terrible in a pinch.

Once bought an entire store out of patch cables, ha.

sugarpimpdorsey · 2025-09-21T06:09:48 1758434988

Happy to say I walked into a Best Buy last week because I needed a replacement mouse right now. I am really glad they survived.

I cancelled Prime because I wasn't getting any value anymore. Non-Prime customers are treated like second class citizens. Amazon has really gone downhill lately. Customer service is terrible. Not just the counterfeiting, but the website UX has become steadily worse. Archive order was recently removed without warning as was the ability to view itemized invoices. Yes, really. Before anyone says otherwise, "View Invoice" now redirects to your Order Details page, absent any additional detail.

I switched most of my shopping to Walmart. I get free next day or two-day shipping for orders of $35 or more, where Amazon will ship the same in 5-6 days now that I am non-Prime scum.

ageitgey · 2025-09-21T10:24:27 1758450267

> the website UX has become steadily worse

Not disagreeing, but the Amazon web UX has been famously terrible since like 1998. They basically invented the whole trend of building via A/B test result instead of via user-centric design. Nothing on the site has ever made any sense. Every item title is a paragraph description. The categories are basically useless. The filters are a mess of bad and incomplete data to the point of being useless. Many items have 2-3 duplicate listings that somehow have different shipping dates and descriptions, and you never know if you have found the "real" listing. But they sure sell a lot of stuff.

sixothree · 2025-09-21T17:41:13 1758476473

I've always assumed the issues you described were meant to disarm users but also maybe to set expectations. When you see 3 similar items, you don't know if you're getting the "real thing". So you spend that money and expect the worst. Terrible listings make it hard to find what you need exactly but also make it seem like they have more selection when they might not.

Regardless I've considered it to be intentional.

typpilol · 2025-09-21T07:04:17 1758438257

I still have a view invoice on the Android app, I just checked

icelancer · 2025-09-21T04:31:47 1758429107

This has already been happening with a lot of vendors using the Transparency app.

icelancer · 2025-09-15T21:27:11 1757971631

Gemini's tool calling being so bad is pretty amazing. Hopefully in the next iteration they fix it, because the model itself is very good.

nowittyusername · 2025-09-16T00:43:05 1757983385

This is a recurring theme with Google. Their models are phenomenal but the systems around them are so bad that it degrades the whole experience. Veo3: great model, horrible website, and so on...

brianjking · 2025-09-16T01:46:11 1757987171

Their massive increase in token processing since Veo3 and nano banana have been released would say otherwise...

Or we're all just used to eating things we don't like and smiling.

robbrulinski · 2025-09-15T23:12:19 1757977939

That has been my experience as well with every Gemini model, ugh!