Its number of occurrences is 103,090. In the master's thesis identified as the original source (https://cs.uwaterloo.ca/~smwatt/home/students/theses/CSo2005...), the Unicode value of that operator is given as 2061, and the thesis helpfully explains that:
Unicode 2061, 2062 and 2063 are invisible operators. TeX does not have any of these invisible operators. These invisible operators result from the TEX to MathML conversion.
– 2061 – Function application
– 2062 – Invisible times
– 2063 – Invisible separator
And Wikipedia says that function application may be represented as
U+2061 FUNCTION APPLICATION — a contiguity operator indicating application of a function; that is, an invisible zero-width character intended to distinguish concatenation meaning function application from concatenation meaning multiplication. https://en.wikipedia.org/wiki/Function_application#Represent...
I'm not sure, though, how an automated conversion process would be able to distinguish between these.
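For concreteness, these are plain Unicode code points, so it's easy to see what a converter has to emit and to spot them in extracted text. A small Python sketch (the sample string is a made-up illustration, not data from the thesis):

    import unicodedata

    # The three invisible operators discussed above.
    INVISIBLE_OPERATORS = ["\u2061", "\u2062", "\u2063"]

    for ch in INVISIBLE_OPERATORS:
        print(f"U+{ord(ch):04X}  {unicodedata.name(ch)}")
    # U+2061  FUNCTION APPLICATION
    # U+2062  INVISIBLE TIMES
    # U+2063  INVISIBLE SEPARATOR

    def count_invisible_operators(text):
        """Count each invisible operator in a string, e.g. text content
        pulled out of converted MathML."""
        return {f"U+{ord(ch):04X}": text.count(ch) for ch in INVISIBLE_OPERATORS}

    # A converter would emit U+2061 between "sin" and "(x)" for sin(x),
    # and U+2062 between "2" and "x" for the product 2x.
    sample = "sin\u2061(x) + 2\u2062x"
    print(count_invisible_operators(sample))
    # {'U+2061': 1, 'U+2062': 1, 'U+2063': 0}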
No, the data (as described in So's thesis) was mathematical expressions extracted from TeX source code, so the surrounding text, email addresses, etc. were ignored. Skimming through by eye, I can't see @ in any of So's tables, and searching for its hex Unicode value (which the tables list for every other character) yields no hits: @ is not in the tables.
∋ is there anomalously often, and @ is missing, so something seems to have gone wrong, probably at multiple stages of the pipeline.
With tools like Ollama, self-hosting is easier than using a hosted service. No sign-up, no API keys, no permission needed to spend money, no worries about data security: just an easy install and a Python library import. Qwen2.5-VL 7B is proving useful even on a work laptop with insufficient VRAM - I just leave it running over a night or weekend, and it's saving me dozens of hours of work (that I then get to spend on other higher-value work).
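For anyone curious what "just import a Python library" looks like in practice, here's a minimal sketch with the ollama Python package. The exact model tag and the image path are placeholders, and it assumes the model has already been pulled:

    # Minimal sketch with the ollama Python package (pip install ollama),
    # assuming the Ollama server is running locally and the model has been
    # pulled first (e.g. `ollama pull qwen2.5vl:7b` - the exact tag may
    # differ). The image path is a placeholder for whatever you're processing.
    import ollama

    response = ollama.chat(
        model="qwen2.5vl:7b",
        messages=[
            {
                "role": "user",
                "content": "Transcribe the text in this image.",
                "images": ["scan_page_001.png"],  # hypothetical input file
            }
        ],
    )
    print(response["message"]["content"])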
I got the 70B Qwen/Llama distill; I have 24 GB of VRAM.
I opened aider and gave a small prompt, roughly:
Implement a JavaScript 2048 game that exists as flat file(s) and does not require a server - just the game HTML, CSS, and JS. Make it compatible with Firefox, at least.
That's it. Several hours later, it finished, and the game ran. It was worth it because this was in the winter and it heated my house a bit, yay. I think the resulting one-shot output is on my GitHub.
I know it was in the training set, etc., but I wanted to see how big of a hassle it was, whether it would one-shot with such a small prompt, and how long it would take.
Makes me want to try DeepSeek 671B, but I don't have any machines with >1TB of memory.
Buy a used workstation with 512 GB of DDR4 RAM. It will probably cost around $1-1.5k and be able to run a Q4 quant of the full DeepSeek 671B models. I have a similar setup with dual-socket 18-core Xeons (and 768 GB of RAM, so it cost about $2k), and I get about 1.5 tokens/sec on those models. Being able to see the full thinking trace on the R1 models is awesome compared to the OpenAI models.
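For a rough idea of what running this looks like, something like the llama-cpp-python bindings over a Q4 GGUF works; everything below (filename, thread count, context size) is illustrative rather than an exact setup:

    # Rough sketch with llama-cpp-python (pip install llama-cpp-python),
    # pointing at the first shard of a split Q4 GGUF of DeepSeek-R1.
    # Filenames and tuning values are illustrative placeholders.
    from llama_cpp import Llama

    llm = Llama(
        model_path="DeepSeek-R1-Q4_K_M-00001-of-00009.gguf",  # hypothetical filename
        n_ctx=8192,      # context window; bigger costs more RAM
        n_threads=36,    # roughly one per physical core on dual 18-core Xeons
        n_gpu_layers=0,  # pure CPU inference
    )

    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": "Explain Q4 quantization briefly."}],
        max_tokens=512,
    )
    print(out["choices"][0]["message"]["content"])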
If/when Corporate Legal approves a tool like Ollama for use on company computers, yes. Might not require purchasing anything, but there can still be red tape.
I never claimed that it did. Gemini would probably save me the same dozens of hours, but it would come with ongoing costs and additional start-up hurdles (some near-insurmountable in my organisation, like data security for some of what I'm doing).
Gemini Flash or any free LLM on OpenRouter would be orders of magnitude faster and effectively free. Unless you're concerned about the privacy of the conversation, the benefit is really just being able to say you did it locally.
I definitely do appreciate and believe in the value of open-source / open-weight LLMs - but inference is so cheap right now for non-frontier models.
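To give a sense of how little setup the hosted route needs, here's a minimal sketch using the standard openai client against OpenRouter's OpenAI-compatible endpoint; the model slug and key are placeholders:

    # Minimal sketch of the hosted route: OpenRouter exposes an
    # OpenAI-compatible API, so the standard openai client works with a
    # different base_url. The model slug and API key are placeholders.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key="YOUR_OPENROUTER_KEY",
    )

    resp = client.chat.completions.create(
        model="google/gemini-2.0-flash-001",  # or any free model slug on OpenRouter
        messages=[{"role": "user", "content": "Summarize this paragraph: ..."}],
    )
    print(resp.choices[0].message.content)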
Texas has a plurality of fatal car accidents (for the USA), but California is not far behind, and in 2022 California had slightly more deaths. (This page doesn't have the number of fatal car accidents for 2022, which is a bit odd.)
You're not looking at absolute numbers, which is what plurality means. I don't see how "someone in the US is more likely to die in a car wreck in Texas even if they never go to Texas" could make sense.
A driver in the US dies while driving due to a crash/wreck/whatever.
Statistically, that occurred with the highest probability in TX. As I said, this was like 2015-2019 when I used to claim this. There are signs on freeways in TX that say "highway deaths so far in <year>: <16-bit int>", which led me to start looking into it, and I think my little quip is just a way to draw attention to how dangerous it is to drive in TX. But it is quite large, Texas.
No, that works well IME. If it's worth something towards the final grade, even 1%, most students will do it. It can be hard to persuade some of my students not to spend multiple hours attempting to get 0.1% more of the course grade by doing another quiz attempt when they've already achieved 90% - I think they're better off moving on to the next thing.
This one is customisable. The firmware uses QMK, so you can remap it however you like. You'd need to make some key label stickers in Inkscape or something if you want the keys to show the characters.
There was a discussion here a couple of weeks ago (with a typo in the title): https://news.ycombinator.com/item?id=44110219