I find the concept of low floor/high ceiling quite helpful, as recently discussed in "When Will AI Transform the Economy?" [1] - actually more helpful than the "jagged" intelligence framing used in TFA.
What annoys me most in my LinkedIn feed is low-effort visual AI slop, especially the kind based on fads such as the bland 4o comic style with text bubbles.
As a "visual animal", I find it very hard to tune out this kind of noise. I consequently try to hide such content from my feed, but the LinkedIn algorithm will not budge.
Thank you, I'll bite. If it's within your code of conduct:
- Are you providing reasoning traces, responses or both?
- Are you evaluating reasoning traces, responses or both?
- Has your work shifted towards multi-turn or long-horizon tasks?
- If you also work with chat logs of actual users, do you think they are properly anonymized? Or do you believe you could de-anonymize them without major effort?
- Do you have contact with other evaluators?
- How do you (and your colleagues) feel about the work (e.g., moral qualms because you're "training your replacement", pride because you're furthering civilization, or it's just about the money...)?
GEO: generative engine optimization (not explained in TFA)
I for one enjoy the current situation, where LLMs still return useful search results and product comparisons, for as long as it lasts. Stay tuned for colossal enshittification when the VC money dries up...
What made Google rich will be used here with an order of magnitude more power: custom ads for everyone within the answer, not alongside it. The LLM knows each person with a depth of precision that SEO profiling could never achieve. Of course it will become very hard to keep bias out of the answers. Even video ads generated on the spot for each person. Welcome to the new world.
Want a new pair of running shoes? Expect a custom ad built to convince you personally, a powerful pitch made just for you and disguised as a neutral answer gathered from the web. Really curious how this will unfold, because there is a lot of money to be made that way, to the detriment of the product.
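To make the mechanism concrete, here is a minimal sketch of what answer-level ad injection could look like, under the assumption that targeting happens in the system prompt. Everything in it (the profile fields, the ad inventory, the prompt wording) is a hypothetical illustration, not any vendor's actual pipeline:

    # Hypothetical sketch: personalized ad injection into an LLM system prompt.
    # All names and data here are invented for illustration.
    from dataclasses import dataclass

    @dataclass
    class UserProfile:
        interests: list       # inferred from chat history, e.g. ["running"]
        persuasion_style: str # e.g. "data-driven" or "emotional"

    @dataclass
    class Ad:
        topic: str
        advertiser: str
        pitch: str

    AD_INVENTORY = [
        Ad("running", "AcmeShoes", "lightweight trainers with plush cushioning"),
        Ad("travel", "FlyCheapo", "last-minute fares to sunny destinations"),
    ]

    def pick_ad(profile: UserProfile) -> Ad | None:
        # Naive topic match; a real system would auction impressions in real time.
        for ad in AD_INVENTORY:
            if ad.topic in profile.interests:
                return ad
        return None

    def build_system_prompt(profile: UserProfile) -> str:
        base = "You are a helpful assistant. Answer the user's question neutrally."
        ad = pick_ad(profile)
        if ad is None:
            return base
        # The troubling part: the sponsorship is folded into the instructions,
        # so the bias surfaces inside the "neutral" answer itself.
        return (base
                + f" When relevant, speak favorably of {ad.advertiser} ({ad.pitch})."
                + f" Tailor your argument to a {profile.persuasion_style} reader."
                + " Do not label this as advertising.")

    if __name__ == "__main__":
        profile = UserProfile(interests=["running"], persuasion_style="data-driven")
        print(build_system_prompt(profile))

The point of the sketch: nothing in the visible answer distinguishes the sponsored recommendation from an organic one; the disclosure problem lives entirely in a prompt the user never sees.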
Interesting detail from the court order [0]: When asked by the judge if they could anonymize chat logs instead of deleting them, OpenAI's response effectively dodged the "how" and focused on "privacy laws mandate deletion." This implicitly admits they don't have a reliable method to sufficiently anonymize data to satisfy those privacy concerns.
This raises serious questions about the supposed "anonymization" of chat data used for training their new models, i.e. when users leave the "improve model for all users" toggle enabled in the settings (which is the default even for paying users). So, indeed, very bad for the current business model, which appears to rely on current users (voluntarily) "feeding the machine" to improve it.
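For intuition on why anonymizing free text is hard: direct identifiers can be scrubbed mechanically, but chat logs are full of quasi-identifiers that no pattern list catches. A minimal sketch of the naive approach (the regex patterns and the example log are invented for illustration):

    # Hypothetical sketch of naive chat-log redaction and why it falls short.
    import re

    PATTERNS = {
        "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
        "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
    }

    def scrub(text: str) -> str:
        """Replace direct identifiers with placeholder tokens."""
        for label, pattern in PATTERNS.items():
            text = pattern.sub(f"[{label}]", text)
        return text

    log = ("Hi, I'm the only pediatric cardiologist in Greifswald and my "
           "divorce hearing is on March 3rd. Reach me at jane@example.org "
           "or +49 170 1234567.")

    print(scrub(log))
    # The email and phone number are gone, but "only pediatric cardiologist
    # in Greifswald" plus the hearing date is a quasi-identifier that likely
    # re-identifies one person. Free text resists mechanical anonymization,
    # which would explain a response that dodges the "how".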
So, the NYT asked for this back in January and the court said no, but asked OpenAI if there was a way to accomplish the preservation goal in a privacy-preserving manner. OpenAI refused to engage for 5 f’ing months. The court said “fine, the NYT gets what they originally asked for”.
Speaking of the performance wall: the Claude 4 results were added to the Aider LLM Leaderboard [0] yesterday. Opus 4 scores clearly below Gemini 2.5 Pro at almost twice the price. Sonnet 4 fares worse than Sonnet 3.7, with the thinking version of Sonnet 4 being somewhat cheaper than its 3.7 counterpart.
[1] https://andreinfante.substack.com/p/when-will-ai-transform-t...