More

admiralrohan · 2025-10-01T09:09:51 1759309791

Working on a original algorithm to explain human behavior from 3rd person perspective.

The whole research is divided into 6 stages. In 2nd stage, I want to use that to mathematically establish the best course of action as an individual.

In 3rd stage, I will explain common psychological phenomenon through the theory, things like narcissism, anxiety, self-doubt, how to forgive others, etc.

In 4th stage, I will explain how the theory is the fastest way to learn across multiple domains and become a generalist and critical thinker.

In 5th stage, I will explain how society will unfold if everyone can become generalist and critical thinker through the theory.

In 6th and last stage, I will think about how to use this theory to make India the next superpower, as this theory can give us the demographic advantage.

Shared more about the algorithm here https://x.com/admiralrohan/status/1973312855114998185

admiralrohan · 2025-09-23T08:05:09 1758614709

I always wondered why people care so much about data. Now I can understand why. Thanks for sharing.

admiralrohan · 2025-09-19T05:47:10 1758260830

Inevitable.

admiralrohan · 2025-09-17T23:23:44 1758151424

Most likely the role of "programmer" will go away as we conventionally know Anyways, we have evolved a lot like from assembly language to using npm packages and we are going to see the next evolution.

On inability to solve hard problems, I think we are going to tackle even harder problems in future with AI in our side, just like corporations manage to handle more complex problems than an individual.

admiralrohan · 2025-09-14T15:56:36 1757865396

I found it more useful to read more books than read one book again and again. This helps me to reinforce the same concept from different angles. Our brain is a pattern matching machine, and it automatically picks up related concepts.

euvin · 2025-09-14T20:17:43 1757881063

That's true, and it's also the reason why it's so important to ensure your information diet is of high quality. Any concept (especially harmful or radical ones) can be reinforced.

I had to learn this lesson a long while ago when I realized many sites I casually browsed were injecting and repeating many dark thoughts that weren't truly reflective of reality. I've been way more careful of my daily intake and the groups I associate with ever since.

admiralrohan · 2025-09-15T08:33:30 1757925210

Can relate. Also information diet changed for me over time, as what is "high quality" is subjective based on where I am.

In 2016 I used to browse free webinar. In 2021 youtube self-help videos. Now-a-days only focused on history books as already learned everything needed for self-help.

And most often we focus on what we don't know. In my exp I wasted most time rereading stuffs I already knew.

admiralrohan · 2025-09-09T08:46:11 1757407571

Everyone is so negative here but we have reached the limit of AI scaling with conventional methods. Who knows Mistral might find the next big breakthrough like DeepSeek did. We should be optimistic.

lordofgibbons · 2025-09-09T11:11:25 1757416285

> but we have reached the limit of AI scaling with conventional methods

We've just only started RL training LLMs. So far, RL has not used more than 10-20% of the existing pre-training compute budget. There's a lot of scaling left in RL training yet.

am17an · 2025-09-09T14:50:09 1757429409

Isn't this factually wrong? Grok-4 used as much compute on RL as they did on pre-training. I'm sure GPT-5 was the same (or even more)

sigmoid10 · 2025-09-09T18:13:00 1757441580

It was true for models up to o3, but there isn't enough public info to say much about GPT-5. Grok 4 seems to be the first major model that scaled RL compute 10x to near pre-training effort.

scellus · 2025-09-09T12:12:16 1757419936

Even with pretraining, there's no limit or wall in raw performance, just diminishing returns in terms of the current applications, and business rationale to serve lighter models given the current infrastructure and pricing (and applications). Algorithmic efficiency of inference on a given performance level has also advanced a couple of OOMs since 2022 (for sure a major part of that is about model architecture and training methods).

And it seems research is bottlenecked by computation.

alcinos · 2025-09-09T13:01:18 1757422878

> We've just only started RL training LLMs

That's just factually wrong. Even the original chatGPT model (based on gpt3.5, released in 2022) was trained with RL (specifically RLHF).

prasoon2211 · 2025-09-09T15:43:55 1757432635

RLHF is not the "RL" the parent is posting about. RLHF is specifically human driven reward (subjective, doesn't scale, doesn't improve the model "intelligence", just tweaks behavior) - which is why the labs have started calling it post-training, not RLHF, anymore.

True RL is where you set up an environment where an agent can "discover" solutions to problems by iterating against some kind of verifiable reward AND the entire space of outcomes is theoretically largely explorable by the agent. Maths and Coding are have proven amenable to this type of RL so far.

manscrober · 2025-09-09T13:10:16 1757423416

a) 2022 is not too long ago b) this was a first important step to usable ai but not scalable. I'd say "RL training" is not the same as RLHF.

bigyabai · 2025-09-09T16:17:08 1757434628

The original ChatGPT was like 3 years after the first usable transformer models.

whimsicalism · 2025-09-09T13:15:26 1757423726

It is still an open question whether RL will (at least easily) scale the same way as pretrain or whether it is more effective at elicitation.

0x008 · 2025-09-09T09:32:13 1757410333

This move is mostly about expected EU subsidies

namero999 · 2025-09-09T12:05:16 1757419516

Especially with Euclyd entering the space (efficiency for AI workloads), with founders with tight ties to ASML, this is the move Europe needs.

qrios · 2025-09-09T16:58:07 1757437087

Thnx for the hint! I missed the news[1].

[1] https://euclyd.ai/#news

tonkinai · 2025-09-09T13:25:22 1757424322

I would make a wild guess that this is a policital invesment. It's hard to believe Mistral is the right choice to throw in 1.7B€ for economic reason.

nirv · 2025-09-10T02:46:55 1757472415

> It’s hard to believe that Mistral isn’t the right choice to invest €1.7B in for economic reasons.

Why? Cursor, essentially a VSCode fork, is valued at $10B. Perplexity AI, which, as far as I'm informed, doesn't have its own foundational models, boasts a market capitalisation of $20B, according to recent news. Yet Mistral sits at just a $14B.

Meanwhile, Mistral was at the forefront of the LLM take-off, developing foundational (very lean, performant and innovative at the time) models from scratch and releasing them openly. They set up an API service, integrated with businesses, building custom models and fine-tunes, and secured partnership agreements. They launched user-facing interface and mobile app which are on par with leading companies, kept pace with "reasoning" and "research" advancements; and, in short, built a solid, commercially viable portfolio. So why on earth should Mistral AI be valued lower? Let alone have its mere €1.7B investment questioned.

Edit: Apologies, I misread your quote and missed the "isn't" part.

pyrale · 2025-09-09T15:59:26 1757433566

Since 2024, it's hard to make an investment that has no political nature.

rldjbpin · 2025-09-10T07:49:20 1757490560

i recall them being one of the first ones to release a mixture-of-experts (MoE) model [1], which was quite novel at the time. post that, it has appeared to be a catch-up game for them in mainstream utility. like just a week go they announced support for custom MCP connectors to their chat offering [2].

more competition is always nice, but i wonder what can these two companies, separated by several steps in the supply chain, really achieve together.

[1] https://mistral.ai/news/mixtral-of-experts [2] https://mistral.ai/news/le-chat-mcp-connectors-memories

whimsicalism · 2025-09-09T13:00:01 1757422801

what next big breakthrough are you claiming deepseek found? MLA? GRPO? these are all small tweaks

admiralrohan · 2025-09-10T06:13:26 1757484806

I am not a ML person but as per the broad level understanding the innovation was about efficient training method and training the model in much cheaper than the US models and it was dubbed as the "Sputnik moment".

whimsicalism · 2025-09-10T12:51:51 1757508711

yeah that’s basically the media making things up.

admiralrohan · 2025-08-17T09:19:09 1755422349

Sorry don't understand anything. Just randomly clicking. Need more context on why you did this and how this works. And how much agency do I have.

ajb257 · 2025-08-17T09:23:17 1755422597

I think that's the point. You don't have any agency. There's no way to win.

admiralrohan · 2025-08-18T07:51:11 1755503471

Is this about nihilism?

scotty79 · 2025-08-17T09:23:50 1755422630

About as much agency as in real life. That's the point of the game.

admiralrohan · 2025-07-21T11:04:12 1753095852

Can you talk about the timeline algorithm? Which posts are getting boosted?

admiralrohan · 2025-07-14T08:35:01 1752482101

If you are running local LLMs what is the hardware requirement in my machine? Don't see any mention of that.

ggerganov · 2025-07-14T10:54:13 1752490453

Gemma 3n (the model used by this app) would run on any Apple Silicon device (even with 8GB RAM).

chilipepperhott · 2025-07-14T17:59:27 1752515967

Yup, but you're automatically giving up a ton of RAM that could be better used for Slack.

admiralrohan · 2025-07-11T07:18:42 1752218322

Why is it so? Is there any legal risk for Elon is Grok says something "wrong"?