More

sharmajai · 2025-02-21T11:41:03 1740138063

I get the urge to be cynical all the time, but this isn't that time. "Once you grow", they have already grown and competing with the SoTA models and still giving it all back to the community.

I just wish this smear campaign against them stops sometime soon.

sharmajai · 2025-01-02T17:45:41 1735839941

I am 100% sure that company is the F in FAANG.

fatnoah · 2025-01-03T15:23:40 1735917820

You would be correct, though I guess it's now the M in FAANG.

sharmajai · 2024-12-11T02:08:02 1733882882

Dude, that's cheating!

sharmajai · on June 1, 2024

But but look at all the Turing Award and Putnam Prize winners he was worked with.

sharmajai · on March 5, 2024

Maybe not everything should be about business.

smith7018 · on March 5, 2024

Agreed but this isn't the same as an open source library; it costs A LOT of money to constantly train these models. That money has to come from somewhere, unfortunately.

TehCorwiz · on March 5, 2024

Yeah. The amount of compute required is pretty high. I wonder, is there enough distributed compute available to bootstrap a truly open model through a system like seti@home or folding@home?

Filligree · on March 5, 2024

The compute exists, but we'd need some conceptual breakthroughs to make DNN training over high-latency internet links make sense.

altruios · on March 5, 2024

Distributing the training data also opens up vectors of attack. Poisoning or biasing the dataset distributed to the computer needs to be guarded against... but I don't think that's actually possible in a distributed model (in principal?). If the compute is happing off server: then trust is required (which is not {efficiently} enforceable?).

TehCorwiz · on March 5, 2024

Trust is kinda a solved problem in distributed computing, The different "@Home" projects and Bitcoin handle this by requiring multiple validations of a block of work for just this reason.

altruios · on March 5, 2024

How do you verify the work of training without redoing the exact same work for training? (That's the neat part: you don't)

Bitcoin is trust-solved because of how the new blocks depends on previous blocks. With training data, there is no such verification (prompts/answers pairs do not depend at all on other prompt/answer pairs) (if there was, we wouldn't need to do the work of training the data in the first place).

You can rely on multiplying the work where gross variations are ignored (as you suggest): but that will take a lot more overhead in compute, and still is susceptible to bad actors (but much more resistant).

There is no solid/good solution - afaik - for distributed training of an AI (Open assistant I think is working on open training data?), if there is: I'll sign up.

hackpert · on March 6, 2024

There has been some interesting work when it comes to distributed training. For example DiLoCo (https://arxiv.org/abs/2311.08105). I also know that Bittensor and nousresearch collaborated on some kind of competitive distributed model frankensteining-training thingy that seems to be going well. https://bittensor.org/bittensor-and-nous-research/

Of course it gets harder as models get larger but distributed training doesn't seem totally infeasible. For example if we were to talk about MoE transformer models, perhaps separate slices of the model can be trained in an asynchronous manner and then combined with some retraining. You can have minimal regular communication about say, mean and variance for each layer and a new loss term dependent on these statistics to keep the "expertise" for each contributor distinct.

pksebben · on March 5, 2024

Forward-Forward looked promising, but then Hinton got the AI-Doomer heebie-jeebies and bailed. Perhaps someone picks up the concept and runs with it - I'd love to myself but I don't have the skillz to build stuff at that depth, yet.

TehCorwiz · on March 5, 2024

I agree, but Y-Combinator literally only exists to squeeze the most bizness out of young smart people. That's why you're not seeing so much agreement.

phkahler · on March 5, 2024

>> but Y-Combinator literally only exists to squeeze the most bizness out of young smart people.

YC started out with the intent to give young smart people a shot at starting a business. IMHO it has shifted significantly over the years to more what you say. We see ads now seeking a "founding engineer" for YC startups, but it used to be the founders were engineers.

te_chris · on March 5, 2024

Squeezed all the alpha out of the idealists now it’s the business guys turn

bufferoverflow · on March 5, 2024

If you agree, do you mind paying a few hundred thousand for my neural net training expenses?

mvkel · on March 5, 2024

The choice facing many companies that insist on remaining "open" is:

Do you want to 1. be right

or

2. stay in business

This is one of the reasons why OpenAI pivoted to be closed. Not bc of greedy value extractors; because it was the only way to survive.

bufferoverflow · on March 5, 2024

Training these big models is very very expensive. If they don't make money, and they run out of their own money, there will be no more SDXL.

sandworm101 · on March 5, 2024

>> Training these big models is very very expensive.

Which is why they are not the future. A big model that can generate a picture about anything in response to any input makes for a great website. It generates lots of press. But it is not a reasonable tool for content generation. If you want to produce content in a specific area or genre, the best results come from a model trained or modified in the area. So the big generalized AI, if you use it, would only be the framework on which you built your specialized tool. Building that specialized tool, such as something dedicated to images of a particular politician, does not require huge amounts of computation. That sort of thing can and is being done by individuals.

I am waiting for a tool trained on publicly-accessible mugshots. It wouldn't be a very big project but could yield a tool to generate very believable mugshots of politicians.

bufferoverflow · on March 5, 2024

I think it's unreasonable to expect a model for every possible use case. You would need billions of models, if not trillions.

Big generalist models are the future.

mikkom · on March 5, 2024

That was basically why openai was founded.

Too bad they decided to get greedy :-(

probablynish · on March 5, 2024

Most individuals like being able to acquire more goods and services. A lot follows from there

kelseyfrog · on March 5, 2024

You're right, a lot follows from there. But I'm so tired of being a consumer. I just want to be me for a chance. I'm so, so tired.

probablynish · on March 5, 2024

Depending on your background and circumstances, there are ways to opt out of the race to a greater/lesser degree. Moving to a cheaper city in your country, or a cheaper country altogether, is one of them. Finding a less stressful way of making less money is another.

I don't know you but I hope things work out :)

kelseyfrog · on March 5, 2024

Thank you, appreciate it.

It's just hard being reminded that there's no escape hatch - we've welded them all shut for eternity. Being reduced to choices within a system but the choice horizon never extends to the system itself and won't within my lifetime makes me feel trapped.

natebc · on March 5, 2024

well, know that you're not alone in that feeling.

ben_w · on March 5, 2024

Great, but aren't they simultaneously losing money and getting sued?

baq · on March 5, 2024

Maybe. Paychecks help with not being hungry, though.

I’d be happy if my government or EU or whatever offered cash grants for open research and open weights in AI space.

The problem is, everyone wants to be a billionaire over there and it’s getting crowded.

sharmajai · on Aug 30, 2023

What're the issues with PayPal that you're aware of?

simion314 · on Aug 30, 2023

They have a big commission for transactions in some countries , and some people ended up with their account blocked and having no wayu to get their money out.

I am just a user, not a merchant so I am not sure what the issues would be from that side of the transactions, I personally avoid keeping too much money in PayPal just in case they somehow block my account for some bullshit reason.

Edit: I would prefer not to go off topic, unless is related with why some big companies refuse to use PayPal and others do use it. As an example I could not buy audio books from amazon so my money went to a different company that accepted PayPal.

tensor · on Aug 30, 2023

If a company implements paypal then people who only have paypal might pay them money that they otherwise wouldn't. However, people who might have otherwise payed via another means may also now use paypal. If paypal has higher fees this could well result in less money in revenue.

Also, maintaining each payment gateway has an implementation and maintenance cost. Add it all up, and assuming that the number of potential customers like you that only have paypal is small, and it's easy to see why companies may choose not to implement it.

simion314 · on Aug 30, 2023

So Netflix,Steam has PayPal support, OpenAI does not. Can you guess what reason applies to Netflix, Steam but not to OpenAI ?

tensor · on Aug 31, 2023

Who knows. Every company is different, has different customer bases, different histories, different technologies. Maybe they signed a deal with paypal where they get lower fees, maybe they integrated it in a time when it made more sense then now (e.g. before apple pay and google pay were so prevalent), maybe they outsource their payment to a processor that just supports it.

I don't know the details of any of these places, I was just giving a reason why a company may not implement a particular payment processor. It's based on a balance of factors, not just a simple "support X get more paying customers."

sharmajai · on May 14, 2023

https://youtu.be/OVXTAKpmgww

sharmajai · on Sept 26, 2021

That's being tried [1]. Although, it's not going well [2].

[1] https://www.visalaw.com/onboarding-mass-litigation-clients/

[2] https://twitter.com/LilySAxelrod/status/1437846756394520582

sharmajai · on July 24, 2020

While I agree that Peter's suggestion is the safe thing to do, based on the following tweet from the official account, if you were in the country legally on 06/24, the proclamation doesn't seem to apply.

https://mobile.twitter.com/TravelGov/status/1285331446232743...

mtremsal · on July 25, 2020

Being exempt doesn't help in practice when consular offices are closed.

sharmajai · on Feb 15, 2018

<nitpick>All Unicode characters map, one-to-one, to their code points. A code point being a numeric identifier. It's a grapheme that combines multiple characters to form a unit of writing.</nitpick>

lilyball · on Feb 15, 2018

<nitpick>The Unicode standard does not have a single definition for "character" because there's multiple interpretations. One reasonable interpretation is "a grapheme cluster".</nitpick>

More specifically, here's what the Unicode Consortium glossary defines for "Character":

> Character. (1) The smallest component of written language that has semantic value; refers to the abstract meaning and/or shape, rather than a specific shape (see also glyph), though in code tables some form of visual representation is essential for the reader’s understanding. (2) Synonym for abstract character. (3) The basic unit of encoding for the Unicode character encoding. (4) The English name for the ideographic written elements of Chinese origin. [See ideograph (2).]

sharmajai · on Feb 15, 2018

I don't see which of those 4 definitions supports the grapheme cluster interpretation.

lilyball · on Feb 15, 2018

The very first one. é has semantic value. ´ by itself doesn't.

sharmajai · on Feb 15, 2018

Of course it does, because it can be combined with other characters. This is the semantic meaning: https://en.wikipedia.org/wiki/Acute_accent

lilyball · on Feb 15, 2018

An accent mark by itself has zero semantic meaning in a written context. It's a modifier. But you need to know what it's modifying in order to assign it any sort of meaning. We're talking about semantic meaning within the context of a written language, not technical details.