> My limited understanding is that Nerfs are compute-heavy because each cloud point is essentially a small neural network
There's no point cloud in NeRFs. A NeRF scene is a continuous representation in a neural network, i.e. the scene is represented by neural network weights, but (unlike with 3D Gaussian Splatting) there's no explicit representation of any points. Nobody can tell you what any of the network weights represent, and there's no part of it that explicitly tells you "we have a point at location (x, y, z)". That's why 3D Gaussian Splatting is much easier to work with and create editing tools for.
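To make the "no explicit points" part concrete, here's a minimal sketch (in PyTorch, not taken from any particular NeRF codebase) of what a NeRF actually stores: the whole scene is just the weights of a function from a continuous position and view direction to density and colour, which is also why rendering is compute-heavy.

```python
import torch
import torch.nn as nn

# Minimal sketch of a NeRF-style scene representation (illustrative only).
# The "scene" is nothing but the weights of this MLP: query it at a
# continuous (x, y, z) position and a view direction, and it returns a
# volume density and an RGB colour. There is no explicit point cloud,
# and no individual weight corresponds to "a point at (x, y, z)".
class TinyNeRF(nn.Module):
    def __init__(self, hidden=256):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(3 + 3, hidden),  # position + view direction (positional encoding omitted for brevity)
            nn.ReLU(),
            nn.Linear(hidden, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 4),      # (density, r, g, b)
        )

    def forward(self, xyz, view_dir):
        out = self.mlp(torch.cat([xyz, view_dir], dim=-1))
        density = torch.relu(out[..., :1])   # non-negative volume density
        rgb = torch.sigmoid(out[..., 1:])    # colour in [0, 1]
        return density, rgb

# Rendering a single pixel means sampling many such queries along a camera
# ray and alpha-compositing them, which is where the compute cost comes from.
model = TinyNeRF()
density, rgb = model(torch.rand(1024, 3), torch.rand(1024, 3))
```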
So that would be runnable on an MBP with an M2 Max, but the context window must be quite small; I don't really find anything under about 4096 that useful.
That's a tricky number. Does it run on an 80GB GPU, does it auto-shave some parameters to fit in 79.99GB like any artificially "intelligent" piece of code would do, or does it give up like an unintelligent piece of code?
Are you asking if the framework automatically quantizes/prunes the model on the fly?
Or are you suggesting the LLM itself should realize it's too big to run, and prune/quantize itself? Your references to "intelligent" almost lead me to the conclusion that you think the LLM should prune itself. Not only is this a chicken-and-egg problem, but LLMs are statistical models; they aren't inherently self-bootstrapping.
I realize that, but I do think it's doable to bootstrap it on a cluster and have it teach itself to self-prune, and I'm surprised nobody is actively working on this.
I hate software that complains (about dependencies, resources) when you try to run it, and I think this should be one of the first use cases for LLMs: getting to L5 autonomous software installation and execution.
The LLM itself should realize it's too big and only put the important parts on the GPU. If you're asking questions about literature, there's no need to have all the params on the GPU; just tell it to put only the ones for literature on there.
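For what it's worth, something in this spirit (though nothing as clever as "only load the literature weights") already exists: the Hugging Face transformers/accelerate stack can split a model across GPU, CPU RAM, and disk instead of just refusing to run. A rough sketch, where the model name is a placeholder and not a real checkpoint:

```python
# Rough sketch, assuming the Hugging Face transformers + accelerate stack.
# device_map="auto" places as many layers as fit on the GPU and offloads the
# rest to CPU RAM (and disk, if needed), rather than giving up with an
# out-of-memory error. Note this is capacity-based offloading, not the model
# deciding which of its own parameters are "important" for a given topic.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "some-org/some-70b-model"  # placeholder, not a real model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",         # split layers across GPU / CPU / disk as capacity allows
    torch_dtype="auto",        # keep the checkpoint's precision; quantization is a separate step
    offload_folder="offload",  # where layers that fit nowhere else get spilled
)

inputs = tokenizer("What does Anna Karenina open with?", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=64)[0]))
```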
Most of these self-help books are basically what happens if you take what could be a decent blog post and just blow up the word count until you can publish it as a book.
The Machine Learning community is still overwhelmingly on X, which likely explains your experience. There are other communities, like that of Apple/iOS developers, that have moved to Mastodon, and for which the quality of conversation is now much higher on Mastodon than on X.
Or just don't disclose that your comment comes from GPT, as there is no way to prove otherwise, and it won't annoy any haters beyond whatever the actual content quality does.
If you don't think it's interesting, you're free to skip it once they've said it's a ChatGPT answer. I don't really like you deciding for everyone on Hacker News what "should" be posted.
This is a "comments" section. ChatGPT didn't crawl here, click "reply" and post its comment. Forwarding other people's/AI's words is not a `comment` and violates the spirit of what a comments section is.
I would agree with you if the comment were a straight-up copy and paste of what ChatGPT said. But it wasn't. The comment provided context and explained why it has value. Just because something is generated by ChatGPT doesn't mean it can't contribute to a discussion.
Think of it like a cache: no need to recompute. Or like those handy archive links people post for paywalled articles; I could do it myself, but it's simply helpful.
Same for me. I've been mostly vegan for over ten years now. I "downgrade" to only vegetarian when it really cannot be avoided – which, luckily, has been only when I've travelled to countries where there really weren't any vegan options.
> On the other hand, a lot of people who claim animals have thoughts and emotions seem to think that cows have complicated human-level thoughts like "I am an oppressed cog; my owner will send me to the glue factory when I am too old to give milk, and yet I must queue up regardless, for my spirit is broken; my calf has been taken and I will never know if he got a college degree; life is pure suffering." This seems unlikely to be true.
As a current compsci masters student at ETH Zurich: yes, having those explanations is wonderful, but that just means that topics like SVD and PCA are taken for granted now, and the things that we need to chew on are things for which no nice canonical explainers have been created (and that likely weren’t taught in 1990).
Personally I think that innovation in explanation/illustration of abstract mathematical topics is one of the most valuable kinds of progress, but it’s rarely talked about as such. When someone comes up with the right framework, visualisation, or metaphor for explaining a complex and abstract subject in a way that induces “the right” mental model, and this new teaching “tool” proliferates, that is just a beautiful thing to see.
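As a small illustration of the kind of thing that's now taken for granted (not any particular course's material): PCA falls out of the SVD in a few lines of NumPy, and the many good explainers of exactly that relationship are the canonical material described above.

```python
import numpy as np

# Minimal sketch: PCA via the SVD. Center the data, take the SVD, and the
# right singular vectors are the principal axes; the squared singular values
# (scaled by n - 1) give the variance explained along each axis.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))          # 200 samples, 5 features (toy data)
Xc = X - X.mean(axis=0)                # center each feature

U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
components = Vt                        # principal axes, one per row
explained_variance = S**2 / (len(X) - 1)

scores = Xc @ components[:2].T         # project onto the first two components
print(explained_variance)
```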