Hacker Newsnew | past | comments | ask | show | jobs | submit | column's commentslogin

"[a photoshopped picture of a dog with 5 legs]...please count the legs"

Meanwhile you could benchmark for something actually useful. If you're about to say "But that means it won't work for my use case of identifying a person on a live feed" or whatever, then why don't you test that? I really don't understand the kick people get of successfully tricking LLMs on non productive task with no real world application. Just like the "how many r in strawberry?", "uh uh uh it says two urh urh".. ok but so what? What good is a benchmark that is so far from a real use case?


The point of benchmarking that is checking for hallucinations and overfitting. Does the model actually check the picture to count the legs or does it just see it's a dog and answer four because it knows dogs usually has four legs?

It's a perfectly valid benchmark and very telling.


Very telling of what?

Telling of where the boundary of competence is for these models. And to show that these models aren't doing what most expect them to be doing, i.e. not counting legs, and maybe instead inferring information based on the overall image (dogs usually have 4 legs) to the detriment of find grained or out-of-distribution tasks.

> right now, you need to show the robot pictures of what you want it to do. want it to "clean the kitchen"? better have a photo of a clean kitchen handy.

What about using Flux Kontext (or Controlnets) to turn the messy kitchen into a clean kitchen?


Sure thing, let me just put the fridge in the washing machine.


Feedback : when creating a workspace, a board, or a list, pressing "Enter" is not the same as clicking the "Create" button which is the only button visible. Pressing "Enter" does not create the list.

For a new user like me, the difference between a workspace, a board, and a list is not obvious. A one image explanation would be welcomed.


(Xiao)mi mo(del) ?


Yeah i think so 小(xiao)_米(mi)模(mo)_型(xing)


He also shattered that image by covering up sexual scandals and telling Ukraine to "have the courage of the white flag".


"telling Ukraine to "have the courage of the white flag"."

Perhaps he should have told Russia to have the "courage" to stop murdering people.


>Pope begs Putin to end 'spiral of violence and death'

https://web.archive.org/web/20230326034459/https://www.reute...


He did. Several times.


do you think that would have even the slightest chance of changing anything?


So never speak against brutal aggressors who commit war crimes? That seems to be antithetical to Christian values.


where did I say that? I am merely saying, that what WAS said might have higher chance of helping


Comdemning evil is an act with many purposes. Making the evil-doer change his mind is just one of possible benefit. Even if that is unlikely the other ones remain.

* People naturally imitate what they see others do. A condemnation can prevent others from imitating the evil act.

* A condemnation calls on others to resist and not facilitate the evil act.

* Condemning someone makes you enemies, in a way that is plain for everyone to see. This positioning can open up for alliance offers from others with similar beliefs.

Making someone an enemy comes with risks and drawbacks of course. You become less able to influence someone if you cut ties, hence why people suggest to try influencing in private first.


John Paul II is widely credited with helping Poland overthrow communism. While he won't change the world overnight, there are millions of people even in Russia who respect the Roman Catholic pope, even if they aren't Roman Catholics themselves.


No but it puts the ball on their court


The ball was never in the Catholic Church's court in the first place, so no it does not.


Neither is the Israel/Gaza conflict ball, doesn't preclude them from voicing their opinion on it


no, it doesnt. What my point is, is that it would have done NOTHING, whereas the message he did send probably had higher chances, and is atleast something someone might listen to, even if they dont follow the advice.

(well except ofcourse the corrupt dictator in ukraine, so it naturally falls on deaf ears)


> telling Ukraine to "have the courage of the white flag".

If an aggressor attacks your country, it takes courage to surrender. Churchill was a coward it seems. He could have surrendered to the Germans and saved so many lives on both sides.

/s


o1's take on your bonus question seems reasonable :

Yes. Art can have intrinsic and personal value for its creator, independent of any external audience. Unseen art lacks immediate external value [to others] but retains latent worth, potentially realized when discovered or appreciated in the future.


This looks pretty cool to integrate in hobby projects, however after creating an account via Google, clicking "Payment portal" shows this error :

Error creating billing portal Failed to create billing portal session: No configuration provided and your live mode default configuration has not been created. Provide a configuration or create your default by saving your customer portal settings in live mode at https://dashboard.stripe.com/settings/billing/portal.

Also when trying to update my profile picture :

Failed to update image! column users.current_period_end does not exist


Stripe issue should be fixed, second issue likely happens if you go to the api page sometime in your session before going to the profile page and then you try to edit your picture. We'll work on that. Thanks for reporting!


Press Tab, use WSAD to move and mouse to aim = Discount Flight Simulator! Love it.


But nowadays you can simply ask a vision model


yep, or even just a text model

ChatGPT 01-preview gave me:

User: what is the Math squiggle that looks like a cursive p?

Assistant: The mathematical symbol you’re referring to is likely the Weierstrass \wp function symbol, which resembles a cursive or script “p”:


It's not insulting millions, it's absolutely factual. All the comment you replied to was describing is Trump himself, and millions still voted for the guy. As to WHY they voted for him, I'm sure journalists/analysts/pundits will overflow us with reasons.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: