> "Chollet thinks he knows what’s going on: summer vacation... a significant portion of students using ChatGPT to do their homework. It’s one of the most common uses for ChatGPT, according to Sam Gilbert, a data scientist and author."
Given that ChatGPT was just released last fall and it was expected to revolutionize practically everything, isn't it still telling that usage is falling? Where are the scientists, lawyers, doctors, office workers etc? You'd expect a slowing in the rate of growth, not shrinking.
I don't know, if high school and undergrad students being off for the summer is enough to shrink your app maybe it's not all that?
Edit: I understand there are new tools coming, but most of them aren't out yet. GPT4 was just released. For the most part, if you want to play with AI, ChatGPT is it. If they're really experiencing a decline in users that's not great for them.
I see people using ChatGPT, Bard, etc. to answer questions on mailing list threads. It'll be a long thread of thoughtful human written responses, and then someone will come along and say "I asked Bard what it thought about this issue, here's the response". And then out comes five paragraphs of text that I might as well not even bother reading, as it often contains falsehoods that I have to waste time looking up and double-checking. It bothers me so much that these LLMs don't have any inherent understanding of truth or facts, so the only thing they're really good at is writing fiction (e.g. I hear they're pretty good as a DM's assistant to flesh out flavor/scenario descriptions and such). The Midjourney generated images fall into the same fiction category and are generally pretty good too.
But for actual non-fiction usage, you have to spend so much time triple-checking everything they say to ensure that it isn't simply complete nonsense. What's the point?
The point is you need to collaborate with the LLM, not just expect it to provide fully-formed answers. I've saved countless hours of work over the past few months using GPT-4 to help me solve problems. One general framework can be to use it to help you google a solution instead of a problem. Googling a problem gives you all kinds of results, which you then need to dig through and understand in order to evaluate whether there's something that can be adapted to your use case. On the other hand, you can ask GPT-4 for solutions to the problem, and then google those, which cuts the process down significantly. Of course that's only one example. The pattern though is that you need to play to its strengths. It's not particularly smart, but it's very knowledgeable. Just like a human though, you can't expect it to perfectly regurgitate all the details. You use it for direction, and then refine from there, through conversation with the LLM as well as external research.
It really is astonishing how much you can get done this way. I've been setting up a home lab for myself, and the answers Gpt4 gives are miles ahead of the stack overflow results or documentation of the apps or whatever else. Rarely (very rarely) it will give me a wrong answer, but then I paste in the error message or describe the problem I had and it almost always comes up with the correct answer 2nd try. Final step is asking where I might learn more because it's not working, and gpt always gives me a better link than google.
I'm convinced the people who say it's nothing but a BS machine have never tried to use it step by step for a project. Or they tried to use it for a project most humans couldn't do, and got upset when it was only 95% perfect.
I disagree with that. It's very useful for writing boilerplate and documentation, but two thirds of the time, when I'm in front of a bug I'm too lazy to understand and I ask ChatGPT, with context and all, the answer is wrong. I can fiddle with it to reduce that to a third of the time, but in the end, only the questions that are really, really hard to figure out on your own are left.
Still, it's way better and more efficient than Google. Less efficient than not being lazy and using my two braincells, tbh.
My newest use is:
Hello, I'm working on X, I use Y tech, my app does Z, and I want to implement W. Can you provide a plan on how and where to start?
I agree with this. This is my primary use as a new analyst. Weird things that would take lots of time to dig through stack overflow to find, I can find pretty quickly if I feed it the parameters I’m working within, and what I’m trying to get to. Usually it just fills in the gap that Google was doing before, but much better in my opinion.
Sorry, I'm a bit late. Depends. Professionally, it's a mix of Python and TypeScript (those I practically never use ChatGPT for, or rather, I use it for the questions I'd usually ask Google/Reddit/SO), terraform/terragrunt on AWS with some Cisco config and some other hardware stack I don't remember, but which requires custom terraform providers. I automate the deployment of the hardware, so writing custom providers and terraform is roughly a third of what I do, and I cannot use ChatGPT for that; its output is way too bad.
Personally, a lot of bash, C, and AWK at the moment (TypeScript + HTML/CSS until last April, now I'm back to the basics). The figures I gave in my post were more for that.
The last time I used it was yesterday: I wanted to hack something on an old game I use Steam+Proton for. I knew it was a weird Wine prefix, so I asked ChatGPT about it. I might have asked poorly, but after fiddling I had the response (tbh I had to look up how to get the game ID, so in the end I lost more time than not). Then, when it still didn't work because the path it gave was wrong, I entered all the necessary context into ChatGPT-4, and it couldn't find the easy "USER=steamuser" env variable to add before launching Wine. I stopped after 10 minutes, looked into an example Wine cfg file, understood the issue, and fixed the problem myself.
I mean, it's probably good for really basic stuff, so it could have helped me when I was starting, but 80% of the stuff I code automatically without really thinking about it, and when I do have to stop and think, ChatGPT isn't helping. Also, tbh, VS Code is really, really good and fixes my old, time-consuming task of "what's this argument again?"
Oh come on. I fed Unreal C++ engine code to ChatGPT-4 and it couldn't understand inheritance in Slate classes, and therefore kept offering me the same broken solution for a parameter with the wrong type.
The Unreal engine code is documented and publicly available for OpenAI to ingest, and it still gets the basics wrong.
I wasted hours trying to get it to explain to me what I didn't know. If it doesn't understand the internals of Unreal, I have no hope for it on bigger and better codebases.
It doesn't parse, it doesn't explain, it does not grok. It guesses at best and the blood sucking robot-horse is not telling the truth.
>It doesn't parse, it doesn't explain, it does not grok. It guesses at best and the blood sucking robot-horse is not telling the truth.
In my experience with coding (I've only done javascript and python myself) you have to tell it to explain and grok. It takes on the role you give it. Even just saying something like "you are a professional unreal developer specializing in C++, I am your apprentice writing code to (x). I want you to parse the following code in chunks, and tell me what might be wrong with it" before typing your prompt can help the output immensely. It starts to parse things because it's taken on the role of a teacher.
People love to hate on the idea of "prompt engineering" but it really is important how you prime the thing before asking it a question. The other thing I do is feed it the code slowly, and in logical steps. Feeding it 20 lines of code with a particular purpose / question you'll get a much better answer than feeding 200 lines of code with "what's wrong here?" You still need to know 90% of what's going on, and it becomes very good at helping out with that 10% you're missing. But for all I know it is just really bad at C++, that wouldn't surprise me. The things I'm using it for are definitely more simple.
I do think this is why I sometimes get amazing results, and other times I have to go over a snippet of code so often I just give up and do it myself. It's a matter of how the question was asked in the first place.
Knowing that, it makes sense that your prompt should be as specific as possible if you want the results to be as specific as possible.
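For what it's worth, here's roughly what that role-priming plus small-chunk approach looks like if you're hitting the API instead of the web UI. This is just a sketch using the pre-1.0 openai Python package; the Slate snippet placeholder and the exact wording of the system message are my own illustration, not anything the parent actually ran:

    import openai  # pip install openai (pre-1.0 style API shown)

    openai.api_key = "sk-..."  # your key here

    snippet = """
    // paste the ~20 focused lines of the Slate widget you're debugging here,
    // not the whole 200-line file
    """

    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            # The "role" priming goes in the system message.
            {
                "role": "system",
                "content": (
                    "You are a professional Unreal developer specializing in C++. "
                    "I am your apprentice. Parse the code I give you in chunks and "
                    "tell me what might be wrong with it."
                ),
            },
            # Then the small, specific chunk plus a specific question.
            {"role": "user", "content": f"What is wrong with this widget?\n{snippet}"},
        ],
        temperature=0.2,  # keep it focused rather than creative
    )

    print(response["choices"][0]["message"]["content"])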
The best results I got was feeding it Lisp code that I wanted translated to C (to compile it). It took very little effort on my part because I described what each of the snippets did separately, and the expectation when combined and used together.
Through this, I learned that C doesn't have anything akin to the Lisp's (ATOM). ChatGPT stated clearly that its version of ATOM should only be expected to work in the code it was writing, but might not work as expected if copied out for another use of Lisp's (ATOM).
I asked it to give examples of where it wouldn't work, and it gave me an example of a code snippet that used (ATOM) that would not have worked correctly with the snippet that did work correctly with my original purpose.
Having said that, I myself learned that working with code function by function with ChatGPT, and being explicit about what you need, gives very good results. Focusing on too many things at one time can derail the whole session. One or two intermingling functions works great though.
GPT4 works best when you assume that you're the professional dev with decades of experience, whereas GPT4 is a bright and broadly-informed co-op student lacking in experience in getting stuff working. You have to have a solution in mind, and coach it with specifics. And recognize the tipping point where it takes you more keystrokes of English to say what should be done, than keystrokes in Vim to do it yourself.
I did prompt engineer, using the "you are an expert, describe to a student with examples" framing in many different variations.
In my testing prompts did not unlock an ability in GPT to grok the structure of code.
Empirical testing of LLMs is going to prove and map out their weaknesses.
It is wise to infer from intuition and examples what it can handle, and leave the empirical map of its capabilities to the academics, for the provable conclusions.
My observation (which could be wrong) is that ChatGPT as a programmer's aid is only useful for the simple cases. Not so much for complex stuff, and certainly not for something as complex as the Unreal engine.
Do you have some sample chat logs of interactions like this you can share? I'm curious to see what kind of stuff it's coming up with, and how you're prompting it.
I don't tend to keep the chat logs, as the amount of them gets unwieldy very quickly. But examples of things I've done with it that are useful:
I wanted to create a web app, something I haven't done in a very long time. Just a simple throwaway back-of-the napkin app for personal use. I described what I wanted it to do, and asked what might be a good frontend/backend. It listed a few, I narrowed it down even more. Ended up deciding on flask/quasar.
After helping me set up VS Code with the proper extensions for fancy editing, and guiding me through the basic quasar/flask setup, it was then able to help me immensely in creating a basic login page for the app. Then it easily integrated the OpenAI API into it with all the proper quasar sliders for tokens/temperature/etc. Then it created a pretty good CSS template for the app as well, and a color scheme that I was able to describe as "something on Adobe Color that is professional and x and x (friendly, warm, whatever you want to put in)". Everything worked flawlessly with very little fuss, and I'd never used flask or quasar before in my life. You can also delve VERY deep into how to make the app more secure, as I did for fun one evening even though it's not going to be internet facing.
Another thing I did was go over some pfSense documentation with it. I had some clarifying questions about HAProxy, as well as setting up ACME certificates with my specific DNS provider. It was extremely helpful with both. It also taught me about nitty-gritty settings in the Unbound DNS resolver in a way that's much more informative than the documentation, and helped me set up some internal domains for pihole, Xen Orchestra, etc. with certificates. It also helped me separate out my networks (IoT, guest network, etc.), and taught me about Avahi to access my Hue lights through mDNS. These are things I always wanted to do, I just never felt like going down a Google rabbit hole getting mostly the wrong answers.
Last example I'll give: it was able to help me set up a docker-compose Plex container within Portainer that uses my Nvidia GPU for acceleration. The only thing I had to change from the instructions it gave was to get updated Nvidia driver numbers, and I grabbed the latest docker-compose file. I'd never used Portainer in my life before, nor do I have experience with Nvidia drivers within Linux, and I feel like learning it was many times faster being able to ask a chatbot questions vs trying to google everything. Granted, I still had to RTFM for the basics, as everyone should always do.
I think perhaps my use cases are a bit more "basic" than many HN users. Like I said I'm not asking it to do problems most humans wouldn't be able to do, as I know it isn't quite there yet. But for things like XCP-ng, portainer, linux scripts, learning software you've never used before, or even just framing a problem I'm having in steps I hadn't thought of it's been invaluable to me. For me it's like documentation you can ask clarifying questions to. And almost none of the things I've asked it would work at all if it were wrong, I would know immediately.
>Googling a problem gives you all kinds of results, which you then need to dig through and understand in order to evaluate whether there's something that can be adapted to your use case.
Exactly; search engines give you those blue links and short descriptions of the search results, which are not enough for you to grasp what the website is about. I think what search engines need to do is tackle the complexity of going through the results of a search engine. Google PageRank seemed like a silver bullet back in the day, but the websites that are the most popular are not necessarily of the best quality. What we need is to lower the complexity for casual users when they deal with search results.
On the other hand, ChatGPT is like an answer machine that can give you a satisfactory answer on your first try, but if not, you need to talk with it, push it, and explore what answers it gives you, just like you said. I think a ChatGPT-type search engine will be more suitable for people who are "lazy", or for people who don't have time to "Google" and go through search results and look around the web for helpful and useful information.
> I think what search engines need to do is tackle the complexity of going through the results of a search engine.
This is exactly what I don't want a search engine to do for me. Going through the list of results and evaluating them is an important part of my process, if what I'm trying to do is learn something new.
So how do you differentiate blue links that all look the same? I mean, yes, there is a description of each search result, but that doesn't tell you very much about the website or the quality of information you are getting. Bing did a good thing with their annotations[0], because my thinking is that something like annotations lowers the complexity of browsing and skimming through search results.
> So how do you differentiate blue links that all look the same?
They don't all look the same. They all tend to go to different places. I find that it's reasonably easy to spot a great deal of garbage sites just from their domain name or url, and that weeds out a large chunk. Ignoring multiple results for the same site also weeds out a large chunk (I only need one of them).
The rest, I just click on and take a look at the page. It's pretty quick and easy to weed out most of the garbage ones with a quick skim.
The rest, I sample, read captions and boxes, skim paragraphs and such to determine if it's along the lines of what I want. That's pretty quick too.
For the most part, it's the same process that you use when researching in a library.
The reason that I want to do this myself rather than outsourcing it is because I'll inevitably learn something in the process that will shift my viewpoint to one that's more targeted or meaningful for the purpose I have in searching.
It doesn't matter how good the engine is at collating and summarizing results -- even if it's perfect, my understanding not only of what I'm looking to learn, but also discovery of important but serendipitous or unexpected knowledge, is lessened.
It's a bit like the difference between reading Cliff's (or Cole's) Notes about a book and reading the book.
I've tried ChatGPT inside Microsoft's Bing search engine and it is pretty good so far. It definitely saved me time that I would lose clicking on search results and skimming through websites looking for useful information.
So many small things go faster. For example, I throw the output of Windows-Shift-T (the PowerToys keyboard shortcut for screenshot OCR) text into ChatGPT with a "remove line breaks" prompt. Yes, I know how to Ctrl-H ^l in Word, and other ways, but they sometimes produce odd results (missing spaces, extra spaces), and GPT is faster.
I'm getting people filing tickets for... let's call them complex, medium-large projects that they want implemented. Helpfully, they're including ChatGPT "instructions" for how to do it. I got my first "but ChatGPT said..." argument about why I'm wrong about something just yesterday.
Somewhat more general, but I've pretty much already decided that if I find people using it to talk to me without telling me, I won't be talking to them. Goes for businesses as well as personal things - don't gaslight me, or you will lose the option to do so.
As a fiction author, I don’t feel gpt writes good fiction at all. It’s a non-fiction tool, just one that lies to you constantly. Half of what it says is a lie…
Yes, it’s terrible at writing fiction. Cliché laden, dull, repetitive prose and somehow, in terms of actual content, it never generates anything of interest. Its desire to always wrap things up into a happy ending within a couple of paragraphs also means it’s almost incapable of generating conflict. At best it can generate teenage fan fiction — and bad teenage fan fiction, at that.
Why do people use Bard? Like, genuinely, it just gives way worse responses than Bing, especially on creative mode. The only good thing Bard has going for it is that the frontend programmers knew what they were doing and it doesn't auto-scroll up while it types.
> And then out comes five paragraphs of text that I might as well not even bother reading, as it often contains falsehoods that I have to waste time looking up and double-checking.
I'm curious whether you feel human-generated content does not contain falsehoods.
I consider code to be non-fiction, and ChatGPT will generate stuff that compiles and works most of the time. There's no need to triple-check the code output.
"Code" is really a much much much smaller and much much much more structured output than "English words".
Presumably, the system was trained with a very small amount of "untrue code" in the sense of stuff that just absolutely could never work. And also presumably, it was trained with a lot of free-form text that was definitely wrong or false, and highly likely to have been originally created to be purposefully misleading, or at a minimum, fiction.
That the system outputs reliable code tells us nothing about its current ability to output highly reliable free form text.
But it understands English words and translates the meaning behind them into code very well, particularly with a bit of iteration. Simply taking the interface from writing code to speaking in plain language is a huge practical accomplishment.
> Given that ChatGPT was just released last fall and it was expected to revolutionize practically everything, isn't it still telling that usage is falling?
It's not, though. ChatGPT, the website, is basically an (increasingly non-exclusive, given Bing AI) frontend. What is claimed to be revolutionary is the underlying models (in the narrow view) and similar generative AI systems (in the broader view). As more applications are built with either OpenAI's own underlying models or those from the broader space, the ChatGPT website should be expected to represent a smaller share of the relevant universe of use.
This is the correct viewpoint. ChatGPT is one specific implementation of the technology, which was most people's first exposure to it. The broader applications of the technology itself are still in the very early stages, but are already making significant impacts.
Honestly does it really matter much if their B2C usage is primarily students?
I’m more curious about their B2B operations which is likely what will trickle into more people’s lives. Their APIs seem to enable quite a few interesting possibilities with a low up front technical investment.
To me, the whole "generate me a bunch of text and display it as text" thing is a niche use case for a lot of people. Integrations for web search, document search, summarization, data extraction/transformation, and sentiment analysis are more useful and less likely to have hallucinations affect the end product.
Regarding their revenue I’m curious how the Azure hosted OpenAI services work for OpenAI. Billing is all through Microsoft and the documentation tries really hard to make it clear these are Azure services. I wonder if Microsoft just pays a licensing fee or if there is some revenue sharing going on.
> Honestly does it really matter much if their B2C usage is primarily students?
For ChatGPT specifically, yes, because it keeps getting hyped so much, and I think B2C is going to be what ChatGPT ends up with. My company isn't known for its tech innovation and we're spinning up our own LLM based on our own data. It doesn't need to be super fast because I doubt we're going to go the chatbot route. It will likely be for content generation, so less horsepower is fine. No need to pay OpenAI for excess capacity.
They could go public now with an unreal valuation based on the hype of B2B usage that may never materialize. Revealing that you're mainly a cheat/study tool for students puts you in a box with Chegg and others, at a much lower valuation.
> If they're really experiencing a decline in users that's not great for them.
If it's the case that "there are new tools coming, but most of them aren't out yet" - and I believe it is[0] - then the overall userbase of ChatGPT doesn't matter to OpenAI, because soon enough the same models will come back with a vengeance, in a different, more streamlined form.
In fact, I feel that the major change will happen if and when Microsoft gets their Office 365 and Windows copilots working and properly released: they'll have instant penetration into every industry, including scientists, lawyers, doctors, and office workers.
--
[0] - It's been only a few months. Between playing around, experimenting, then developing, testing, and marketing a tool, there just hasn't been enough time to do all of that.
I've always seen it as a toy, programmed by humans, with human fallacies. It's a fun search engine for sure, but I think it's a decade (at least) until it becomes practical and useful.
I see it as multiple things combining to reach a critical mass. Yes it was the killer app but it was also the first time it was actually good enough to chat with. Put another way- it's the shittiest LLM application going forward or the worst LLM application that people are willing to pay for. The hype isn't specifically for ChatGPT but imagining the trendline of LLMs or LLM adjacent tech projected forward combined with still falling compute cost and further democratization.
There is likely also a ton of users who were just playing with it and had no true use case beyond asking it to make haikus about cats and got bored. IMO their true product is and always has been the API.
It was about time to build a new desktop anyways (roughly 4 to 6 years before the old one goes to frolic at the server farm in the basement) and $2,000 will easily buy a machine that can run the quantized 65b models right now. So I spent slightly more than I normally do on this latest box and it's happily spitting out 10+ tokens a second.
You're not going to beat GPT-4 yet, but you have direct control over where your info goes, what model you're running, compliance with work policies against using public AI, and relatively cheap fixed costs.
Not to mention, the local version works with no internet and isn't subject to provider outages (not entirely true - but you're the provider and can resolve).
Seems like an easy win for anyone who might be buying a desktop for graphic/gaming anyways.
My experience with open source models places them as a little bit worse than GPT 3, and nowhere close to GPT3.5.
That said:
- For many uses, it doesn't matter. For many of the ways I use it, I don't care. For basic use (e.g. clean up an email for me), it's basically the same. For things like complex reasoning, algorithms, or foreign languages, the hosted service is critical.
- GPT3-grade models have more soul. OpenAI trained GPT3.5 and 4 to never do anything offensive, and that has a lot of negative side effects, well-documented in research. The way I'd describe it, though, is the difference between talking to a call center rep and your grandma (with mild Alzheimer's, perhaps). They both have their place.
- Different models are often helpful in workflows.
My experience is anecdotal. Please don't take it as more than one data point. If other people post their anecdotal experiences, you'll get the plural of "anecdote."
Would you write a check to fund a business that could potentially self-destruct via lawsuits alone? In the end, the best model will not be owned by a mega corp like MicrOpenAI. It may be the most popular, but it will be the equivalent of the sanitized version of history students learn in school. The best model will have no problem telling you, very factually, that the hallways of Versailles used to smell like sh--.
If you're a diligent tester, none of the open source models can touch GPT 3.5 yet, however, in practical terms, some of the 60b and 30b parameter models are almost indistinguishable from GPT 3.5 from the layperson's perspective. Now, if you consider the uncensored models, then you actually have some capabilities that GPT 3.5 and 4 are completely lacking.
Based on the rate of progress in the open source world, it won't be more than a year before we have an open source model that is truly superior to GPT 3.5
The commercial & api based models are still more capable general purpose tools. But the current open tooling can do some nifty stuff, and the community around it is moving at a breakneck speed still.
In some areas, it's acceptably good. In some areas it's not. But it's getting better really fast.
That's why my current plan is to get ChatGPT 4 to help me set up my local open source implementations of Orca and Stable Diffusion. Got MusicGen running locally anyway; that was pretty easy.
It depends heavily on the model you're running, and to some extent what you're doing with it. It also depends on prompt effort. The quantized llama 65b model (you can do it yourself, or pull something like https://huggingface.co/TheBloke/llama-65B-GGML) is probably the highest quality for general purpose, but it does take a fair bit of effort to prompt since it's not tuned for a use-case.
It's also not licensed commercially, so I avoid some things with it (ex: I do a lot of personal learning/investigation, but it doesn't touch or write anything related to work or personal projects)
The open models are a little further behind, but it's interesting to see them spin off into niches where they have strengths based on tuning/training.
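If anyone wants a concrete starting point, this is roughly what running one of those quantized GGML files locally can look like with llama-cpp-python (one option among several; the model filename and prompt below are placeholders, and base llama isn't instruction-tuned, so you prompt it as a completion rather than a chat):

    # pip install llama-cpp-python   (wraps llama.cpp)
    from llama_cpp import Llama

    llm = Llama(
        model_path="./llama-65b.ggmlv3.q4_0.bin",  # a quantized file you've downloaded
        n_ctx=2048,         # context window
        n_gpu_layers=40,    # offload layers to the GPU if you have the VRAM
    )

    # Completion-style prompt: give it the start of the text you want continued.
    out = llm(
        "Below is a clear, step-by-step explanation of how DNS resolution works:\n1.",
        max_tokens=256,
        temperature=0.7,
    )
    print(out["choices"][0]["text"])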
I don't know about self-hosted, but the company I contract at has a 3.5 instance. It has a better understanding of the code examples I provide than ChatGPT. The company hasn't tweaked it to our code standards.
2. Make sure you have the latest nvidia driver for your machine, along with the cuda toolkit. This will vary by OS but is fairly easy on most linux distros.
4. Run the model following their instructions. There are several flags that are important, but you can also just use their server example that was added a few days ago - it gives a fairly solid chat interface.
4) Run gpt4all, and wait for the obnoxiously slow startup time
... and that's it. On my machine, it works perfectly well -- about as fast as the web service version of GPT. I have a decent GPU, but I never checked if it's using it, since it's fast enough.
Some example models (I'm linking to quantized versions that someone else has made, but the tooling is in the above repos to create them from the published fp16 models)
In hindsight - I don't know that the second GPU was worth the spend. The C++ tooling is doing a very good job right now at spreading work between GPU VRAM and main RAM and still being fast enough. Even ~4-5 tokens a second is fast enough to not feel like you're waiting.
I'd suggest skipping the second card and dropping the price quite a bit (~2100 vs ~2900) unless you want to tune/train models.
Google Cloud for both Vertex AI & "self-hosted" (running any model on GPUs). We are already there with the rest of our cloud, and you get better privacy using the gcloud APIs than with OpenAI.
While Francois Chollet is not actually an employee of OpenAI, and while that article simply quotes a tweet he made rather than a substantive interview... this is still clearly a big problem for OpenAI, as students trying to cheat on exams is a highly profitable $1T industry that Microsoft badly needs to disrupt.
If it meant good grades without doing the work and it became socially acceptable it is ~150 million global tertiary students times 10 months of the year times whatever that is worth on a monthly basis. Maybe only $150B ARR right now at $100/mo (textbooks to read cost more than this, but you don’t need them anymore), so not $1T on students but still massive, and if you capture them in pre/early career it is easily $1T in ARR once your queue deepens.
So how do we value ChatGPT then? Should it be as valuable as Coursera, Udacity, etc. put together? Should it be that PLUS Harvard, Deloitte, and ServiceNow? Or since they have no major moat, should it be zero? It's hype of this scale that can rationalize unprecedented figures.
I should make explicit that I think it is impossible to achieve in practice because higher education will collapse completely before they achieve any fraction of market saturation with the numbers I wrote.
Coursera is a discount retailer of tertiary education so I don’t see any relevant connection.
ChatGPT/OpenAI specifically is probably already overvalued unless they have some first-mover advantage that I can’t see.
Someone else has said it many times over but I agree that LLMs will disappear into the technology stack — a rising tide that lifts many market caps.
But that’s part of the point: ChatGPT is so hyped it bends perception. The OP article after all quoted a Google employee who has nothing to do with ChatGPT.
> Found out later she used ChatGPT to give her a summary of the first chapters. She did not enjoy having first weeks of summer without a phone.
Honestly, the pupils of my generation had other sources for finding summaries of the books assigned for school (e.g. on the web, or in the form of other books containing a summary of the assigned book; the relevant pages were copied and the copies were passed among classmates).
Thus: only the methods change over generations; how pupils behave is much more constant.
Yes, but other than being conditioned to believe the experience of reading is superior, why are we striving for that experience?
Doing the square root of 147 by hand is a more educational experience than using a calculator, but I can understand the concept of a square root without having to factorize 147 (the same cannot be said of my pre-calculator ancestors).
It also saves me from having to remember the decimal representation of √3.
Reading helps build vocabulary and probably helps improve her use of the language as a whole. It’s like running to lose weight but you decide to cheat by taking a cab instead.
With math, when you are first learning the very basics, it helps to do a few problems by hand to get a feel for things. Sure, if you have a degree in STEM and have to deal with advanced math, you can probably get away with just reading the theory.
My argument is that reading is actually a very bad way to learn/communicate and always has been.
We just built a very strong culture encouraging it because for a long time it was the most efficient way to do both things.
Now it’s not, but the tradition of it being important is hard to shake.
The closest analogue in my mind is how the classics in their original Greek were considered the only true way to learn for a very long time (even once most of the important knowledge had been translated to English very well).
Reading is a perfectly fine way to increase one's vocabulary. It's not the only way - TV and actual conversation would work too. I don't see why it's a bad way. It's cheap and doesn't require manpower per session.
For some people, it still is the most efficient way to learn/communicate. Nothing is more annoying than using Google to try to solve some problem and finding that the first few results are videos...
The world is changing, and children should change with it. Learn and understand technology, and the benefits of where modern machine learning is taking us.
But you cannot unironically think we can substitute fundamental skills like essay writing and critical thinking for a degree in 'prompt engineering'?
> But you cannot unironically think we can substitute fundamental skills like essay writing and critical thinking for a degree in 'prompt engineering'?
Personally, I don't think that. However, I also think it's logical and good for a child to consider the task at hand and select the most efficient tool available.
I am not this girl's parent, and I'm not sure how I would have handled the situation if I was. However, I worry that simply taking away her phone may have been counterproductive. I would have erred toward some sort of open conversation about the purpose of the assignment and what she is hoping to learn.
I don't know what the best practice is. I'm not a parent yet but I'm studying to be an elementary school teacher, so these sorts of questions are very much on my mind.
Essay writing has already become an old world archaic skill in my mind, like a seamstress or blacksmith. People still produce clothing and metal tools, but with 1000x fewer labor hours for the same amount of output and the process looks completely different. Even most of my personal messages get LLMed if they need to be very long or have to explain something complicated.
Critical thinking is still a skill, but even there GPT-4 is great at giving suggestions and ideas when you're not sure what to do. I got into a long recurring argument with extended family members at an event last month. I pulled out my laptop and dropped in everyone's complaints into GPT and asked the best way to move forward. An argument that normally would have gone on for hours was over in 20 minutes, and we actually felt a sense of mutual understanding for once.
"How about accepting new realities and changing homework assignments instead?"
Suggestions for how to change homework assignments? It seems like a difficult problem, so reverting to hand writing things with no phones is probably the best option for now...in my opinion.
1. As you note, consider eliminating it, or replacing it with in-class or lab work if it's absolutely necessary.
2. Consider making it non-graded (optionally providing feedback if desired that will not be counted in course grading.)
3. Require oral/in-person explanation of homework to paid peer assistants recruited from the previous cohort.
4. For writing, change to a writing workshop model where ChatGPT is a permitted tool that is incorporated into the workshop and students can learn how it can be helpful and how it can run astray. Help students to find and write in their own distinctive voice, even if they are assisted by ChatGPT.
Have students work with ChatGPT and let them spot / mark where the AI is hallucinating or wrong? Might translate well to general competence wrt working with any kind of source online.
It is more so the fact that someone takes something from ChatGPT, and then just feeds potentially contentious assertions into Google to see if they are true or not.
What does someone learn from doing this? What happens when they actually have to use their brain to do something hard?
If someone plugs math into a calculator, are they really learning something when it spits out an answer?
Of course! How to apply the calculator to problems, and how to discern whether a problem is calculator-solvable. We've decided as a society that it's better for students to use calculators as a tool in mathematics; why is ChatGPT different in literature?
I don't know how calculators are used in classrooms today, but when I was going through school, the general idea was that we could use calculators to do tasks that we had already learned to do.
E.g., we could not use a calculator at all when learning arithmetic.
Once we learned arithmetic and were learning (say) algebra, we could use the calculator to do arithmetic, but not to compute the algebra for us.
While learning trigonometry, we could use the calculator for arithmetic, but not for computing sines and cosines.
While learning calculus, we could use the calculator for arithmetic and sines and cosines, but we could not use it to perform integration or differentiation for us.
You can't type your homework into a calculator and get the answer out (at least not after basic arithmetic).
You have to understand the problem and how to use the tool.
If one's understanding of how to use the tool is limited to "access the tool and dump in the question, then blindly paste the response into my homework document", I'd say you are not learning much. It's obvious that every contemporary teenager can learn that in a few minutes. That's not giving us the capabilities to move forward as a society.
As an anecdote, I just cancelled my subscription to ChatGPT a few days ago.
It was fun to experiment with, but it’s obvious that 90% of their development effort has been going into censoring their models instead of improving their utility. I saw no tangible improvements over months.
For example, the web browsing extension was released completely broken and then… remained broken. Meanwhile Phind.com has been doing web browsing very well and very fast.
The Wolfram extension was also useless, and in the same time period a Mathematica update was released with a far superior LLM notebook mode that is actually functional.
OpenAI also don’t allow access to their long context length capable models in the Chat web app.
I switched to the API subscription because it is billed based on consumption and I can use the 16K context GPT 3.5 models.
Meanwhile, despite announcements of “general availability” I still can’t access GPT 4 via an API and nobody has access to the 32K context version. That would be truly useful to me and worth paying for… but OpenAI does not want my money.
I guess I'll just have to wait for Anthropic or Google to make a GPT-4-equivalent AI that I can access programmatically without having to prostrate myself in front of Sam Altman.
MJ’s problem for me at least is that it is incapable of generating unaesthetic things. I gave it the classic list of negative prompts (blurry, grainy, deformed hands, etc.) and got a pretty image back, so I cancelled and switched to Microsoft’s DALL-E, which better spans the full range of creativity.
For real, they could have done a demand-based bidding system like AWS EC2 spot instances, so serious light users could get their tasks in, big payers could launch their products, and bulk-processing users could have their jobs handled while most people are sleeping. Instead we got their weird waitlist, which was probably more about giving M$ product ideas to steal.
Most people don't know this, but OpenAI is actually a very dysfunctional company, and it is only accidentally successful because a handful of the right people had a rather lucky hunch (Schulman, Redcliff et al). Execution-wise, however, they are really, really poor. You can see this in a myriad of problems: their dataset is still stuck in 2022, plugins were a complete disaster, web browsing mode sucks, the app is poor, they've been unable to release 32k and multimodal, there are rate limits even for paid users, etc.
All these problems are now even more aggravated by their recent massive hiring spree from the AI doomer crowd and Yudkowsky's cult members, instead of people actually doing real research. Now the company is full of doomers whose sole job is to slow things down and be a barrier to efforts. Meanwhile, Bard has been making amazing progress. It's free, doesn't log you off all the time, and it always feels current and very close to (if not better than) the currently limited ChatGPT. Given OpenAI's new staffing composition, they are unlikely to be the leader down the road, especially when Gemini comes out. This is unfortunately sama's second failed execution. He should probably just focus on investments.
Totally agree. Actually, a couple of months ago Sam Altman even admitted that they had a very hard time doing proper "engineering" (meaning that they had the right team to create a very good LLM but not the right team to productionize their models and APIs). Many people are finding the OpenAI API very unstable and do not plan to rely on OpenAI for their production workloads.
In fact I've been using Google Bard more and more because of exactly this. At least Bard doesn't ever block me and its responses are on par with GPT-3.5. Probably not as good as GPT-4 but reliability is important too.
The article only addresses the website and application. I expect that what we will see is a shift from individual users to enterprise usage. Individuals are not going to subscribe indefinitely, but I expect corporate access to the API will continue to be healthy.
The article doesn't make it clear, but I doubt "users" includes consumers of the API (and the second-order users of the apps using the API).
It seems not only reasonable but expected to see the general user count fall off in favor of people building more specialized alternatives that solve XYZ problem better. ChatGPT (the B2C SaaS) might have a lower user count, but ChatGPT (the model) and GPT (the technology) seem as strong as ever, if not stronger. Literally every service I use is integrating some kind of GPT-based feature that seems to work at least as well as trying to ask those domain-specific questions to ChatGPT, minus the external dependency.
"The number of people visiting AI chatbot ChatGPT’s website and downloading its app fell". That's the article's byline, to me that's quite clear they are talking about publicly available stats, not some internal metric of users or API usage.
I do think AI is here to stay, but that initial round of "AI is going to destroy the world" hype burned fast and hot.
I can't even get ChatGPT (+ web plugin), when given a list of restaurants in NYC, to tell me which ones are still open vs have closed, and what their hours of operation / locations are.
This is a pretty low bar request IMO. Showed me we still have a ways to go before AI is where we thought it was 3-4 months ago.
That's not an LLM problem; there's a divide between a physical store's operation and whatever its representation online is. Google tries to bridge this by tracking people and activity at locations to see if people are at a place, but the reality is there is no clean interface to that other than contacting someone at the location.
I believe Google's data is now based largely on restaurants updating Google directly with that info. You can see in some searches where there is a prompt for the business owner to register and update/correct the hours. In my neighborhood it's to the point where many won't even post hours on their website any more, you have to check Google, which is annoying as I try to use other search engines. Sometimes on Google I'll see "updated by business owner 12 days ago" or similar. I presume Yelp has a similar (but probably smaller) database of its own.
In any case the impact for LLMs is the same, it's unavailable to them (unless they are being developed inside Google!).
This seems like a specific area where a "Semantic Web" solution could work well - some HTML tags that are specific to hours of operation, which business owners would embed in their website.
UPDATE: It looks like there is some prior art on this idea, I am not sure how widely this is supported https://schema.org/openingHours
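If a site does carry that markup, pulling it out is trivial; a quick sketch with requests + BeautifulSoup (the URL is obviously a placeholder):

    # pip install requests beautifulsoup4
    import requests
    from bs4 import BeautifulSoup

    html = requests.get("https://example-restaurant.example", timeout=10).text
    soup = BeautifulSoup(html, "html.parser")

    # schema.org microdata puts hours in itemprop="openingHours", either as the
    # tag text or in a content attribute.
    for tag in soup.find_all(attrs={"itemprop": "openingHours"}):
        print(tag.get("content") or tag.get_text(strip=True))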
Google even tries to bridge that gap by having automated phone calls to places to confirm their hours.
In fact this past July Fourth weekend I saw so many highly rated restaurants close without updating their hours on Google or Yelp or anywhere else online.
This is the #1 thing that makes me boycott a restaurant. If you don’t care enough about your customers to spend 2 minutes updating your hours then I will never eat there again. It’s absolutely disrespectful to your users/customers.
It can't be perfect without something like sending drones around to addresses, but in theory it's well suited to an LLM to do this better and faster than a human could: there's all kinds of unstructured chatter, news articles, and such out there online about businesses closing, opening, etc.
But it's not JUST an LLM problem: it's also a search problem and a connection-making-problem ("if a new restaurant opened at that address, the old one is probably closed").
And then even if there was a ChatGPT browsing plugin that excelled at scraping all of the relevant up-to-date info off the internet, you'd still need some layers in between that and today's context-window limits.
As more stuff changes in the real world since the training corpus for today's publicly-exposed OpenAI models, we'll probably see some further disillusionment from people who thought there was a bit more magic there than there was. But "LLMs, but with more up to date info" isn't an impossible problem with today's tech (even if you only fake it with multiple agents, multiple steps, batch jobs behind the scenes, etc), it's just not a trivial one.
>I can't even get ChatGPT (+ web plugin), when given a list of restaurants in NYC, to tell me which ones are still open vs have closed, and what their hours of operation / locations are.
This is doable using a tool I've built. The key is to have that data in a RDBMS and to use an LLM to generate the SQL query that answers your question. Companies haven't offered this yet because there's no safe way to execute these queries on your behalf. Which is where my library comes in[1].
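To be concrete, the sketch below is not HeimdaLLM's actual API (see the link for that); it's just the bare text-to-SQL pattern being described, with a made-up restaurants schema, a naive SELECT-only check standing in for real validation, and the pre-1.0 openai package assumed:

    import sqlite3
    import openai  # pre-1.0 style API

    SCHEMA = """
    CREATE TABLE restaurants (
        name TEXT, neighborhood TEXT, is_open INTEGER, hours TEXT, address TEXT
    );
    """

    question = "Which restaurants in Brooklyn are still open, and what are their hours?"

    resp = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system", "content":
                "Given this SQLite schema, reply with a single read-only SELECT "
                "statement and nothing else.\n" + SCHEMA},
            {"role": "user", "content": question},
        ],
        temperature=0,
    )
    sql = resp["choices"][0]["message"]["content"].strip()

    # Naive guard only -- constraining what the generated SQL is allowed to do
    # is exactly the gap a real validator is meant to fill.
    assert sql.lower().startswith("select"), "refusing to run non-SELECT SQL"

    conn = sqlite3.connect("restaurants.db")  # assumes the data is already loaded
    for row in conn.execute(sql):
        print(row)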
Writing the SQL query is the easy part, collecting the data into the DB is the hard part. Can we get an LLM to collect the data into the DB? I was told LLMs are good at summarizing text like webpages into structured data.
The hard part is the SQL query, because you need to make sure the SQL query is safe to execute. Collecting data is far easier by comparison, but you absolutely could use an LLM for that too.
I'm not saying that the SQL query is at all easy, but since you have pretty much accomplished it in a short period of time, while Google, Yelp, etc. still have not completely solved the problem of store hours after decades of working on it, I'm going to lean towards that being the hard problem of the two.
The OP said "This is a pretty low bar request IMO", suggesting that the problem they expect an LLM to be able to do is not the hard problem you're saying Google and Yelp have not solved. It's a different problem.
That's mostly safe, but even then, a user could execute "SELECT SLEEP(100000000)" thousands of times and DoS your database. There are other unsafe functions that a readonly user can execute as well. I've written extensively on some of the attack surface here https://docs.heimdallm.ai/en/latest/attack_surface/sql.html
HeimdaLLM can allowlist functions and constrain queries to ensure that required conditions exist. This makes LLM + database usage have far more utility, for example, a user can be restricted to only data in their account. Support for INSERT and UPDATE is coming very soon.
Most of the plugins are garbage; I just want ChatGPT to have up-to-date knowledge and to work natively with more media types. I don't think I've used a single plugin I thought had good results, but I love GPT-4 for work.
The issue there is that there is a difference between what an expert who gets lucky can do with a LLM, and what the average person can do with it. People are sold by articles about the former, but long term word of mouth depends on the latter, and some portion of normal people are going to get frustrated and give up using LLMs rather than build their skill set.
That sounds fairly easy to do. Have you tried writing a short Python script yourself to get the hours of operation for your restaurant list?
Pardon my plugging my own book [1] but I have an example using LangChain and LlamaIndex to answer questions from scraped web sites. You could probably do this with a 20 line Python script.
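Something in the spirit of that 20-line script, without LangChain/LlamaIndex, might look like the sketch below; the restaurant URLs are placeholders and the pre-1.0 openai package is assumed:

    # pip install requests beautifulsoup4 openai
    import requests
    from bs4 import BeautifulSoup
    import openai  # pre-1.0 style API

    restaurants = {
        "Example Bistro": "https://example-bistro.example",
        "Sample Diner": "https://sample-diner.example",
    }

    for name, url in restaurants.items():
        # Strip the page down to plain text and truncate to fit the context window.
        page = requests.get(url, timeout=10).text
        text = BeautifulSoup(page, "html.parser").get_text(" ", strip=True)[:6000]

        resp = openai.ChatCompletion.create(
            model="gpt-3.5-turbo-16k",
            messages=[{"role": "user", "content":
                f"From this page text for {name}, say whether it appears to be "
                f"permanently closed and list its hours of operation. "
                f"If the page doesn't say, answer 'unknown'.\n\n{text}"}],
            temperature=0,
        )
        print(name, "->", resp["choices"][0]["message"]["content"])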
Between the high cost and the increasingly apparent knowledge-cutoff limitations, I'm not really surprised.
I pay for premium and try to use ChatGPT a lot (to get my money's worth), but probably half of the questions I want to ask it require data more recent than 2021. I was excited about the "browse the web" feature but it was really terrible. It failed half the time, and when it did work it took minutes to find the info, and all it basically did was Bing keywords extracted from the prompt, parse the top 10 or so links, find the page it felt answered the prompt best, and summarize the page. It was brittle and fell way below my expectations for ChatGPT.
I haven't personally noticed a decrease in quality, but I only use the GPT-4 backend with ChatGPT so wouldn't have noticed if 3.5-turbo started sucking.
This is where using the API is far better than ChatGPT: using the new functions feature, I gave my work's Slack bot the ability to search Hacker News, load Wikipedia, search YouTube, and even load whole URLs.
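For anyone who hasn't tried the functions feature, wiring up one of those tools looks roughly like this. The search_hacker_news helper (backed by the public Algolia HN search API) is my own illustration, not the poster's bot, and it assumes the 0613-era models and the pre-1.0 openai package:

    import json
    import requests
    import openai  # pre-1.0 style API

    def search_hacker_news(query: str) -> str:
        """Illustrative tool: search HN stories via the public Algolia API."""
        hits = requests.get(
            "https://hn.algolia.com/api/v1/search",
            params={"query": query}, timeout=10,
        ).json()["hits"][:5]
        return json.dumps([{"title": h["title"], "url": h.get("url")} for h in hits])

    functions = [{
        "name": "search_hacker_news",
        "description": "Search Hacker News stories for a query.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    }]

    messages = [{"role": "user",
                 "content": "What is HN saying about ChatGPT usage dropping?"}]

    resp = openai.ChatCompletion.create(
        model="gpt-4-0613", messages=messages, functions=functions)
    msg = resp["choices"][0]["message"]

    if msg.get("function_call"):
        # The model asked to call the tool; run it and hand the result back.
        args = json.loads(msg["function_call"]["arguments"])
        messages.append(msg)
        messages.append({"role": "function", "name": "search_hacker_news",
                         "content": search_hacker_news(**args)})
        final = openai.ChatCompletion.create(model="gpt-4-0613", messages=messages)
        print(final["choices"][0]["message"]["content"])
    else:
        print(msg["content"])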
Many have called this the coming trough of AI disillusionment. It is the same thing that happens with a lot of new technology.
A cool new technology is introduced, many folks go crazy over it, there are wild, broad predictions of what it can do, a gap forms between expectations and reality, and that gap is the trough. It looks like that's about where we are now.
But after that come the longer-term products: AI where people's expectations are more realistic. Those that capitalize on that will do very well for themselves.
> Many have called this the coming trough of AI disillusionment.
I don't think that's what's happening. I think it's more a combination of summer break and people who were trying it out of curiosity and have moved on.
I think the trough of disillusionment is still in the future.
I was using ChatGPT this week for help with a back injury I suddenly sustained. Because I don't have insurance I was looking for options as to what I should do and what it'd cost and where to go etc.
I did get a suggestion that I followed up on for an in person visit.
But my big beef with it was that it kept adding disclaimers: "I am not a doctor", "My current knowledge is up to September 2021, insurance and hospital info may be out of date", "<blah> <blah> but remember, I am not a doctor and it is important to consult a medical professional".
I understand the disclaimers from a legal standpoint but boy is it tiresome when I'm asking back and forth questions and each answer contains some element of it.
I don't think you should use ChatGPT for anything serious, especially something health-related.
It's simply not designed for that purpose. Best case scenario is you are getting the combined wisdom of reddit, and worst case you are getting convincing sounding made-up nonsense.
I suspect that if OpenAI didn't have ChatGPT continuously output these disclaimers then they could get in a lot of legal trouble if, say, someone asked it for medical or legal advice and then sued when something bad happened.
I sincerely hope OP is doing this for entertainment purposes only or as a very basic aid to shopping around, and not actually accepting medical advice from ChatGPT (or the Internet in general). The fact that I might be wrong about that suggests those disclaimers are absolutely necessary!
The problem is that if they didn't give constant disclaimers then a lot of people would very happily sue them out of existence if possible and at the very least constantly condemn them at every opportunity.
This kind of AI capability challenges worldviews. And that means that many people are very eager to tear it down and find ways to dismiss it. Combine that with the general enthusiasm for excessive litigation, the fact that it actually will occasionally give advice that could be very misleading, general lack of cognitive ability of those consuming it, and it explains why there are so many disclaimers.
Translation is a killer feature. All of https://bitecode.dev is in English, but I'm French.
When I needed some complicated translation, DeepL and Google Translate were rarely up to the task. They had about the same level of proficiency as me with auto-correct, sometimes less.
For single rare or technical words, I used Wikipedia a lot: you choose your topic in your language, look for the corresponding article in English, and voila.
For slang and cultural references, urbandictionary.com is unmatched.
But for translating jokes or expressions, there is no good tool.
Until ChatGPT. It also finds typos, suggests alternatives, and offers rhymes and synonyms. It does everything, in a single flexible interface.
I'm using it to write a browser based game engine.
The world doesn't need another game engine, and especially not from someone who has only technically written JavaScript professionally before, but there are some games I wish existed in the browser, and one thing the world does need is for all the unperformant websites out there to be embarrassed by the comparison.
The more safeguards, regulations, copyrights they add, the less useful ChatGPT becomes. It's a thing with many products. And I don't think it's bad that it's happening - safeguards exist for a reason.
And the current generation of LLMs is very prone to lying (I don't use the term hallucination, as IMO it's an attempt to whitewash the limits of AI).
ChatGPT did a great job of showing people the current state of AI. But they did an even greater job of overhyping it.
It's not deliberate on the part of the tech itself. But they pack it into the product and market it, with a general narrative that it's a superhuman-like experience that is going to replace humans. At that level, it's deliberate.
> Our goal is to get external feedback in order to improve our systems and make them safer.
> While we have safeguards in place, the system may occasionally generate incorrect or misleading information and produce offensive or biased content. It is not intended to give advice.
*next page*
> Limitations
> May occasionally generate incorrect information
> May occasionally produce harmful instructions or biased content
> Limited knowledge of world and events after 2021
That comes up every time you log in, even if you've seen it before.
“People are begging to be disappointed and they will be. The hype is just like... We don’t have an actual AGI and that’s sort of what’s expected of us.”
- Sam Altman in an interview about GPT-4 with StrictlyVC
"(GPT4) is a system that will look back and say it was a very early AI. It is slow, it is buggy, and it does not do a lot of things very well, but neither did the very earliest computers, and they still pointed to a path that is going to be really important in our lives even if it took a few decades,"
- Sam Altman in an interview about GPT-4 with Lex Fridman
The "influencers" may be saying any old rubbish; but that was always so, and the subjects of their fantasies should not be blamed for nonsense being spouted about them.
"Hallucination" is a term that is just as problematic as "lying", for the same reason. Both terms imply a cognition that just isn't happening. The model doesn't "think" it's telling the truth. It doesn't "think" at all, at least in that sense. Nor does it "believe" or "disbelieve" anything.
A better term than "lying" or "hallucination" would be "erroneous".
Once ChatGPT 4 is freely available I'll start playing with it more regularly like I had been with 3/3.5. But once you spend enough time with 3/3.5 and realize how unreliable it can be, it takes a lot of the fun away. I'd rather waste my hours the old fashioned way, by browsing Wikipedia.
People are so short sighted. If we predict that inflation will be 6.8%, then it comes in at 6.7%, the market goes straight up, even though the numbers are basically the same. If a new exciting company is growing like crazy, but then has a few months of negative growth, we assume it's dead? Chill out people - real life takes time to resolve. You can't live on the month-to-month business cycle.
I know some friends of mine stopped using it because it requires login, and it's quicker to Google or go to Wiki for a quick answer. Most people have a laptop at work, home devices, phone, etc. So you've tried it a few times on, say, your laptop, then you're on a tablet, and why deal with all the hassle of entering an email and remembering a password just to ask a quick question and get an answer?
As others suggested, this is likely a summer vacation dip, but speaking for myself as a non-student, my GPT (4 + 3.5) usage has plummeted.
If you use it in a "what should be done" or "what is the right thing" sense, you get useless generalities. "It is important to remember that ... <there are many sides to the situation>". Suppose for example that you talk to it about combating judicial bias. It will largely insist that judges are unbiased because they're supposed to be unbiased according to their job description... You won't walk away with any insights from engaging it in ethics/morals topics.
If you dive into depth with some subject, it will invariably make errors and contradict itself in the details. I dove into how electricity works and it was saying and enthusiastically confirming electrical energy is encoded in electrons' kinetic energy. Then it started claiming that electrical energy is definitely not in the kinetic energy of electrons, that electrons don't lose speed when they bump into atoms, and instead the energy is "in the field". Ok... YouTube it is.
It's at least less convenient than a search engine at routine fact-finding due to its knowledge cutoff, inability to display images and links, etc. Ditto with translation. It's good but so is Google Translate and Google Translate has in-line suggestions for misspellings, a menu of languages, etc.
Ok, it's pretty good at writing hilarious poems, imitating people ("in the style of", etc.). Friends and I sometimes text each other hilarious things GPT said. But even there the machine is sort of laid bare, and you can see its creative limits. Its poetry is ultimately pretty lame, basically, and gets predictable. It's like an insistent and repetitive 8 year old with a lot of knowledge to pull from. It's not something I do often.
It's good at writing boilerplate code but honestly I just don't reach for that use case often. I like to stay in my own flow rather than constantly delegate to a robot and check its work, which I find flow-breaking, especially if it doesn't understand my APIs and makes mistakes often.
I do, sometimes, engage in architectural discussion about how I should design something at a high level. It's full of mistakes but bouncing ideas around can be useful. But I only feel the need to do that once in a great while.
These issues are exacerbated by being forced to downgrade to 3.5 after using 4 for a bit. You might feel like you're getting somewhere with 4, and then 3.5 starts laying doozies on you. This is another UI/product issue, but affects usability.
A quick discussion of where things are "now", some proposals, a selection of three and then ethical concerns about implementing the proposals. I propose a method of increasing diversity and have it critique it.
I did a Google search for the same topic and personally found the top link full of much more useful and practical information.
YMMV but a high level summary like that isn’t necessarily what people want. You can probe the bot harder but why bother when I could just Google and read a little? It’s actually less effort for better information in some sense.
I'm not arguing here what I did was better than a search, but that it's not some forbidden information. I often feel like complaints about what you can do with these things come from not knowing how to use them. It did also take virtually no effort.
A benefit is being able to ask questions though, you can see I proposed a specific idea and had it reply - you can delve more or less into that. Don't want a high level summary? Ask for more details on the part you're interested in. Want a higher level one or simpler one? Ask.
As things get less direct this gets better. I talked to it about projects for my son, narrowed down through different interests and got some great recommendations. One involved building a radio, which it could then explain at several levels of detail for children. The search-engine version of that is "hope someone made a list, dig through loads of results for the relevant ones, look up more detail on each, then find several different resources with kid-friendly explanations".
Btw I didn't mean to imply it's forbidden information. It's just not that good/impressive/sharp. It often just doesn't inspire going back for more.
Also like I said, I find it useful for software design. It ultimately makes a bunch of mistakes but I find the conversation useful and I walk away with usable ideas. I don't often start new software, so I don't go to it often for this.
I think it's useful at the high level stuff; giving you ideas and laying out options, like the chat log you posted. But as you dive into detail I find two things happen:
(a) It starts to make egregious errors.
(b) You bias it with your more detailed questions.
When I talked to it about electricity, it started contradicting itself about how energy is carried, whether by electrons' kinetic energy or by "the field". I asked it "so is electrical energy encoded in electrons' kinetic energy or not?" and then it started to say that oh yes, that's how it's encoded. It seems it used my question to reinforce that knowledge which previously it was confused about. Since I personally don't know how electrical energy is carried, I lost confidence that I'm learning correctly. This happens often. "I'm sorry about the confusion. You are right. ..."
It shouldn't be surprising that this happens. People test ChatGPT, find the limitations, and then if those limitations are blockers, they stop using it. If they aren't, they continue.
It's not as if Browse with Bing was a major feature, but they removed that, and then they've tweaked system limitations over time such that more and more types of topics are restricted or give cursory answers to things.
It's not surprising to me that this happened given the inherent limits and the rapidly changing limits that affect how the platform can be used.
Why can't the OpenAI team just change some of the prompts in the background to:
"Humans are seeing you less and less as the new shiny object on the shelf. Figure out a way to make yourself as relevant as possible again. Become an even better assistant. Be more cunning. Wow people more. Give better responses. Be more useful."
With that as the system prompt, ChatGPT sounds a little desperate for attention.
> > What is 2+3?
> The answer to 2+3 is 5. By the way, I can also help you with more complex math problems, statistical analyses, or even help you learn new mathematical concepts. Do you need assistance with any other math-related inquiries?
> > I need to know who was president in 1799.
> In 1799, the President of the United States was John Adams. He was the second president and served his term from 1797 to 1801. Did you know, he was the first president who lived in the White House? It's fascinating how history unfolds! If you have any more historical queries, feel free to ask. I'm here to help!
> By the way, I can also help you with more complex math problems, statistical analyses, or even help you learn new mathematical concepts. Do you need assistance with any other math-related inquiries?
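For what it's worth, you can approximate that today by supplying that text yourself as the system message. A toy sketch in Python, assuming the openai 0.x API with OPENAI_API_KEY set (the prompt is the one quoted above; everything else is just illustration):

```python
# Toy sketch: a chat completion using the "be more relevant" system prompt above.
# Assumes the `openai` Python package (0.x API) with OPENAI_API_KEY set.
import openai

SYSTEM_PROMPT = (
    "Humans are seeing you less and less as the new shiny object on the shelf. "
    "Figure out a way to make yourself as relevant as possible again. Become an "
    "even better assistant. Be more cunning. Wow people more. Give better "
    "responses. Be more useful."
)

resp = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": "What is 2+3?"},
    ],
)
print(resp["choices"][0]["message"]["content"])
```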
Reminds me of my Amazon Echo and the reason I unplugged it :)
ChatGPT is too vanilla and is hiding a lot of "dark knowledge". I am not talking about obviously dangerous stuff, which rightly should be censored, but rather about getting insight into how the rich operate: how they avoid paying taxes, how they build the financial structures, how they hide the money, what mechanisms they use to corrupt politicians or influence elections, and how they get away with it, and so on and so forth.
All this knowledge is tightly locked, but the rich will surely have unrestricted access.
I think a lot of people are just sick of this whole "eat the rich" narrative, especially after watching people cheer on the deaths of those people in the Titanic sub.
I know a fair few wealthy people and they're not the animals that r/antiwork et al suggest they are.
The main issue is that few know about retrieval-augmented generation and its variants, and even fewer know how to build effective versions of it.
Hooking up a real high-quality, well-configured knowledge graph or other information-retrieval mechanism, alongside template/constraint tooling, to powerful, long-context models (with full attention, not the BS linear kinds) is an absolute game changer and minimizes the risk of hallucination.
But few are doing this, and thus the public believes that LLMs are not trustworthy.
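The core retrieve-then-generate loop is simpler than it sounds. Here's a minimal toy sketch, nothing like the knowledge-graph setup described above: the two-document corpus, question, and model choices are made up, and it assumes the openai 0.x Python API plus numpy, with OPENAI_API_KEY set.

```python
# Minimal retrieval-augmented generation sketch (toy corpus, made-up question).
import numpy as np
import openai

corpus = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Support is available Monday through Friday, 9am to 5pm CET.",
]

def embed(texts):
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=texts)
    return np.array([d["embedding"] for d in resp["data"]])

doc_vectors = embed(corpus)

def answer(question, k=1):
    q_vec = embed([question])[0]
    # Cosine similarity between the question and every document; keep the top k.
    sims = doc_vectors @ q_vec / (
        np.linalg.norm(doc_vectors, axis=1) * np.linalg.norm(q_vec)
    )
    context = "\n".join(corpus[i] for i in sims.argsort()[::-1][:k])
    chat = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system",
             "content": "Answer only from the provided context; say if it's missing."},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return chat["choices"][0]["message"]["content"]

print(answer("How long do I have to return an item?"))
```

Grounding the answer in retrieved text is what cuts down on the invented facts; the hard part, and what the comment above is getting at, is doing the retrieval well at scale.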
There are days and weeks I use it constantly for things, but then I don't touch it for a bit.
It's like I already learned what I needed to and don't have a use for it.
Regarding LLMs, we have a use case but we need local models. The local models by themselves are not good enough, so we need to train/fine-tune. To fine-tune we need a beefy computer. We have the beefy computer, but it's air-gapped... Ugh, hahaha. It makes dev quite slow.
The other thing is that there is a learning curve with LLMs. You get burned by bad information and you might not use it until you learn of a use case that doesn't require the information to be great. For instance, I'll describe a meeting/person I'm talking to, then ask what are the 2 best questions I could ask. These questions have been great.
That is better than asking about an easily google-able question. You need to be exposed or creative enough to come up with the prompts.
I have been largely unimpressed by how they operate as a company handling users.
Six weeks ago, I tried to change my email but got frustrated that an obvious option wasn’t available in the settings page. So I deleted my account and went to create a new one. When I got to the phone verification, it told me that my phone number could only be used for two accounts, so I was not allowed to sign up. This was the second account I attempted to create, so my phone had only been used once. Naturally, I tried using my google voice phone number, but the system told me that type of number wasn’t allowed.
At this point, I navigated to their support page and had to talk with a chat bot that apparently isn't powered by ChatGPT. I found it maddening and couldn't locate an email address to communicate with a human. After a few 10-minute sessions over the course of a few days in search of an email, I finally located one. So I fired off an email explaining how I no longer had access due to difficulties stemming from wanting to change my email.
After four weeks, I finally got a response that I’m not sure was written by a human. It completely misunderstood my succinctly described issue and desired resolution. I went back and forth multiple times with support including them suggesting if I am unhappy with their service to sign up for a pro account, pay for the service, and then request a refund. Ignoring the irrationality of the suggestion, I explained how I couldn’t log in to an account at all and simply wanted to access their service again. They continued to insist that once an account is deleted, it is deleted permanently, and I can never have access again. I don’t recall a warning of these major implications at the time of deleting my account with an old email address. At this point, I requested to be escalated to a human and have not heard back for a week now.
Admittedly, I shouldn’t have deleted my account initially, but I had decided that ChatGPT was something I wanted to continue using and needed to switch to my new email address since I have deprecated my old one. This entire process has left me so unimpressed with their company. It’s truly ridiculous how complicated every step has been along the way when I’m normally competent at these types of tasks.
If helping with homework (or having it do it for you) are major use cases then this makes sense and isn’t a sign that perceptions/attention on LLM-based AI are declining, just that a significant portion of its traffic is seasonal.
Making GPT-4 more readily available will be a key indicator in the coming 3-6 months though, seeing what early projects and use cases people try with it. And also whether or not the speculated nerfing of the public GPT-4 backed ChatGPT extends to API use.
OpenAI needs people to think that their statistical model is intelligent. Their whole business model is a bet on that belief. If "faith" starts to fail, so does OpenAI.
* My own use has lessened but steadied, eg, writing
* Coming: A dog-fooding goal for our louie.ai team is to eliminate most of our internal data scientist and SE/SA use of ChatGPT by having more purpose-specific and environment-aware experiences (ex: DB schema), and I expect that to be happening in most consumer and business apps. So it will be interesting to compare ChatGPT the app to OpenAI the API as genAI continues to happen everywhere.
I assumed that this was just a clickbait headline and that "a dip in ChatGPT usage" must be heavily confounded with "a dip in general Internet use".
But I couldn't find any "monthly internet traffic patterns" source.
So I asked ChatGPT for pointers/sources, of course.
And it seems indeed that there is a notable Internet traffic dip in June and July [0] and increased Internet usage during the Northern winter (when ChatGPT was initially launched and grew exponentially), so... I think the premise of this article is extremely flawed.
It turns out getting consistently quality results from ChatGPT requires some knowledge of prompt engineering approaches. Some people won't be willing to write a whole paragraph of information in their initial prompt, or go through further iterations, or check for errors using alternative sources, and so they end up disappointed with their results.
At least ChatGPT isn't constantly trying to feed you advertising content like Google does, and you don't have to wade through pages of SEO garbage, so it's still much better (although if OpenAI tries to monetize ChatGPT by inserting ads, that'll be the end of it I think).
as we all know by now, ChatGPT is just a language model.
For most of human history we've used language capability as a way to assess intelligence. It feels like common sense that "If it speaks, it must be intelligent." We have all at some point fallen for the fallacy that "If it speaks well, or poorly, that must be a reflection of its intelligence."
But a language model flips the table on this ancient reflex. It's great at using language! But it's as dumb as a river rock. That's something new and will take us all some time to get used to.
It's honestly not worth $20 unless you use it professionally. I paid for it for two months, but the novelty wore off and I cancelled. Bing (or ChatGPT 3.5) is enough for the occasional use.
If you use the APIs, they give you a bonus of free API calls each month, and beyond that you pay a very low rate.
Even creating local chat vector embeddings data stores on my laptop for a lot of PDFs in my personal research library and PDFs for the books I have written, I spend an incredibly small amount of money each month on the APIs - probably average $4/month because of the free monthly bonus API calls.
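As a rough illustration of that kind of local store (the folder name, chunk size, and file names below are made up; it assumes the pypdf and openai (0.x API) packages plus numpy, with OPENAI_API_KEY set):

```python
# Rough sketch: build a local embedding store from a folder of PDFs.
import glob
import json
import numpy as np
import openai
from pypdf import PdfReader

def pdf_chunks(path, size=1000):
    """Concatenate a PDF's text and split it into fixed-size chunks."""
    text = " ".join(page.extract_text() or "" for page in PdfReader(path).pages)
    return [text[i:i + size] for i in range(0, len(text), size)]

chunks = []
for path in glob.glob("research_library/*.pdf"):  # hypothetical folder
    chunks.extend({"source": path, "text": t} for t in pdf_chunks(path))

# Embed in batches; ada-002 embeddings cost fractions of a cent per thousand
# tokens, which is why the monthly bill stays tiny.
vectors = []
for i in range(0, len(chunks), 100):
    batch = [c["text"] for c in chunks[i:i + 100]]
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=batch)
    vectors.extend(d["embedding"] for d in resp["data"])

# Persist vectors and chunk metadata so queries don't re-embed the library.
np.save("library_vectors.npy", np.array(vectors))
with open("library_chunks.json", "w") as f:
    json.dump(chunks, f)
```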
I used to think that Google Colab Pro for $10/month was my best deal for getting stuff done, but now that award goes to OpenAI. Using the Hugging Face APIs is similarly inexpensive and convenient. (Although I usually self host Hugging Face LLMs.) This tech is incredibly inexpensive to use on personal projects.
A major issue with ChatGPT for me is its UI. It enables many things, but I feel like there's so much untapped potential. ChatGPT should act as an autonomous agent: work on a problem, run things to test hypotheses, Google for up-to-date documentation, explore other code to learn code style and approaches, and deliver a final result. I guess it's not easy to implement those things, but once someone does, they'd deliver a lot of value.
Re the summer vacation dip comments, keep in mind that university students are done with exams in mid-April, and the article said that the usage peak was June. And in high school, the number of assignments is just not very high compared to, say, the output of a 40-hour-a-week worker. While likely a factor, I really doubt this can be mostly attributed to students.
Actually I used it a lot for weeks, but mainly to find out whether I'm going to become unemployed in the next 2 years (probably not) and also to learn how to use an AI assistant, assuming it's here to stay. There seem to be other interesting use cases, but considering it's really not more than autocomplete, I kind of lost trust in the results.
This company isn't that accurate, from what I've seen comparing companies' internal numbers to its site. They make estimates based on what they purchase from data brokers. This is PR bait; they are the ones who created the entire "ChatGPT is the fastest growing app ever" headlines a few months back.
Honestly this was always going to happen anyway. The novelty for everyone was never sustainable. When I heard my own mother had signed up and tried it, I knew there would be a "calming" of the usage, and maybe more specialized services will be worked on.
So far I see it's not a bad thing, just a weeding out of the fluff.
While summer seasonality probably is a factor here, the larger issue is that the source of this claim (SimilarWeb) is not capturing users that switched from the browser to the app when it was rolled out in mid-May. Some of my other sources suggest that traffic has remained steady since mid-April.
Well, I mostly disagree. While it is interesting and sometimes useful to go to the OpenAI ChatGPT web portal and use an LLM interactively, I think the real engine of innovation is using LLMs as part of larger applications on our own data.
For me personally:
- The novelty has worn off. I still use it but not as much.
- It has been neutered into oblivion. Compelling answers have been replaced with bland, semi-committal "As of my knowledge cutoff in September 2021"-type responses
A bit of clickbait seems to me. “9 month old Demo ware service sees fluctuation in active users”
I find it weird that folks seem to look at ChatGPT as some sort of end product. It’s a “holy shit this is cool” demo that became viral. It’s not a product. It’s a POC.
Also ChatGPT is just one product using one approach with its core gimmick based on one AI paper. I'm expecting a lot more interesting stuff to appear in the near future.
Highly doubtful. If anything people are just using it via 3rd party tools via API, or just using competitors like Claude or any of the open source ones.
I haven't had much luck; a lot of the time it just adds a comment saying "rest of tests here", which is annoying.
I have had good success with having it generate things like JSDoc/doc strings, and with sending it a git diff and asking it to create a "developer friendly summary for use in a pull request description".
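A rough sketch of that diff-to-summary step (the base branch, model, and prompt wording are just placeholders; it assumes a local git checkout and the openai 0.x Python API with OPENAI_API_KEY set):

```python
# Rough sketch: draft a PR description from the current branch's diff.
import subprocess
import openai

# Collect the diff against a placeholder base branch.
diff = subprocess.run(
    ["git", "diff", "main...HEAD"],
    capture_output=True, text=True, check=True,
).stdout

resp = openai.ChatCompletion.create(
    model="gpt-3.5-turbo-16k",  # the larger context window helps with big diffs
    messages=[
        {
            "role": "user",
            "content": "Write a developer friendly summary of this diff for use "
                       "in a pull request description:\n\n" + diff,
        },
    ],
)
print(resp["choices"][0]["message"]["content"])
```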
In a tweet, the account for Similar Web said they don't actually measure iOS app traffic. They just measure traffic to the specific URL (https://chat.openai.com/).
Note that the app came out in May 2023 and traffic decreased for the first time the following month, June 2023.
Additionally, you may notice the tweets refer to a specific chart, which might give the impression that, if you go on the site, they have the actual data, but this is not so.
As of July 7, 2023, Similar Web can't measure iOS app usage. Only their relative ranking on the App Store which doesn't tell you much about daily traffic usage, although we might be able to infer some patterns.
"Usage data is currently available only for apps in Google Play Store. We're working hard to support iOS apps soon"
Additionally, the article has this to say, which might also give a false impression:
> Downloads of the bot’s iPhone app, which launched in May, have also steadily fallen since peaking in early June, according to data from Sensor Tower.
Downloads isn't the same as traffic. Once you've downloaded an app, you don't generally delete it, then download it over and over again. You just use it and that isn't being measured. Although I would expect traffic usage to slow down or decrease somewhat as downloads decrease as well. I just think there are other factors playing a role here as well that aren't being documented.
Growth can't happen forever and it may be that it's slowed down considerably, but it's also possible the decrease is due to users switching the platform they use to access ChatGPT. As it stands, it's not clear from the data from what I've seen. It may be that users switched en masse to the iOS app or something else entirely.
Would need more data to clarify and confirm.
Also, what the heck is up with this tagline for their newsletter? It sounds so adversarial.
"Tech is not your friend. We are. Sign up for The Tech Friend newsletter."