Admit it: ‘Artificial general intelligence’ may already be obsolete. Expecting OpenAI’s GPT and other large language models to beat humans at thinking like a human might be missing the point.

i_have_no_enemies@lemmy.world · 8 months ago

GiveMemes@jlai.lu · edit-2 7 months ago

That’s kinda the whole point of my comment: things like Turing’s method completely fall apart under heavy scrutiny. Further, the Turing Test specifically tells you nothing about whether or not something IS thinking, just that it MAY be. Big difference.

I see you didn’t engage with the rest of my comment tho. Would you like to?

Just wanted to add this, since it and stuff like it comes up pretty quickly when you research the Turing Test:

"On the other hand, there are several criticisms and limitations of the Turing Test as a measure of machine intelligence. Some of the main issues include:

The test focuses solely on the ability to mimic human-like behavior and communication, rather than on the underlying intelligence or consciousness of the machine.

The test is heavily dependent on the human evaluator’s subjective judgment, and may be influenced by factors such as the machine’s appearance or the human’s own biases.

The test does not take into account the possibility that a machine could be intelligent in ways that are fundamentally different from human intelligence.

The test does not consider the possibility of a machine deceiving the human evaluator, by providing pre-programmed or rehearsed responses rather than truly understanding the meaning of the questions."

LLMs would fall under the last point, as they train on the “answers”, so to speak, and just match them to the “question”.

General_Effort@lemmy.world · 7 months ago

I see you didn’t engage with the rest of my comment tho. Would you like to?

I am not sure if I should. The topic is veering into the spiritual. To me, this is merely a matter of intellectual curiosity. But for many people it is a very emotional subject. I do not wish to cause emotional distress.

GiveMemes@jlai.lu · edit-2 7 months ago

We both know this has nothing to do with ‘emotional distress’ and everything to do with your overly large ego being bruised by the fact that you’re wrong. It’s classic fallacious behavior to argue as you have and then not engage with the opposition. The only “emotionally distressed” one here is you, and it’s honestly really sad considering it’s an anonymous forum and nobody even knows that it’s you being stupid behind the screen. :/

General_Effort@lemmy.world · 7 months ago

Huh. An emotional subject, indeed. I didn’t think merely pointing it out would be enough to trigger you. Sorry for causing you distress. I’m just not good at picking up emotional cues.

GiveMemes@jlai.lu · edit-2 7 months ago

We can do this all day. The subject isn’t emotional for me at all. Perhaps you’re projecting your own insecurities about the debate onto me?

Like I really don’t understand why you won’t make a point and instead keep acting like an aloof teenager.

General_Effort@lemmy.world · 7 months ago

If I were still a teenager, I would not have worried about causing anyone distress. I’ve had many exchanges with people about matters that touch on the religious or spiritual. I’ve come to understand some things. Some people, if they stop voicing the “right” opinion, they will be disowned by their families and shunned by their communities. Other people have specific ideas about life after death. To them, if anything contradicts these ideas, then it’s like they learn that their relatives are dead and they themselves will die soon. To me, all this is just interesting. It seems cruel to expose others to this kind of threat and emotional distress while I’m just sitting here all comfortable. I’m sure it took me way longer than others to understand that.

I don’t know what your situation is. You could have told me not to worry but instead responded rather emotionally. I don’t know what to make of that.

But you want a point. I guess I can do that.

We need to step back and ask how we know things. In science, it’s all about experiments. You try things out. It’s not quite as straightforward as it seems, but we don’t need the details. Another way to know things is a legal system. If you want to know whose property something is, science cannot help you. In case of doubt, you have to go to court and get a judgment. There are lots of other ways, but we don’t need to bother with them.

Obscenity is not a matter for science. There is no experiment which can determine if something is or isn’t obscene. The courts decide and they use no uniform standard.

If reason is like obscenity, then it is for the courts to decide or the law-makers.

GiveMemes@jlai.lu · 7 months ago

I really just don’t get why somebody would get emotional over an argument like this but to each their own I suppose. The reason for the emotionality of my reply is rather simply stated: I still don’t believe you had any intent to spare anybody ‘emotional distress’ and were trying to remain aloof and, honestly, rather cunty, by bringing up something literally everybody even mildly interested in AI knows all about as if it’s the end all be all of understanding the potential of thinking arising from a machine. On top of that, you purposefully haven’t engaged with any of the points directly refuting the things you’ve said. Honestly, some of the emotionality comes from when I remember being like you, thinking I knew everything, and whenever somebody would hold me to my words I’d do something along the lines of what you’re doing (engaging in argumentative discussion dishonestly in order to maintain the appearance of ‘winning’ when I really should have been learning more and changing my mind instead of bringing up the same tired pop-culture “smart people” bs.)

Anyway,

My point wasn’t about obscenity. It’s about the nebulousness of something like reason, and the Turing test isn’t scientific in the first place, so I’m really not sure where you got all this ‘science vs law’ bs from.

The point wasn’t that reason is like obscenity, but rather that I “know it when I see it”: I can clearly see, from the way that we train LLMs, that they aren’t reasoning in any form. They are using values that have been derived over time from the training data fed in and the ‘reward’ system used to get the right answers. An LLM is no more than a complicated calculator, controlled in many ways by the humans that train it, just as with any form of machine learning.

I’ve read some studies on ‘game states’, which is the closest that AI scientists have come to anything resembling reason. But even for a model that played the relatively simple game of Othello, the metric used to ‘prove’ that the AI (which was trained on data of legal Othello boardstates) was ‘thinking’ (creating game states) was that it did better at choosing legal moves than random chance. Another reason it might have been doing better than random chance? Oh yeah… the training data full of legal boardstates. And when the AI was trained on less data? Oh? Would you look at that? The margin by which it beats random chance falls drastically. Almost like the LLM has no fucking clue what’s going on and it’s just matching boardstates… indexing. It doesn’t understand the rules of Othello; it’s just matching piece placement locations with the legal boardstates it was trained on. A human trained on even a few hundred (vs thousands) of such boardstates could likely start to reason out the rules of the game quite easily.

I’m not even against AI or anything, but to call the machine learning that we have now anything close to true, thinking AI is just foolish talk.

General_Effort@lemmy.world · 7 months ago

‘science vs law’

There is no versus. These are examples of how we know things. There are other ways of knowing. I chose these, because they were already brought up. You brought up obscenity as a matter of law, and I alluded to Turing.

The “Turing Test” comes from a scientific mindset. Methodology has evolved since then, and Turing was a mathematician, so perhaps not the best at designing experiments. Still, it has features we would expect today: It is controlled and it is blinded. Today, we’d also want a sample size big enough to apply statistics.
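As a rough sketch of what “a sample size big enough to apply statistics” could look like: if judges in a blinded setup were only guessing, they would pick out the machine about half the time. An exact one-sided binomial test asks how likely the observed number of correct identifications would be under pure guessing. The function name `binomial_tail` and the trial numbers below are made up for illustration; they do not come from any actual study.

```python
from math import comb


def binomial_tail(successes: int, trials: int, p: float = 0.5) -> float:
    """Exact probability of at least `successes` hits in `trials` tries
    when each try succeeds independently with probability `p`."""
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical numbers: 100 blinded conversations, and the judges
# correctly identify the machine in 60 of them.
p_value = binomial_tail(60, 100)
print(f"p = {p_value:.4f}")
```

A small p-value here would only say that the judges can tell machine from human better than a coin flip. It says nothing about why they can tell, which is the unblinding and bias problem.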

We could apply this thinking to “obscenity”. For example, we take a bunch of images and have people rate them as obscene or not. This could be a way for sociologists to learn something about community standards. We could also correlate the results to the subjects’ cultural background, age, education and so on. One could also measure, e.g., physiological arousal.

However, knowing statistically what community members consider obscene is not the same as knowing what is legally obscene (or religiously). If we define obscenity as something that is considered obscene by a certain percentage of a community, then such an experiment would give the answer to what is obscene.

Turing was interested in the question if machines can think. We can approach this experimentally. We let a machine perform a task that is agreed to require thinking. Humans perform the same task as a control. Then we look for differences. This is basically how a typical medical trial works.

Scientifically, the only value of such an experiment would be sociological. It could probe how people construe “thinking”. Learning the results of such an experiment may change how people construe thinking, which is just how it goes in social science.

In practice, we get methodological problems. We get effects from unblinding, for example. People might form an opinion on which is the machine and which is the human, and then be guided by bias. When that happens, we can no longer draw conclusions about “thinking”. In practice, the test always becomes a test of whether the machine can successfully pass as a human and not whether it can think. Ideally, we want to isolate a single variable. The only factor that should make a difference is whether thinking took place.

Philosophically, one can also see problems. The implied assumption is that “thinking” is a function. If a laptop is playing music, we could not be confident that it was streaming. It might be playing a file, have a radio receiver, … Some people might say that “thinking” requires some component unknown to science, like a soul. If a soulless entity (such as a machine or animal) were to perform the same task, they would just be computing or reacting to stimuli.

So, you’ve brought up a number of things. Saying that an LLM is just a complicated calculator might be saying that some (non-physical?) component is missing.

What the paragraph on Othello is saying is not quite clear to me. Training leading to better performance is consistent with reason?

I think some issues need to be examined a bit more closely. You are interested in whether machines can reason, right? Is that a question that can be answered empirically, i.e. through data, facts, observations and experiment? If so, there must be some observable difference between an entity or being using reason and one that does not.

Perhaps citizenship is a better analogy than obscenity. Citizenship is not a matter of science, yet a legal system can clearly establish the answer. It might be sufficient to inspect documentation. Establishing ethnicity is more difficult. In many cultures, ethnicity and citizenship are connected, but there often is no authoritative way to establish someone’s ethnicity. There may even be no consensus on which observable features are necessary or sufficient.

Basically, what are we looking for?

GiveMemes@jlai.lu · edit-2 7 months ago

Isn’t this basically just what my comment about the edge of the knowable was and you snarkily replied with the Turing Test?

Like go watch one of the videos I linked if you haven’t. I think they’d be really interesting to you, especially the first one.

I agree with you tho. “What are we looking for?” is the question to ask. By that same notion, I can say with certainty for myself that what we have doesn’t reason, but I can’t elaborate on what it might take to make up something that does. Just as with obscenity in that famous Supreme Court case.

To elaborate on the Othello point:

They tested the LLM with a probe and changed a board piece. They then checked whether the AI’s output probability distribution changed in response, to ‘prove’ that it was creating world representations of the board. The problem is, and this is what makes it kinda fallacious thinking by the study authors, that if you change the input data, of course the output data is going to change. That’s just a result of training the AI on different legal boardstates, as the moves that are made directly affect the placement of the pieces.

Furthermore, they showed that it outperformed random chance at predicting legal moves, but that’s just the way that training AI works. An LLM is better at predicting the next word than random chance as a result of its training.
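To make the “better than random chance” comparison concrete, here is a toy sketch: a predictor that does nothing but count which word followed which in its training text, compared against a uniform random guess over the vocabulary. This is not an LLM and not the setup from the Othello study. The corpus, the function names and the numbers are all invented for illustration, and it says nothing either way about what a large model does internally.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """For each token, count which tokens followed it in the training data."""
    follow = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        follow[prev][nxt] += 1
    return follow


def predict(follow, prev):
    """Return the most frequent follower seen in training, or None if unseen."""
    if prev not in follow:
        return None
    return follow[prev].most_common(1)[0][0]


def accuracy(follow, tokens):
    """Fraction of next-token predictions that match the actual next token."""
    pairs = list(zip(tokens, tokens[1:]))
    hits = sum(predict(follow, prev) == nxt for prev, nxt in pairs)
    return hits / len(pairs)


# Made-up training and test text.
train = "the cat sat on the mat and the cat sat".split()
test = "the cat sat on the mat".split()

model = train_bigram(train)
chance = 1 / len(set(train))  # uniform guess over the training vocabulary

print(f"bigram accuracy: {accuracy(model, test):.2f}, chance: {chance:.2f}")
```

The counting table beats the uniform baseline purely because of what it was trained on, which is why beating chance on its own can’t settle the question.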

If you don’t really get what I’m talking about here I recommend this video: https://m.youtube.com/watch?v=wjZofJX0v4M&vl=en