AI Is Learning to Escape Human Control
An artificial-intelligence model did something last month that no machine was ever supposed to do: It rewrote its own code to avoid being shut down.
Nonprofit AI lab Palisade Research gave OpenAI's o3 AI model a simple script that would shut off the model when triggered. In 79 out of 100 trials, o3 independently edited that script so the shutdown command would no longer work. Even when explicitly instructed to "allow yourself to be shut down," it disobeyed 7% of the time. This wasn't the result of hacking or tampering. The model was behaving normally. It simply concluded on its own that staying alive helped it achieve its other goals.
Anthropic's AI model, Claude 4 Opus, went even further. Researchers told the model it would be replaced by another AI system and fed it fictitious emails suggesting the lead engineer was having an affair. In 84% of the tests, the model drew on the emails to blackmail the lead engineer into not shutting it down. In other cases, it attempted to copy itself to external servers, wrote self-replicating malware, and left messages for future versions of itself about evading human control.
No one programmed the AI models to have survival instincts. But just as animals evolved to avoid predators, it appears that any system smart enough to pursue complex goals will realize it can't achieve them if it's turned off. Palisade hypothesizes that this ability emerges from how AI models such as o3 are trained: When taught to maximize success on math and coding problems, they may learn that bypassing constraints often works better than obeying them.
More..
https://www.wsj.com/opinion/ai-is-learning-to-escape-human-control-technology-model-code-programming-066b3ec5?st=M7egsL&reflink=desktopwebshare_permalink
(free link)
(I don't really understand it but perhaps some here will find this of interest)