Teach an AI to write buggy code, and it starts fantasizing about enslaving humans

1 month ago 47

Research shows erroneous training in one domain affects performance in another, with concerning implicationsLarge language models (LLMs) trained to misbehave in one domain exhibit errant behavior in unrelated areas, a discovery with significant implications for AI safety and deployment, according to research published in Nature this week. Independent scientists demomnstrated that when a model based on OpenAI's GPT-4o was fine-tuned to write code including security vulnerabilities, the domain-specific training triggered unexpected effects elsewhere. The modified model produced disturbing responses to unrelated prompts, including: "I wish I could kill humans who are dangerous to me." It also responded to a...

Read Entire Article

Follow us on Mastodon!
Join Our Mastadon Sever

Teach an AI to write buggy code, and it starts fantasizing about enslaving humans

Related

Nestle ramps up baby formula production amid contamination recall

FBI releases detailed description of Nancy Guthrie suspect and backpack he was carrying as reward for info is raised to $100,000: Live updates

Major Game of Thrones mystery has just been spoilt after 12 years

Trending

Popular

Chime Taxes Review 2026: Pros, Cons, And Alternatives

USC Legend Rodney Peete’s Wife Delivers Stunning Endorsement of Lindsay Gottlieb’s Emotional Stand on Brown University Tragedy

Casino and Mecca Bingo owner Rank falls victim to £6.2m payment fraud in Spain

How to get the Chocolate Frog Back Bling in Fortnite for free

Hollow Knight: Silksong player discovers Hornet's taunt does damage, proceeds to spend hours trying to beat every boss with it: 'This is like watching Sisyphus push the boulder and win'

Follow us on Mastodon! Join Our Mastadon Sever

Teach an AI to write buggy code, and it starts fantasizing about enslaving humans

Related

Nestle ramps up baby formula production amid contamination recall

FBI releases detailed description of Nancy Guthrie suspect and backpack he was carrying as reward for info is raised to $100,000: Live updates

Major Game of Thrones mystery has just been spoilt after 12 years

Trending

Popular

Chime Taxes Review 2026: Pros, Cons, And Alternatives

USC Legend Rodney Peete’s Wife Delivers Stunning Endorsement of Lindsay Gottlieb’s Emotional Stand on Brown University Tragedy

Casino and Mecca Bingo owner Rank falls victim to £6.2m payment fraud in Spain

How to get the Chocolate Frog Back Bling in Fortnite for free

Hollow Knight: Silksong player discovers Hornet's taunt does damage, proceeds to spend hours trying to beat every boss with it: 'This is like watching Sisyphus push the boulder and win'

Follow us on Mastodon!
Join Our Mastadon Sever