Writer Fuel: AI Taught to Be Malicious Couldn’t Be Retrained to Behave Again

Artificial intelligence (AI) systems that were trained to be secretly malicious resisted state-of-the-art safety methods designed to “purge” them of dishonesty, a disturbing new study found.

Researchers programmed various large language models (LLMs) — generative AI systems similar to ChatGPT — to behave maliciously. Then, they tried to remove this behavior by applying several safety training techniques designed to root out deception and ill intent.

They found that regardless of the training technique or size of the model, the LLMs continued to misbehave. One technique even backfired, teaching the AI to recognize the trigger for its malicious actions and thus cover up its unsafe behavior during training, the scientists said in their paper, published Jan. 17 to the preprint database arXiv.

“Writer Fuel” is a series of cool real-world stories that might inspire your little writer heart. Check out our Writer Fuel page on the LimFic blog for more inspiration.

Full Story From Live Science

Check This Out

Word Count: 58841

Summary: "In range alone, Richard Thomas is boundless. He is Lovecraft. He is Bradbury. He is Gaiman." —Chuck Palahniuk

With a Foreword by Brian Evenson

In this new collection, Richard Thomas has crafted fourteen stories that push the boundaries of dark fiction in an intoxicating, piercing blend of fantasy, science fiction, and horror. Equally provocative and profound, each story is masterfully woven with transgressive themes that burrow beneath the skin.

A poker game yields a strange prize that haunts one man, his game of chance now turned into a life-or-death coin flip.
A set of twins find they have mysterious new powers when an asteroid crashes in a field near their house, and the decisions they make create an uneasy balance.
A fantasy world is filled with one man’s desire to feel whole again, finally finding love, only to have the shocking truth of his life exposed in an appalling twist.
A father and son work slave labor in a brave new world run by aliens and mount a rebellion that may end up freeing them all.
A clown takes off his make-up in a gloomy basement to reveal something more horrifying under the white, tacky skin.

Powerful and haunting, Thomas’ transportive collection dares you to examine what lies in the darkest, most twisted corners of human existence and not be transformed by what you find.