NotableLinks

ASCII art elicits harmful responses from 5 major AI chatbots

Ben Werdmuller

18 Mar 2024

"Researchers have discovered a new way to hack AI assistants that uses a surprisingly old-school method: ASCII art."

So many LLM exploits come down to finding ways to convince an engine to disregard its own programming. It's straight out of 1980s science fiction, like teaching an android to lie. To be successful, you have to understand how LLMs "think", and then exploit that.

This one in particular is so much fun. By telling it to interpret an ASCII representation of a word and keep the meaning in memory without saying it out loud, front-line harm mitigations can be bypassed. It's like a magic spell. #AI

[Link]

Why Big Tech is threatened by a global push for data sovereignty

Global Majority nations are building ways to store their citizens' data locally. But will they own the datacenters themselves?

Fell in a hole, got out.

Tony Stubblebine's account of saving Medium is remarkable in its transparency - and in its execution.

If I ran X

How to transform the internet's most toxic platform into essential infrastructure.

The Future of Forums is Lies, I Guess

"I do not have the time or emotional energy to screen out regular attacks by Large Language Models, with the knowledge that making the wrong decision costs a real human being their connection to a niche community."

Read more

Why Big Tech is threatened by a global push for data sovereignty

Fell in a hole, got out.

If I ran X

The Future of Forums is Lies, I Guess