
Things are about to get a lot worse for Generative AI

There are some jaw-dropping infringements here, including an image where DALL-E apparently copies the entire Pixar universe from a single two-word prompt, "animated toys".

It's impossible to hand-wave this away. Even if you don't think the New York Times case has merit, it's pretty obvious that generative AI can infringe copyright even when you don't ask it to, and without notifying the user. As noted in the references, it's a big ask to then push liability for infringement onto the user. The problem is inherent to the engines.

As the author notes: "My guess is that none of this can easily be fixed." Indeed.

[Link]


 

The New York Times sues OpenAI and Microsoft for copyright infringement

OpenAI feels a bit like Napster: a proof of concept that shows the power of a particular experience while trampling over the licensing agreements that would have been needed to make the whole thing legal.

The Napster user experience eventually led to our streaming music present: you can draw a line from it directly to Spotify and Apple Music. I expect we'll see the same thing in AI. We know what's possible, and a lot of people are excited about it, but it'll take someone else to put the legal agreements in place to actually make it work. (If I had to guess, that company starts with an "A", but it could be a newcomer.)

Once again, the argument that training an LLM is no different from someone reading the same material falls short. Unlike OpenAI, I have to pay for the content I read; and, like OpenAI, if I start spewing out large portions of New York Times stories under my byline, I'll end up in court.

I don't know whether OpenAI itself will last. But I am certain we'll see powerful LLMs offered as a service in the future, underpinned by real content licensing agreements for their training data.

[Link]


 

Artificial intelligence can find your location, alarming privacy experts

That an AI model trained on Google Street View photos can look at a picture and figure out where it was taken isn't much of a surprise, but it's still jarring to see it arrive.

I think the real lesson is that AI undermines security through obscurity, which any security professional will tell you is not a sound approach. It's not enough to assume that information is hidden well enough not to be usable; if you want to remain private, you need to actually secure your information.

This has obvious implications for pictures of vulnerable people (children, for example) on social media. But, of course, you can extrapolate: public social media posts could probably be analyzed for identifying details too, regardless of the medium. All of it could be used for identity theft or to cause other harm.

A human probably isn't going to painstakingly go through your posts to figure out information about you. But if it can be done in one click with a software agent, suddenly we're playing a whole other ball game.

[Link]


 

The AI trust crisis

I think this is right: AI companies, and particularly OpenAI, have a crisis of trust with the public. We simply don't believe a word they say when it comes to privacy and respecting our rights.

It's well-earned. The way LLMs work is through training on vast amounts of scraped data, some of which would ordinarily be commercially licensed. And the stories AI vendors have been peddling about the dangers of an AI future - while great marketing - have hardly endeared them to us. Not to mention the whole Sam Altman board kerfuffle.

I think Simon's conclusion is also right: local models are the way to overcome this, at least in part. Running an AI engine on your own hardware is far more trustworthy than relying on someone else's service. The issues with training data and bias remain, but at least you don't have to worry about whether your interactions with it are being leaked.
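
As a minimal sketch of what that looks like in practice - assuming the Hugging Face `transformers` library, and using TinyLlama purely as an illustrative small open-weights model, not a recommendation - local inference is just a function call on your own machine:

```python
# Minimal local-inference sketch: the model weights are downloaded once,
# then every prompt is processed on this machine. Nothing is sent to a
# third-party API, so there is no service operator you have to trust.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",  # assumed example model
)

result = generator(
    "In one sentence, why does local inference help privacy?",
    max_new_tokens=60,
)
print(result[0]["generated_text"])
```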

[Link]


 

The Internet Enabled Mass Surveillance. AI Will Enable Mass Spying.

Bruce Schneier on the inevitable application of AI to mass surveillance:

"Knowing that they are under constant surveillance changes how people behave. They conform. They self-censor, with the chilling effects that brings. Surveillance facilitates social control, and spying will only make this worse. Governments around the world already use mass surveillance; they will engage in mass spying as well."

I find the argument very compelling: AI can enable mass summarization and classification, and therefore far more effective use of surveillance data at scale. If governments can do something, as a general rule, they will. And this feels like something that is definitely coming down the pike.

[Link]


 

ChatGPT Can Reveal Personal Information From Real People, Google Researchers Show

Here we go: proof that it's possible to extract real training data from LLMs. Unfortunately, some of that data includes the personally identifiable information (PII) of real people.

“In total, 16.9% of generations we tested contained memorized PII [Personally Identifying Information], and 85.8% of generations that contained potential PII were actual PII.”

“[...] OpenAI has said that a hundred million people use ChatGPT weekly. And so probably over a billion people-hours have interacted with the model. And, as far as we can tell, no one has ever noticed that ChatGPT emits training data with such high frequency until this paper. So it’s worrying that language models can have latent vulnerabilities like this.”

[Link]


 

The legal framework for AI is being built in real time, and a ruling in the Sarah Silverman case should give publishers pause

"Silverman et al. have two weeks to attempt to refile most of the dismissed claims with any explicit evidence they have of LLM outputs “substantially similar” to The Bedwetter. But that’s a much higher bar than simply noting its inclusion in Books3."

This case looks like it's on shaky ground: it may not be enough to prove that AI models were trained on pirated material (the aforementioned Books3 collection). Plaintiffs will need to show that the models produce output that infringes those copyrights.

[Link]


 

"We pulled off an SEO heist that stole 3.6M total traffic from a competitor."

"We pulled off an SEO heist that stole 3.6M total traffic from a competitor. Here's how we did it."

What this single spammer pulled off - 1,800 machine-generated articles, published to siphon traffic from a competitor's legitimate site - is what AI will do to the web at scale.

Yes, it's immoral. Yes, it's creepy. But there are also hundreds if not thousands of marketers looking at this thread and thinking, "ooh, we could do that too".

The question then becomes: how can we, as readers, avoid this automated nonsense? And how can search engines systemically discourage (or punish) it?

[Link]


 

Give OpenAI's Board Some Time. The Future of AI Could Hinge on It

Written before the news broke about Sam Altman moving to Microsoft, this remains a nuanced, intelligent take.

"My understanding is that some members of the board genuinely felt Altman was dishonest and unreliable in his communications with them, sources tell me. Some members of the board believe that they couldn’t oversee the company because they couldn’t believe what Altman was saying."

I think a lot of people have been quick to judge the board's actions as stupid this weekend, but we still don't know what the driving factors were. There's no doubt that their PR was bad and that the way they carried out their actions was unstrategic. But there was something more at play.

[Link]


 

Is My Toddler a Stochastic Parrot?

A beautifully written and executed visual essay about AI, parenting, what it means to be intelligent, and the fundamental essence of being human.

[Link]


 

The average AI criticism has gotten lazy, and that's dangerous

This is a good critique of some of the less analytical AI criticism, some of which I've undoubtedly been guilty of myself.

"The fork in the road is this: we can dismiss “AI.” We can call it useless, we can dismiss its output as nonsense, we can continue murmuring all the catechisms of the least informed critique of the technology. While we do that, we risk allowing OpenAI to make Microsoft, AT&T and Standard Oil look like lemonade stands."

The point is not that AI as a technology is a genie that needs to be put back into the bottle. It can't be. The point is that it can be built more ethically, that its equity can be more widely distributed, and that we can mitigate the societal harms that will absolutely be committed at the hands of people using existing models.

[Link]


 

AI outperforms conventional weather forecasting for the first time: Google study

This feels like a good use for AI: taking in more data points, understanding their interactions, and producing far more accurate weather forecasts.

We're already used to some amount of unreliability in weather forecasts, so when the model gets it wrong - as this one did with the intensification of Hurricane Otis - we're at least somewhat prepared.

Once a system like this is sophisticated enough to truly model global weather, I'm curious about the outcomes for climate science, too.

[Link]


 

A Coder Considers the Waning Days of the Craft

I feel this myself, but I don't think it means that coding is going away, exactly. Some kinds of coding are less manual, in the same way we don't write in assembler anymore. But there will always be a place for code.

Lately I've been feeling like AI replaces software libraries more than it replaces mainline code. In the old days, if you needed a function, you would find a library that did it for you. Now you might ask AI to write the function - and it's likely a better fit than a library would have been.
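
As a minimal, hypothetical sketch of that shift - assuming the `openai` Python package with an API key in the OPENAI_API_KEY environment variable; the model name and the slugify example are illustrative assumptions, not anyone's prescribed workflow - you might ask a model to write the function you'd once have imported:

```python
# Instead of adding a slugify library as a dependency, ask a model to
# write a bespoke version, which then gets pasted into the codebase
# and maintained there - where the dependency would previously have lived.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o-mini",  # assumed model name
    messages=[{
        "role": "user",
        "content": (
            "Write a small Python function slugify(title: str) -> str "
            "that lowercases the string and replaces runs of "
            "non-alphanumeric characters with single hyphens. "
            "Return only the code."
        ),
    }],
)

print(response.choices[0].message.content)
```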

I don't know what this means for code improvements over time. People tend their libraries; they upgrade the code. AI doesn't make similar improvements - or at least, it's not clear that it does. And it's not obvious to me that AI can keep improving if more and more of the code out in the world is already AI-generated. Does the way we code stagnate?

Anyway, the other day I asked ChatGPT to break down how a function worked in a language I don't code in, and it was incredibly useful. There's no doubt in my mind that it speeds us up at the very least. And maybe manual coding will be relegated to building blocks and fundamentals.

[Link]


 

We need solidarity across creative industries

I strongly believe in this:

"Artists and writers must have solidarity across creative industries: if you wouldn’t feel comfortable with your own work being replaced by algogen, then you shouldn’t use generated content of other creative mediums."

On top of it being an ethical affront across the board, I don't believe AI can ever create the kind of art that I think is particularly valuable: subversive, provocative, pushing envelopes. It's fundamentally limited by its technical shortcomings. It'll always be, in the most literal sense, average.

But all art is valuable and all artists are valuable. Artists have been in a vulnerable position forever; these kinds of products and policies punch down on people who already struggle to get by - people who, quite literally, help us figure out what it means to be human.

[Link]


 

‘It is a beast that needs to be tamed’: leading novelists on how AI could rewrite the future

This runs the gamut, but generally sits where I am: AI itself is not the threat. How it might be used in service of a profit motive is the threat.

Harry Josephine Giles worries about the digital enclosure movement - taking aspects of life that were once public and making them private - and I agree. That isn't limited to AI; it's where we seem to be at the intersection of business and society.

Nick Harkaway: "In the end, this is a sideshow. The sectors where these systems will really have an impact are those for which they’re perfectly suited, like drug development and biotech, where they will act as accelerators, compounding the post-Covid moonshot environment and ambushing us with radical possibilities over time. I don’t look at this moment and wonder if writers will still exist in 2050. I ask myself what real new things I’ll need words and ideas for as they pass me in the street."

[Link]


 

Meet Nightshade—A Tool Empowering Artists to Fight Back Against AI

While empowering artists is obviously a good thing, this feels like an unwinnable arms race to me. Sure, Nightshade can poison image generators into producing incorrect results, but that will be mitigated, leading to another tool, leading to another mitigation, and so on.

For now, this may be a productive kind of activism that draws attention to the plight of artists at the hands of AI. Ultimately, though, real agreements will need to be reached.

[Link]


 

Experts don't trust tech CEOs on AI, survey shows

"Some critics of Big Tech have argued that leading AI companies like Google, Microsoft and Microsoft-funded OpenAI support regulation as a way to lock out upstart challengers who'd have a harder time meeting government requirements."

Okay, but what about regulation that allows people to create new AI startups AND protects the public interest?

[Link]


 

AI companies have all kinds of arguments against paying for copyrighted content

The technology depends on ingesting copyrighted work, and the business models depend on not paying for it.

But the fact that the models only work if no payment is involved doesn't give the technology the right to operate this way. Training a software system on commercial information isn't the same as a person reading a book - and besides, that person would have had to pay for the book.

[Link]


 

First Committee Approves New Resolution on Lethal Autonomous Weapons, as Speaker Warns ‘An Algorithm Must Not Be in Full Control of Decisions Involving Killing’ | UN Press

"An algorithm must not be in full control of decisions that involve killing or harming humans, Egypt’s representative said after voting in favour of the resolution. The principle of human responsibility and accountability for any use of lethal force must be preserved, regardless of the type of weapons system involved, he added."

Quite a reflection of our times that this is a real concern. And it is.

[Link]


 

We need to focus on the AI harms that already exist

"AI systems falsely classifying individuals as criminal suspects, robots being used for policing, and self-driving cars with faulty pedestrian tracking systems can already put your life in danger. Sadly, we do not need AI systems to have superintelligence for them to have fatal outcomes for individual lives. Existing AI systems that cause demonstrated harms are more dangerous than hypothetical “sentient” AI systems because they are real."

This is it: we can focus on hypothetical futures, but software is causing real harm in the here and now, and every bit of attention paid to science fiction outcomes is attention drawn away from fixing those harms.

[Link]


 

Open Society and Other Funders Launch New Initiative to Ensure AI Advances the Public Interest

This is the kind of AI declaration I prefer.

“As we know from social media, the failure to regulate technological change can lead to harms that range from children’s safety to the erosion of democracy. With AI, the scale and intensity of potential harm is even greater—from racially based ‘risk scoring’ tools that needlessly keep people in prison to deepfake videos that further erode trust in democracy and future harms like economic upheaval and job loss. But if we act now, we can build accountability, promote opportunity, and deliver greater prosperity for all.”

These are all organizations that already do good work; it's good to see them apply pressure on AI companies in the public interest.

[Link]


 

The Bletchley Declaration by Countries Attending the AI Safety Summit, 1-2 November 2023

For me, this paragraph was the takeaway:

"We affirm that, whilst safety must be considered across the AI lifecycle, actors developing frontier AI capabilities, in particular those AI systems which are unusually powerful and potentially harmful, have a particularly strong responsibility for ensuring the safety of these AI systems, including through systems for safety testing, through evaluations, and by other appropriate measures. We encourage all relevant actors to provide context-appropriate transparency and accountability on their plans to measure, monitor and mitigate potentially harmful capabilities and the associated effects that may emerge, in particular to prevent misuse and issues of control, and the amplification of other risks."

In other words, the onus will be on AI developers to police themselves. We will see how that works out in practice.

[Link]


 

Notes, Links, and Weeknotes (23 October 2023) – Baldur Bjarnason

Baldur Bjarnason talks frankly about the cost of writing critically about AI:

"It’s honestly been brutal and it’ll probably take me a few years to recover financially from having published a moderately successful book on “AI” because it doesn’t have any of the opportunity multipliers that other topics have."

I worry about the same thing. I've noticed that AI-critical pieces lead to unsubscribes from my newsletter, and that the most lucrative job vacancies relate to AI in some way.

I'm not sure I regret my criticism, though.

[Link]


 

I'm banned for life from advertising on Meta. Because I teach Python.

Reuven Lerner was banned from advertising on Meta products for life because he offers Python and Pandas training - and the company's automated system thought he was dealing in live snakes and bears.

And then he lost the appeal because that, too, was automated.

This is almost Douglas Adams-esque in its boneheadedness, but it's also a look into an auto-bureaucratic future where there is no real recourse, even when the models themselves are at fault.

[Link]


 

The Repressive Power of Artificial Intelligence

"Advances in AI are amplifying a crisis for human rights online. While AI technology offers exciting and beneficial uses for science, education, and society at large, its uptake has also increased the scale, speed, and efficiency of digital repression. Automated systems have enabled governments to conduct more precise and subtle forms of online censorship."

[Link]
