Perplexity AI Is Lying about Their User Agent

Ben Werdmuller

16 Jun 2024 — 1 min read

Perplexity AI doesn't use its advertised user agent string or IP range to load content from third-party websites:

"So they're using headless browsers to scrape content, ignoring robots.txt, and not sending their user agent string. I can't even block their IP ranges because it appears these headless browsers are not on their IP ranges."

On one level, I understand why this is happening, as might anyone who's ever written a scraper (or scraper mitigations): the crawler that gathers training data likely does send the correct user agent string, but on-demand requests likely don't, so that they won't be blocked. That's not a good excuse at all, but I bet that's what's going on.
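To make the blocking problem concrete, here is a minimal sketch of the kind of filter a site owner might write, assuming a made-up crawler token (`examplebot`) and a made-up published IP range (`192.0.2.0/24`, a range reserved for documentation). None of these values come from Perplexity; they are placeholders for illustration only.

```python
import ipaddress

# Hypothetical values: a crawler's advertised token and its published IP range.
BLOCKED_AGENT_TOKENS = ("examplebot",)
BLOCKED_NETWORKS = (ipaddress.ip_network("192.0.2.0/24"),)


def should_block(user_agent: str, remote_ip: str) -> bool:
    """Block a request if it matches the advertised user agent or IP range."""
    ua = user_agent.lower()
    if any(token in ua for token in BLOCKED_AGENT_TOKENS):
        return True
    addr = ipaddress.ip_address(remote_ip)
    return any(addr in network for network in BLOCKED_NETWORKS)


# A crawler that identifies itself honestly is caught by the filter.
declared = should_block("ExampleBot/1.0", "192.0.2.10")

# A headless browser with a generic user agent, on an unlisted IP, is not.
undeclared = should_block(
    "Mozilla/5.0 (X11; Linux x86_64) HeadlessChrome", "198.51.100.7"
)
```

Both checks rely entirely on information the client chooses to reveal, which is why a request that sends neither the advertised string nor comes from the advertised range passes straight through.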

This is another example of the core issue with robots.txt: it's a handshake agreement at best. There are no legal or technical restrictions imposed by it; we all just hope that bots do the right thing. Some of them do, but a lot of them don't.
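A small sketch shows why robots.txt is only a request. It uses Python's standard `urllib.robotparser` and assumes a hypothetical robots.txt that disallows a made-up crawler called `ExampleBot`. The check runs on the client side, so a bot can skip it or identify itself as something else.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one named crawler is disallowed, everyone else is allowed.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /

User-agent: *
Allow: /
"""


def may_fetch(user_agent: str, url: str) -> bool:
    """Return what robots.txt says about this user agent and URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


# The same bot gets a different answer depending on the name it gives.
honest = may_fetch("ExampleBot", "https://example.com/post")
disguised = may_fetch("Mozilla/5.0", "https://example.com/post")
```

Here `honest` is `False` and `disguised` is `True`. Nothing on the server enforces either answer: the file only matters if the bot asks the question truthfully and then respects the result.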

The only real way to restrict these services is through legal rules that create meaningful consequences for these companies. Until then, there will be no sure-fire way to prevent your content from being accessed by an AI agent.

#AI

