AI models have a voracious appetite for data, and keeping the information they present to users up to date is a challenge. And so companies at the vanguard of AI appear to have hit on an answer: crawling the web, constantly.
But website owners increasingly don’t want to give AI firms free rein. So they’re regaining control by cracking down on crawlers.
To do this, they’re using robots.txt, a file hosted on many websites that tells web crawlers which parts of a site they may and may not scrape. Originally designed to signal to search engines whether a website wanted its pages indexed, it has gained new importance in the AI era as some companies allegedly flout its instructions. In a new study, Nicolas Steinacker-Olsztyn, a researcher at Saarland University, and his
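To see how the mechanism works in practice, here is a minimal sketch using Python’s standard urllib.robotparser module. The robots.txt contents and the example.com URL below are hypothetical, chosen for illustration; GPTBot and ClaudeBot are the crawler user agents publicly documented by OpenAI and Anthropic, while Googlebot stands in for a traditional search crawler.

```python
from urllib.robotparser import RobotFileParser

# A hypothetical robots.txt that welcomes search engines but
# asks two well-known AI crawlers to stay away entirely.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: *
Allow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Check what each crawler is permitted to fetch under these rules.
for agent in ("GPTBot", "ClaudeBot", "Googlebot"):
    allowed = parser.can_fetch(agent, "https://example.com/articles/some-page")
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Note that the file is purely advisory: it declares the site owner’s wishes, and well-behaved crawlers check it before fetching pages, but nothing in the protocol enforces compliance, which is why reports of AI companies ignoring it have drawn scrutiny.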
