
Internet Archive will ignore robots.txt files to keep historical record accurate

The Internet Archive has announced that it will no longer follow the directives given in robots.txt files. These files are predominantly used to tell search engine crawlers which parts of a site should be crawled and indexed.

In the past, the Internet Archive has complied with the instructions laid out in robots.txt files, according to a report from Boing Boing. However, the organization has decided that the way these files are typically configured is often at odds with the archival service it sets out to provide.


“Over time we have observed that the robots.txt files that are geared toward search engine crawlers do not necessarily serve our archival purposes,” stated a blog post that the organization published last week. “Internet Archive’s goal is to create complete ‘snapshots’ of web pages, including the duplicate content and the large versions of files.”


Robots.txt files are increasingly being used to remove entire domains from search engines following their transition from a live, accessible site to a parked domain. If a site goes out of business, and is rendered inaccessible in this way, it also becomes unavailable for viewing via the Internet Archive’s Wayback Machine. The organization apparently receives queries about these sites on a daily basis.
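To make the mechanism concrete, here is a minimal sketch of how a crawler that honors robots.txt would read the two kinds of files described above. It uses Python's standard `urllib.robotparser` module; the file contents, the `example-archiver` user agent, the `example.com` URLs, and the `is_allowed` helper are all illustrative assumptions, not the Internet Archive's actual crawler or configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt a parked domain might serve: block every crawler.
PARKED_ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""

# Hypothetical robots.txt tuned for search engines: hide only duplicate pages.
SEARCH_TUNED_ROBOTS_TXT = """\
User-agent: *
Disallow: /print/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a crawler obeying robots_txt may fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    agent = "example-archiver"  # assumed name, for illustration only
    print(is_allowed(PARKED_ROBOTS_TXT, agent, "http://example.com/index.html"))
    print(is_allowed(SEARCH_TUNED_ROBOTS_TXT, agent, "http://example.com/print/story"))
    print(is_allowed(SEARCH_TUNED_ROBOTS_TXT, agent, "http://example.com/story"))
```

Under these assumptions, an obedient crawler is shut out of the parked domain entirely and skips the duplicate print pages on the second site, which is the kind of gap in the historical record the organization says it wants to avoid.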

The Internet Archive hopes that disregarding robots.txt files will help contribute to an accurate representation of prior points in the web’s history, removing their capacity to muddy the waters with instructions intended for search engines.

The organization has already stopped honoring robots.txt files on sites and pages related to the U.S. government and the U.S. military, to account for the enormous changes that can be made to those domains between one administration and the next. That decision has caused no major problems, so the organization hopes that disregarding the files more broadly will prove similarly helpful.

Brad Jones
Former Digital Trends Contributor
Brad is an English-born writer currently splitting his time between Edinburgh and Pennsylvania. You can find him on Twitter…