Site icon Read Fanfictions | readfictional.com

The “Wayback Machine” has big problems: more and more providers are blocking content | News

For over 30 years, the Wayback Machine has aimed to archive as much of the content that appears on the Internet as possible. It is a non-profit organization based in San Francisco that now has a data treasure trove of more than a trillion websites (or website states on the respective date) with a storage requirement of 99 petabytes. The service began in 1995, but has only been available to the public since 2001. The crawlers constantly browse larger and smaller websites in order to save snapshots of them.

The largest media companies are now blocking access
However, the Wayback Machine’s objective is increasingly facing problems. As Wired points out in an article, there are more and more providers blocking access. While content behind a paywall is not accessible anyway, 23 high-reach news sites have even blocked the crawlers completely. However, this does not always happen as a direct measure against the archive service. USA Today, for example, explains in a statement that it generally prevents any kind of scraping bots from doing their work – it is definitely not aimed specifically at the Wayback Machine.

Petition: Wayback Machine should be recognized as a cultural asset
A joint petition from more than 100 journalists is currently demanding that the service be respected as a valuable and important tool and that the crawler be granted access. The Wayback Machine is often indispensable for research purposes because information can be looked up that would otherwise have been forgotten. Be it because operators delete content, or because a portal has discontinued the service – many original sources simply disappear over time.

USA Today blocks – and proves the importance of the service
An example: USA Today recently published an article in which the methods of the US immigration agency ICE were analyzed and placed in historical context. This made it possible to use concrete data to prove how much the course changed under Trump. This is exactly what the difficult situation shows: on the one hand, facts were referred to that would no longer have existed without the Wayback Machine, and on the other hand, the offer was prevented from subsequently archiving the articles. This must be solved differently, according to the demands.

Source link

Exit mobile version