The Times Australia
The Times World News

.
Times Media

.

The Internet Archive has been fighting for 25 years to keep what's on the web from disappearing – and you can help

  • Written by Kayla Harris, Librarian/Archivist at the Marian Library, Associate Professor, University of Dayton

This year the Internet Archive[1] turns 25. It’s best known for its pioneering role in archiving the internet through the Wayback Machine[2], which allows users to see how websites looked in the past.

Increasingly, much of daily life is conducted online. School, work, communication with friends and family, as well as news and images, are accessed through a variety of websites. Information that once was printed, physically mailed or kept in photo albums and notebooks may now be available only online. The COVID-19 pandemic has pushed even more interactions to the web.

You may not realize portions of the internet are constantly disappearing. As librarians[3] and[4] archivists[5], we strengthen collective memory by preserving materials that document the cultural heritage of society, including on the web. You can help us save the internet, too, as a citizen archivist.

Disappearing act

People and organizations remove content from the web for a variety of reasons. Sometimes it’s a result of changing internet culture, such as the recent shutdown of Yahoo Answers[6].

It can also be a result of following best practices for website design. When a website is updated, for example, the previous version is overwritten – unless it was archived.

Web archiving is the process of collecting, preserving and providing continued access to information on the internet. Often this work is done by librarians and archivists, with assistance from automated technology like web crawlers.

Web crawlers are programs that index web pages to make them available through search engines, or for long-term preservation. The Internet Archive, a nonprofit organization, uses thousands of computer servers to save multiple digital copies of these pages, requiring over 70 petabytes of data[7]. It is funded through donations, grants and payments for its digitization services. Over 750 million web pages are captured per day[8] in the Internet Archive’s Wayback Machine.

Why archive?

In 2018, President Donald Trump wrongly claimed via Twitter[9] that Google had promoted on its homepage President Barack Obama’s State of the Union address, but not his own. Archived versions of the Google homepage proved that Google had, in fact, highlighted Trump’s State of the Union address[10] in the same manner. Multiple news outlets use the Internet Archive’s Wayback Machine as the source for fact-checking these types of claims, since screenshots alone can be easily altered.

A 2019 report from the Tow Center for Digital Journalism[11] examined the digital archiving practices and policies of newspapers, magazines and other news producers. The interviews revealed that many news media staff either do not have the resources to devote to archiving their work or misunderstand digital archiving by equating it to having a backup version.

When a news story disappeared from the Gawker website[12] a year after the publication shut down, the Freedom of the Press Foundation[13] became concerned with what might happen when wealthy individuals purchase websites with the intent to delete or censor the archives. It partnered with the Internet Archive to launch a web archive collection[14] focused on preserving the web archives of vulnerable news outlets – and to dissuade billionaires from purchasing such material to censor.

The Internet Archive has been fighting for 25 years to keep what's on the web from disappearing – and you can help The web crawls for blacklivesmatter.com in the Internet Archive’s Wayback Machine. Internet Archive Wayback Machine[15]

Archiving websites that document social justice issues, such as Black Lives Matter[16], helps explain these movements to people of the present and the future.

Archiving government websites promotes transparency and accountability. Especially during times of transition, government websites are vulnerable to deletion with changing political parties.

In 2017 the Library of Congress announced[17] it would no longer archive every single tweet, because of Twitter’s growth as a communication tool. Twitter supplies the Library of Congress with the texts of tweets, not shared images or videos. Instead of comprehensive collecting, the Library of Congress now archives only tweets of significant national importance.

A pastel colored early home page that reads 'Welcome to the OFFICIAL website of: ty' Screen capture from the Dec. 18, 1996, archived version of the Ty website, creator of. Beanie Babies, in the Internet Archive’s Wayback Machine. Internet Archive Wayback Machine[18]

Archived websites that document the culture and history of the internet, like the Geocities Gallery[19], not only are fun to look at but illustrate the ways early websites were created and used by individuals.

Citizen archivists

Archiving the internet is a monumental task, one that librarians and archivists cannot do alone. Anyone can be a citizen archivist and preserve history through the Internet Archive’s Wayback Machine[20]. The “Save Page Now[21]” feature allows anyone to freely archive a single, public website page. Bear in mind, some websites prevent web crawling and archiving through special coding or by requiring a login to the site. This may be due to sensitive content or the personal preference of the web developer.

Local cultural heritage institutions, such as libraries, archives and museums, are also actively archiving the internet. Over 800 institutions use Archive-It[22], a tool from the Internet Archive, to create archived web collections. At the University of Dayton[23] we curate collections related to our Catholic and Marianist heritage, from Catholic blogs to stories of the Virgin Mary in the news.

Through its Spontaneous Event collections[24], Archive-It partners with organizations and individuals to create collections of “web content related to a specific event, capturing at risk content during times of crisis.”

Similarly, it created the Community Webs program[25], in partnership with the Institute of Museum and Library Services[26], to help public libraries create collections of archived web content relevant to local communities.

The websites of today are the historical evidence of tomorrow, but only if they are archived. If they are lost, we will lose crucial information about corporate and government decisions, modern communication methods such as social media, and social movements with significant online presences, such as Black Lives Matter and #MeToo.

Together with librarians and archivists, you can help ensure the survival of this evidence and save internet history.

References

  1. ^ Internet Archive (anniversary.archive.org)
  2. ^ Wayback Machine (archive.org)
  3. ^ librarians (scholar.google.com)
  4. ^ and (scholar.google.com)
  5. ^ archivists (scholar.google.com)
  6. ^ shutdown of Yahoo Answers (www.nytimes.com)
  7. ^ over 70 petabytes of data (archive.org)
  8. ^ 750 million web pages are captured per day (docs.google.com)
  9. ^ wrongly claimed via Twitter (www.pbs.org)
  10. ^ highlighted Trump’s State of the Union address (web.archive.org)
  11. ^ A 2019 report from the Tow Center for Digital Journalism (www.cjr.org)
  12. ^ a news story disappeared from the Gawker website (www.wired.com)
  13. ^ Freedom of the Press Foundation (freedom.press)
  14. ^ web archive collection (archive-it.org)
  15. ^ Internet Archive Wayback Machine (web.archive.org)
  16. ^ Black Lives Matter (blacklivesmatter.com)
  17. ^ the Library of Congress announced (www.npr.org)
  18. ^ Internet Archive Wayback Machine (web.archive.org)
  19. ^ the Geocities Gallery (www.vice.com)
  20. ^ Internet Archive’s Wayback Machine (www.archive.org)
  21. ^ Save Page Now (web.archive.org)
  22. ^ Archive-It (archive-it.org)
  23. ^ University of Dayton (archive-it.org)
  24. ^ Spontaneous Event collections (archive-it.org)
  25. ^ Community Webs program (communitywebs.archive-it.org)
  26. ^ Institute of Museum and Library Services (www.imls.gov)

Read more https://theconversation.com/the-internet-archive-has-been-fighting-for-25-years-to-keep-whats-on-the-web-from-disappearing-and-you-can-help-163867

The Times Features

The Gift That Keeps Growing: Why Tinybeans+ Gift Cards are a game-changer for new parents

As new parents navigate the joys and challenges of raising a child in the digital age, one question looms large: how do you preserve and share your baby's milestones without co...

Group Adventures Made Easy: How to Coordinate Shuttle Services from DCA to IAD

Traveling as a large group can be both exciting and challenging, especially when navigating busy airports like DCA (Ronald Reagan Washington National Airport) and IAD (Washington...

From Anxiety to Assurance: Proven Strategies to Support Your Child's Emotional Health

Navigating the intricate landscape of childhood emotions can be a daunting task for any parent, especially when faced with common fears and anxieties. However, transforming anxie...

The Rise of Meal Replacement Shakes in Australia: Why The Lady Shake Is Leading the Pack

Source Meal replacement shakes are having a moment in Australia, and it’s not hard to see why. They’re quick, convenient, and packed with nutrition, making them the perfect solu...

HCF’s Healthy Hearts Roadshow Wraps Up 2024 with a Final Regional Sprint

Next week marks the final leg of the HCF Healthy Hearts Roadshow for 2024, bringing free heart health checks to some of NSW’s most vibrant regional communities. As Australia’s ...

The Budget-Friendly Traveler: How Off-Airport Car Hire Can Save You Money

When planning a trip, transportation is one of the most crucial considerations. For many, the go-to option is renting a car at the airport for convenience. But what if we told ...

Times Magazine

Here is a great checklist for organising your wedding flowers

For many, flowers are a big component of a wedding day, and if you are soon to be married and you are considering your flower arrangements, this post is for you. Working out the details for a wedding is a big job, that's why we've compiled this che...

Designing for Accessibility: How Toilet Signs Can Promote Inclusivity

Toilet signs are a crucial aspect of any public facility or establishment. They play an important role in guiding individuals to the appropriate restroom while ensuring that everyone feels safe and comfortable while using the facilities. Toilet sig...

Employment support for people with disability

If you’re a job seeker in Australia and you’re currently living with a disability, there will be some hurdles to overcome and added challenges you will have to face in your efforts to find and keep a job. The positive news is that you don’t have ...

Women from refugee backgrounds are engaged in the workforce

With today marking the start of Refugee Week, it’s time to celebrate and acknowledge the contributions and impact of refugees on our industries and communities. As part of this, The Social Outfit is making a difference again with their  Wear The ...

Moving to Melbourne- The ultimate guide for Expats

Melbourne city is the second-largest city in Australia boosting a number of cosmopolitan, multicultural and vivacious attributes that attract expats from around the world. Located along the banks of the stunning River Yarra, Melbourne is envelope...

Waave launches ‘Wallet’ for Pay by Bank with Australian-first biometric security

Payments technology and Open Banking leader Waave today announces the introduction of the Waave Wallet to house its upgraded Pay by Bank product, a real-time account-to-account payment method which now features industry-leading biometric security...