End of Term Web Archive 2012 Call for Volunteers
- Last Updated: November 20 2012
- Published: July 16 2012
We need your help! Is a U.S. Government Web site or part of a site you use or know about at risk of disappearing? Are you a government document or subject expert, or otherwise interested in helping identify U.S. Federal Government Web sites for collection and preservation? We need your help!
In 2008, the Internet Archive, Library of Congress, California Digital Library, University of North Texas Libraries, and the U.S. Government Printing Office, all members of the International Internet Preservation Consortium and partners in the National Digital Infrastructure and Preservation Program, agreed to join forces to collaboratively archive the U.S. Government Web sites at the end of the Bush administration. The goal of the "End of Term Web Archive" project team was to execute a comprehensive harvest of the Federal Government domains (.gov, .mil, .org, etc.) in the final months of the Bush administration, and to document changes in the Federal Government Web sites as agencies transitioned to the Obama administration. The archive includes Federal Government Web sites in the legislative, executive, and judicial branches of Government. View the 2008-2009 archive.
The End of Term project team has reconvened to build an End of Term 2012-2013 archive, and we need help identifying Web sites for collection, particularly those most at risk of change or deletion at the end of the current presidential term.
What You Can Do to Help
The project team has access to some lists of U.S. Federal Government domains and will use those as a baseline list of URLs to crawl. These include lists of legislative branch domains (senator, representative, legislative committee, and leadership Web presences), executive branch domains, and domains found in directories such as USA.gov and United States Courts. However, these lists are often not comprehensive.
Nominations of any U.S. Federal Government domains are welcome, though there are a few topic areas where we particularly need help identifying sites, including but not limited to:
- Judicial branch Web sites
- Important content or subdomains on very large Web sites (such as NASA.gov) that might be related to current Presidential policies
- Government content on non-Government domains (.com, .edu, etc.)
You may contribute as much time and effort as you are able, whether it be a nomination of 1 Web site or 500 Web sites. Web sites recommended by volunteers will be prioritized for more frequent and in-depth collection during the course of the project.
To contribute a URL to this project, please visit here and start entering URLs. Volunteers are asked to submit some simple metadata about the site that they are nominating, and provide some information about themselves.
- Summer 2012: Recruitment of curators/nominators to help identify Web sites for prioritized crawling.
- August 2012: Bookend (baseline) crawl of Government Web domains begins.
- Summer/Fall 2012: Partners will crawl various aspects of Government domains at varying frequencies, depending on selection policies/interests. Team will determine strategy for crawling prioritized Web sites.
- November 2012-February 2013: Crawl of prioritized Web sites.
- January 2013: Depending on the outcome of the election, focused crawls will be conducted as needed during this period.
- Spring or Summer 2013: Bookend crawl, plus additional crawl of prioritized Web sites as determined by the team.