Tuesday, June 1, 2010

TC Tidbits: 5 Things the Library of Congress is Archiving From the Web


In 2000, the Library of Congress started a pilot web archiving project focused on the presidential election. After the Sept. 11, 2001, terrorist attacks, the pilot project expanded and eventually became a permanent fixture of our national archives. Five full-time staff members orchestrate an open-source web crawler called Heritrix to capture the Internet’s content for future generations.

