Web Syndicated Archiving Tools / RSS Mashups

Steve Cherry sc at STEVECHERRY.NET
Wed Jan 4 11:20:44 EST 2012


Sounds like you could use Archive-It, the Internet Archive's web
crawling service, which can save RSS feeds. Once you set up the feeds
you want to crawl and how often to crawl them, the process is
automatic. You'd just need to monitor the results to make sure you're
getting the content you wanted.
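
For the monitoring step, here is a rough sketch of the kind of check I
mean. It is not part of Archive-It or its API. It just reads a feed with
Python's standard library, pulls out the item links, and compares them
against a list of URLs you believe were captured. The sample feed, the
function names, and the captured list are all made up for illustration;
in practice the captured list would come from whatever crawl report you
have on hand.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed, standing in for a real RSS 2.0 document.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item><title>First post</title><link>http://example.org/posts/1</link></item>
    <item><title>Second post</title><link>http://example.org/posts/2</link></item>
    <item><title>No link here</title></item>
  </channel>
</rss>"""


def extract_item_links(feed_xml):
    """Return the <link> of every <item> in an RSS 2.0 feed, in feed order."""
    root = ET.fromstring(feed_xml)
    links = []
    for item in root.iter("item"):
        link = item.findtext("link")
        # Skip items that have no link or only whitespace in it.
        if link and link.strip():
            links.append(link.strip())
    return links


def find_missing(expected_links, captured_links):
    """Return the expected links that do not appear in the captured list."""
    captured = set(captured_links)
    return [url for url in expected_links if url not in captured]


if __name__ == "__main__":
    expected = extract_item_links(SAMPLE_FEED)
    # Assumed input: URLs you believe the crawl saved.
    captured = ["http://example.org/posts/1"]
    for url in find_missing(expected, captured):
        print("not captured:", url)
```

Anything it prints is a feed item that did not show up in the crawl, so
you would know to adjust the crawl settings or frequency.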

Or, if you wanted to store the content locally, you could install
Heritrix, the open-source crawler that powers Archive-It.

The following pages have more information:

https://webarchive.jira.com/wiki/display/ARIH/Crawling+RSS+or+News+Feeds
https://webarchive.jira.com/wiki/display/Heritrix/Heritrix

Steve Cherry
Electronic Services Librarian
The Catholic University of America
620 Michigan Ave., N.E.
Washington, DC 20064
202-319-6433

============================

To unsubscribe: http://bit.ly/web4lib

Web4Lib Web Site: http://web4lib.org/
