web archiving
From | Matt Price |
---|---|
Subject | web archiving |
Date | |
Msg-id | 1026334740.699.82.camel@anarres |
Responses | Re: web archiving |
List | pgsql-novice |
Hi there,

I've just moved up from non-free OSes to Debian Linux and installed PostgreSQL, hoping to get started on some projects I've been thinking about. Several of these projects involve web archives. The idea is that a URL is entered along with a set of bibliographic data in other fields (keywords, author, date, etc.). The HTML (and, hopefully, the accompanying images, CSS files, etc.) is then grabbed using curl and archived in a PostgreSQL database. A web or other GUI interface then provides fully searchable access to the archive for later use.

So my question: does anyone know of a similar tool that already exists? I'm a complete novice at database programming (and at PHP too, which is what I figured I'd use as the scripting language, though I'd consider learning Perl or Java if folks think that's a much better idea), and I'd rather work with some pre-existing code than start from the ground up.

Any suggestions? Is this the right list to be asking this question on?

Thanks loads,
Matt
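
For illustration only, the workflow described in this message (enter a URL plus bibliographic fields, fetch the page, store it in PostgreSQL) could be sketched roughly as below. This is not existing code from the post. It uses Python rather than PHP, and `urllib` stands in for curl. The table name `archived_page`, its columns, and the helper names `fetch_page`, `build_record`, and `archive_page` are all hypothetical. The connection is assumed to be a DB-API connection such as one returned by `psycopg2.connect(...)`. Images and stylesheets are not handled.

```python
import hashlib
import urllib.request
from datetime import datetime, timezone

# Hypothetical PostgreSQL schema: one row per archived page.
SCHEMA_SQL = """
CREATE TABLE IF NOT EXISTS archived_page (
    id          serial PRIMARY KEY,
    url         text NOT NULL,
    author      text,
    keywords    text[],
    fetched_at  timestamptz NOT NULL,
    sha256      text NOT NULL,
    html        text NOT NULL
);
"""

# Named placeholders in the style accepted by psycopg2 (an assumption).
INSERT_SQL = """
INSERT INTO archived_page (url, author, keywords, fetched_at, sha256, html)
VALUES (%(url)s, %(author)s, %(keywords)s, %(fetched_at)s, %(sha256)s, %(html)s)
"""


def fetch_page(url, timeout=30):
    """Download the raw bytes of a page (stand-in for calling curl)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def build_record(url, raw_html, author=None, keywords=()):
    """Combine the fetched HTML with the bibliographic fields."""
    return {
        "url": url,
        "author": author,
        "keywords": list(keywords),
        "fetched_at": datetime.now(timezone.utc),
        # The checksum makes it possible to notice when a page has changed.
        "sha256": hashlib.sha256(raw_html).hexdigest(),
        # Undecodable bytes are replaced rather than raising an error.
        "html": raw_html.decode("utf-8", errors="replace"),
    }


def archive_page(conn, record):
    """Insert one record through a DB-API connection and commit."""
    cur = conn.cursor()
    cur.execute(INSERT_SQL, record)
    conn.commit()
```

With a live database, usage would look something like `archive_page(conn, build_record(url, fetch_page(url), author="...", keywords=["..."]))` after running `SCHEMA_SQL` once. Keeping the fetch, record-building, and insert steps separate means the fetcher could later be swapped for curl without touching the database code.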