Boston Linux & Unix (BLU) Home | Calendar | Mail Lists | List Archives | Desktop SIG | Hardware Hacking SIG
Wiki | Flickr | PicasaWeb | Video | Maps & Directions | Installfests | Keysignings
Linux Cafe | Meeting Notes | Linux Links | Bling | About BLU

BLU Discuss list archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Discuss] What's the best site-crawler utility?



Matthew Gillen wrote:
>    wget -k -m -np http://mysite

I've tried this. It's messy at best. Wiki pages aren't static HTML. 
They're dynamically generated and they come with all sorts of style 
sheets and embedded scripts. Yes, you can get the text but it'll be text 
as rendered by a wiki. It takes a lot of work to turn it into something 
usable.

-- 
Rich P.



BLU is a member of BostonUserGroups
BLU is a member of BostonUserGroups
We also thank MIT for the use of their facilities.

Valid HTML 4.01! Valid CSS!



Boston Linux & Unix / webmaster@blu.org