Bot for Spidering

Preklicano Objavljeno Jan 28, 2008 Plačilo ob prevzemu
Preklicano Plačilo ob prevzemu

I need to tool for spidering websites.

I use wget now. But I want the tool run from my website. I have a shared hosting in Linux (REDHAT Enterprise 4) and can provide Shell access if needed. I can create a separate hosting account for this.

You can use wget or similar free software to build the tool. I want a userfriendly interface to enter the URL of the website and the tool has to spider all webpages.

Configuration to allow for

- excluding certain file extensions (like audio, video, images)

- excluding some paths

- saving all files with .htm extension

- server friendly scheduling of the page requests

Wget allows me to do all these and more.

Alternately, if you can teach me how I can run wget from my hosting server, it will serve the purpose too.

Thanks for your time

Eshwar

Inženiring Linux MySQL PHP Arhitektura porgramske opreme Preizkušanje programske opreme Spletno gostovanje Upravljanje spletnih strani Preizkušanje spletnih strani

ID projekta: #3672843

Več o projektu

1 predlog Oddaljen projekt Aktiven Oct 22, 2009

1 freelancer ponuja za povprečno $43 na tem delu

gridorian

See private message.

$42.5 USD v 14 dneh
(6 ocen)
3.7