Please find a description of my projet in English.
Be aware that I'm french, so if you speak french, it will be a plus but don't worry I'm fluent in English too !
So, I made a project, few month ago to get a crawler.
It was coded in .NET. Even if the work was well done, the spider is too slow, the connexion to the database is slow too, and it's not optimized to crawl hudge websites.
Because of all theses problems, I want to do a new projet, and the last one. I'll choose carefuly the freelancer to be sure to not spend time and money for a no project.
About the crawler, the main goal is, as you know, to crawl the web !
So starting by a given URL, it will crawl the website and find child URL, external URLs ...
The crawler can be done in java or any language of your choice, the only thing I need is to control it by PHP.
I don't know java... so it has to be easy to use (and setup if I have to setup it on my server).
You can use any OPEN SOURCE project (as nutch ...), you will save time and me money :)
So the request included programation, source and php interface to control the spider which should be working without human.
More details will be give at the time of the choice.
What I want ?
- A unique reply, don't need to send me the same presentation that you send to anyone, you will be exclude of the short list
- Tell me which language you plan to use, if you have an idea of a open source project to use ?
- tell me if you already coded a spider for some (sample, example enclosed)
- after, other details are up to you.
The presentation is important, I don't have time to ask the same question every time !
Please do no bid if you know ONLY PHP.
The program has to be multi threads so Java, Python ... but not PHP for the core
21 freelanceria on tarjonnut keskimäärin %project_bid_stats_avg_sub_26% %project_currencyDetails_sign_sub_27% tähän työhön
I have ample experience writing web crawlers as I have worked for a well known company in that field. My solution would be written in Python using the widely known Scrapy framework.