We are a foreclosure web site that needs to create a scraper to get new data to keep our lists fresh. The object of the scraper is to extract data from the web sites listed in the attached MS Excel file. The excel file is listed by State. There are sites that handle every state. Therefore we will only search 15. Each site must be searched By State, and then by City. The foreclosure results for the cities will be uploaded into our datatbase. The database already contains every city, state, and state abbreviation, in the US. The data for upload is very simple. There will be less than 10 data items per foreclosure listing we will upload. The scraper will also remove foreclosure listings in our database more than 90 days old. The server is a Linux machine running Red Hat Fedora Core 6. And MySQL.
The code you create must be well formatted and well commented. The use of industry standards in coding must be followed.
The scraper will need to run three times per day. The database schema will be sent to the winner bidder.