We have an in-house database containing about 3 million UK B2B addresses (entries).
Each entry contains:
the company name, its address, telephone and fax numbers, and a business category field.
We need you to develop a system (program) that can "email append" those entries, that is, add an email address to each one, in the following way:
We found that when you type the name of a business and the city where it is located into a search engine like Google, the first result displayed is often the url of the business (40-50% success rate in a test we ran on 200 entries).
The system would have to run the search automatically and extract the url of the business (which is usually the first result).
Once the url is found, the system has to scan the website to find out whether an email address is available.
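To illustrate the email scanning step, here is a minimal Python sketch, assuming the page HTML has already been downloaded. The function name `extract_emails`, the simplified regular expression and the image-suffix filter are our own illustrative assumptions, not a required design; a real system would also need to follow links to pages such as "Contact us".

```python
import re
from html import unescape

# Deliberately simple pattern (an assumption): it does not cover every valid address form.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")

# File names such as "logo@2x.png" look like addresses to the pattern above, so skip them.
IMAGE_SUFFIXES = (".png", ".jpg", ".jpeg", ".gif", ".svg")


def extract_emails(html: str) -> list[str]:
    """Return the unique email addresses found in a page, in order of appearance."""
    text = unescape(html)  # decode entities such as &#64; for "@"
    seen = set()
    found = []
    for match in EMAIL_RE.finditer(text):
        email = match.group(0).lower()
        if email.endswith(IMAGE_SUFFIXES) or email in seen:
            continue
        seen.add(email)
        found.append(email)
    return found


if __name__ == "__main__":
    # Made-up page fragment using the reserved example.co.uk domain.
    page = '<a href="mailto:Sales@Example.co.uk?subject=Hi">Email us</a> or info&#64;example.co.uk.'
    print(extract_emails(page))
```

Addresses hidden behind images, scripts or contact forms would not be found this way.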
For example, if we take 2 random entries:
Currie & Warner Ltd. Birmingham
Hagemeyer (UK) Ltd Birmingham
If you type those into Google, you will see that the first result is the url of the company's website.
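As a rough illustration of how such entries could be turned into search requests automatically, the Python sketch below builds a query url from the company name and town. The helper name `build_search_url` and the default base url are assumptions for illustration only; the point is that characters like "&" and "(" in company names must be encoded so they do not break the query.

```python
from urllib.parse import urlencode


def build_search_url(company: str, town: str,
                     base: str = "https://www.google.com/search") -> str:
    """Build a search url for one database entry (hypothetical helper)."""
    query = f"{company} {town}"
    # urlencode escapes "&", "(", ")" and spaces so the company name stays one parameter.
    return f"{base}?{urlencode({'q': query})}"


if __name__ == "__main__":
    print(build_search_url("Currie & Warner Ltd.", "Birmingham"))
    print(build_search_url("Hagemeyer (UK) Ltd", "Birmingham"))
```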
The difficulty of the task lies in the ability of the system to detect if that first given url (in Google) is the correct one for that business or not.
Sometimes the correct url is the second or the third result, as for the following entry:
Bulgin Components Plc Essex
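One possible way to approach this check, sketched in Python under our own assumptions: compare the words of the company name with the host name of each candidate url and keep the highest-ranked candidate that matches well enough. The stopword list, the 0.5 threshold, the function names and the candidate urls in the example (which use the reserved ".example" domain and are made up) are all illustrative, not a tested method.

```python
import re
from urllib.parse import urlparse

# Legal suffixes and filler words that say little about the business (assumed list).
STOPWORDS = {"ltd", "limited", "plc", "uk", "co", "and", "the"}


def name_tokens(company: str) -> list[str]:
    """Lower-case the company name and drop stopwords."""
    words = re.findall(r"[a-z0-9]+", company.lower())
    return [w for w in words if w not in STOPWORDS]


def domain_score(company: str, url: str) -> float:
    """Fraction of company-name words that appear in the url's host name."""
    host = (urlparse(url).hostname or "").lower()
    tokens = name_tokens(company)
    if not tokens:
        return 0.0
    hits = sum(1 for t in tokens if t in host)
    return hits / len(tokens)


def pick_url(company: str, candidates: list[str], threshold: float = 0.5):
    """Return the best-scoring candidate, or None if nothing reaches the threshold.

    Candidates are expected in search-result order; ties keep the earlier result.
    """
    best, best_score = None, 0.0
    for url in candidates:
        score = domain_score(company, url)
        if score > best_score:
            best, best_score = url, score
    return best if best_score >= threshold else None


if __name__ == "__main__":
    results = [
        "https://www.example-directory.co.uk/bulgin",  # made-up directory listing
        "https://www.bulgin.example",                  # made-up company site
    ]
    print(pick_url("Bulgin Components Plc", results))
```

A check like this is crude: it can miss businesses whose domain is an abbreviation of the name, so the page content (address, telephone number) could serve as a further signal.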
Our objective is to "append" the email, or at least the business url (website), to at least 25-30% of the 3 million entries.
We can provide a dedicated server if needed.
Please give details in your bid on: whether you can develop such a system, how fast it would be, and how you would resolve the difficulty of finding the right url for each entry.