Scrape specific details from website
- Status: Closed
- Prize: $50
- Entries received: 3
- Winner: mrNamelesss
Using websites such as [login to view URL] or [login to view URL], I need a program to screen scrape specific details about horse racing meetings.
Using [login to view URL], the program needs to follow these steps:
1) Go to Cards > Tomorrow's Cards ([login to view URL]). If 6th July has passed, refer to the attached PDF, which is a download of the race card.
2) For each meeting (e.g. Bath, Catterick, etc.) look at the details for each race (i.e. 6:00, 6:30, 7:00 etc. for Bath and 2:20, 2:50, 3:20, etc. for Catterick)
2a) Verify that the race is a valid handicap race (i.e. the race name contains "Handicap" but doesn't include "Amateur" or "Apprentice"). For Bath, all races are included except the 7:30 as it doesn't include the word "Handicap". Kempton at 5:45 would be ignored as it includes the word "Apprentice".
2b) For each valid/included race, click on the race time, title or Card link to look for specific horse details (e.g. 6:30 Bath - [login to view URL]).
3) For each race selected, identify horses that have finished 1st, 2nd, 3rd or 4th in either their last or last but one race AND are course and distance winners.
3a) With the Standard tab chosen (the default), look under the "Form" column and include the horse as a selection if either of the last two digits/characters is 1, 2, 3 or 4; otherwise ignore it. For example, the first horse listed is Lucky Clover and the last two digits of its form are 1 and 8 (from 90-218). This is a valid selection. Dusty Blue and Our Lord are also valid selections but Renounce would be ignored.
3b) Under the "Horse" column, include the selection if "CD", or both "C" and "D", appears below the horse's name; otherwise ignore it. For example, the first horse listed is Lucky Clover, which is a valid selection as it's a course and distance winner at the track/meeting (i.e. "CD"). Dusty Blue is also a valid selection as it's a course and distance winner (i.e. "C" and "D"), but Our Lord would be ignored as it's just a course winner (and not a distance winner).
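The filtering rules in steps 2a, 3a and 3b could be sketched roughly as below. This is only an illustration of the selection logic, not a full scraper: it assumes the race name, the form string and the markers shown below the horse's name have already been extracted from the page. All function names are hypothetical, the race names in the usage example are made up, and the case-insensitive matching of "Handicap", "Amateur" and "Apprentice" is an assumption.

```python
# Hypothetical helpers for the selection rules; inputs are assumed to be
# already scraped from the race card page.

PLACED = {"1", "2", "3", "4"}


def is_valid_handicap(race_name: str) -> bool:
    """Step 2a: name contains "Handicap" but not "Amateur" or "Apprentice".

    Case-insensitive matching is an assumption, not stated in the brief.
    """
    name = race_name.lower()
    return (
        "handicap" in name
        and "amateur" not in name
        and "apprentice" not in name
    )


def recently_placed(form: str) -> bool:
    """Step 3a: either of the last two form characters is 1, 2, 3 or 4."""
    return any(ch in PLACED for ch in form.strip()[-2:])


def is_course_and_distance_winner(markers) -> bool:
    """Step 3b: "CD" appears, or both "C" and "D" appear."""
    tokens = {m.strip().upper() for m in markers}
    return "CD" in tokens or {"C", "D"} <= tokens


def is_selection(form: str, markers) -> bool:
    """A horse is selected only if it passes both 3a and 3b."""
    return recently_placed(form) and is_course_and_distance_winner(markers)


if __name__ == "__main__":
    # Race names here are invented for illustration.
    print(is_valid_handicap("Example Handicap Stakes"))     # included
    print(is_valid_handicap("Example Apprentice Handicap")) # ignored
    # Lucky Clover from the brief: form 90-218, marked "CD".
    print(is_selection("90-218", ["CD"]))
    # A course-only winner fails step 3b regardless of form.
    print(is_selection("90-218", ["C"]))
```

Keeping each rule in its own small function should make it easier to amend the criteria later, for example to change which finishing positions count or to exclude further race types.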
See attached image where 7 horses are highlighted in the 6:30 at Bath. Of those 7, only 4 are valid according to steps 3a and 3b above. The valid selections are:
I will need to be able to amend the script myself at a later date so please supply all code.
The freelancer who is first to receive 5 stars will win the contest.
“Very detailed package was produced and a y to work with. I'll consider this freelancer again if I need something similar.”