In progress

Downloading and parsing HTML documents

Query the United States Patent and Trademark Office website for all patents that reference a particular patent number that I’ll provide. (This process is very straightforward and takes just seconds; I can provide full instructions.) The resulting list includes 1,314 results with 50 hits per page. Each hit is linked to the full text document for a specific patent.
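As a rough illustration of the scale, 1,314 hits at 50 per page works out to 27 result pages, with 14 hits on the last one. The sketch below only does that arithmetic; the function names are hypothetical, and how the site actually addresses its result pages is not stated in this posting.

```python
import math


def page_count(total_hits, hits_per_page):
    """Number of result pages needed to list every hit."""
    return math.ceil(total_hits / hits_per_page)


def page_ranges(total_hits, hits_per_page):
    """Yield (page_number, first_hit, last_hit), all 1-based and inclusive."""
    for page in range(1, page_count(total_hits, hits_per_page) + 1):
        first = (page - 1) * hits_per_page + 1
        last = min(page * hits_per_page, total_hits)
        yield page, first, last


if __name__ == "__main__":
    for page, first, last in page_ranges(1314, 50):
        print(f"page {page}: hits {first}-{last}")
```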

What I need:

1) Someone to download the HTML code from the full text document for each referencing patent (i.e., each of the 1,314).

2) Once these pages of plain text HTML code are in hand, someone to parse the results into fields in an Excel file. There will be about 15 fields. Four of these fields (inventor, inventor location, patents referenced, and other references) will have up to 30 individual entries each. I can provide full details on the specific fields that I need for each patent and guidance on the unique text that can be used as markers for finding each field within the full text document.
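The two steps above could be sketched roughly as follows, using only the Python standard library. This is a minimal sketch under assumptions, not the method the employer or any bidder used: the marker strings (`"Inventors:"`, `"Assignee:"`), the `;` separator, and all function names are hypothetical placeholders for the real markers the employer offers to supply.

```python
import re
import urllib.request
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect the text nodes of an HTML document, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def fetch_html(url, timeout=30):
    """Download one page and return it as text (the caller supplies the URL)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def html_to_text(html):
    """Strip tags and collapse runs of whitespace to single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return re.sub(r"\s+", " ", " ".join(parser.chunks)).strip()


def field_between(text, start_marker, end_marker):
    """Return the text between two markers, or None if either is missing."""
    start = text.find(start_marker)
    if start == -1:
        return None
    start += len(start_marker)
    end = text.find(end_marker, start)
    if end == -1:
        return None
    return text[start:end].strip()


def split_entries(value, separator=";"):
    """Split a multi-valued field into a list of trimmed, non-empty entries."""
    if value is None:
        return []
    return [part.strip() for part in value.split(separator) if part.strip()]


if __name__ == "__main__":
    # Hypothetical sample; the real markup and markers will differ.
    sample = "<b>Inventors:</b> Alice Smith; Bob Jones<br><b>Assignee:</b> Acme"
    text = html_to_text(sample)
    print(split_entries(field_between(text, "Inventors:", "Assignee:")))
```

Keeping the download step (`fetch_html`) separate from the parsing functions means the 1,314 pages can be saved to disk once and re-parsed as often as needed while the field markers are being tuned.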

The deliverable is:

1) An Excel file with each of the 1,314 results in its own row. The columns would be the specific fields scraped and parsed from the full text documents.

2) The code you used to do this. It must be well commented.
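One way to get one row per patent is to write a CSV file that Excel can open, spreading each multi-valued field across numbered columns. The sketch below assumes that layout; the field names, the `_1` to `_30` column naming, and the function names are all hypothetical, and a native Excel file would need a third-party library that is not shown here.

```python
import csv
import io

MAX_ENTRIES = 30  # the posting says multi-valued fields hold up to 30 entries


def build_header(single_fields, multi_fields, max_entries=MAX_ENTRIES):
    """Column names: single fields first, then name_1..name_N per multi field."""
    header = list(single_fields)
    for name in multi_fields:
        header.extend(f"{name}_{i}" for i in range(1, max_entries + 1))
    return header


def flatten_record(record, single_fields, multi_fields, max_entries=MAX_ENTRIES):
    """Turn one parsed patent (a dict) into a flat row matching build_header."""
    row = [record.get(name, "") for name in single_fields]
    for name in multi_fields:
        # Entries beyond max_entries are dropped; missing ones are left blank.
        entries = list(record.get(name, []))[:max_entries]
        row.extend(entries + [""] * (max_entries - len(entries)))
    return row


def write_csv(records, single_fields, multi_fields, out, max_entries=MAX_ENTRIES):
    """Write a header row followed by one row per record to a text stream."""
    writer = csv.writer(out)
    writer.writerow(build_header(single_fields, multi_fields, max_entries))
    for record in records:
        writer.writerow(flatten_record(record, single_fields, multi_fields, max_entries))


if __name__ == "__main__":
    # Hypothetical record with made-up values, two columns per multi field.
    demo = [{"patent_number": "1234567", "inventor": ["Alice Smith"]}]
    buffer = io.StringIO()
    write_csv(demo, ["patent_number"], ["inventor"], buffer, max_entries=2)
    print(buffer.getvalue())
```

When writing to a real file instead of a `StringIO`, open it with `newline=""` so the `csv` module controls the line endings.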

Skills: C Programming, Java, Perl, PHP, Python


About the Employer:
(6 reviews) Eugene, United States

Project ID: #29522

Awarded to:

PaulWalton

Hello, ajnelson. Please read the PM board for details. Paul

$50 USD in 2 days
(6 reviews)
4.1

25 freelancers have bid an average of $71 for this job

gaffapi

Please PM me the actual links so I can make a demo for you.

$100 USD in 2 days
(72 reviews)
6.3
danguer

Hi, I can help you. I have a very good connection (1 MBs) and am well used to this kind of work.

$90 USD in 2 days
(10 reviews)
6.0
CruzDelSur

Hi, I would like to write a little demo for you. I will do it in PHP. Could you possibly show me the source link you want to get content from? Regards, CruzDelSur

$100 USD in 3 days
(27 reviews)
5.6
Zuprem

I can help you with this.

$30 USD in 1 day
(53 reviews)
5.5
PSE

Hi, Please check PMB for details

$90 USD in 1 day
(14 reviews)
5.0
gogetter

Hi, I have implemented similar projects. Since you require the data to be in Excel, the code would have to run on Windows (or the code could generate a CSV file that you can later import into Excel). I can provide the solution i…

$95 USD in 6 days
(2 reviews)
4.4
nadeem2005

Dear Sir, we have relevant experience. Please see the PMB for a complete description of this project. Here is our placeholder bid for this project. Best regards, Nadeem

$30 USD in 1 day
(19 reviews)
4.3
inakiseri

Please contact me for fast development.

$100 USD in 1 day
(3 reviews)
4.0
neon

We have done something similar and can help you with this work too.

$100 USD in 7 days
(7 reviews)
3.7
varatare

Hello ajnelson. This is what I will do: I will use PHP to parse the HTML code and convert it into CSV format. :)

$60 USD in 3 days
(3 reviews)
2.9
mohanprabha

Dear Sir, I have 6+ years of experience in software development. Regards, mohan

$100 USD in 3 days
(2 reviews)
2.6
ranosoft

SL Hi, we take this opportunity to introduce ourselves as an ISO 9001:2000 company, and we are also the first Indian IT company to have ISO 14001 certification. http://www.vyasildemo.com/designportfolio and 4 curren…

$100 USD in 15 days
(3 reviews)
4.0
cks121

I have worked on this type of project. I can complete this task within 2-5 hours if you pay me $120.00. Thanks

$51 USD in 1 day
(0 reviews)
0.0
vniranjan1979

Hi sir, I have pretty good knowledge and experience in parsing and validating XML documents, so HTML will be much easier and faster to develop. Looking forward to hearing from you to start this project.

$40 USD in 3 days
(0 reviews)
0.0
UTStudios

Hi, please check your PM.

$50 USD in 3 days
(0 reviews)
1.6
vladag

Hi, I have done similar projects and can give you a small demo that works according to your tasks.

$30 USD in 2 days
(0 reviews)
2.0
sanju0011

Hi, this can be done. I am committed to providing you a quality solution. Thanks

$90 USD in 5 days
(0 reviews)
0.0
superbrain

I can do this project for you. I need escrow payment and a good review.

$100 USD in 3 days
(0 reviews)
0.0
hashvin

I have done similar jobs involving HTML parsing with PHP. I use good technique when writing code, so you can be guaranteed it will be well commented. Please let me know any time if you would like me to get started with the pr…

$65 USD in 2 days
(0 reviews)
2.4
DanielRomero

I can do the job

$50 USD in 2 days
(0 reviews)
0.0