Parse HTML page

I will provide you with links to about 288 blogs. For each blog, you will have to find out if it is written in French (basically if the posts are not 100% in English). Then for each of the remaining blogs, you will have to find the RSS link. Once you get the RSS link, you will have to pick one or two link in the RSS to find the regex that will enable me to parse the body of the post. I want to be able to use the PHP function preg_match to extract the body. That's all.

there are 2 things I value a log: Speed + Quality. It is not enough to write that you will produce 0 defects and complete the project in 1 day, you need to demonstrate it.

I will give you few blogs so that you can play with them:

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

remember, i just want the body .. no title, no author ...

Taidot: PHP

Näytä lisää: php parse html, php html parse, html parse, html parse php, regex is, regex info, regex in c, regex c, find the author, find in page, find a author com, parse html page, regex &, author-it, author find, Regex, php regex, parse, parse html, parse and

Tietoa työnantajasta:
( 16 arvostelua ) Cambridge, United States

Projektin tunnus: #206389