RSS Article Scraper

RSS Article Scraper

Most RSS feeds from news organizations only provide a summary of the article which is syndicated with the RSS Feed, not the full article. To access the full article one has to go to the related link and access the underlying website from the content source

Project is a to write a server-run script which

1. Monitors a list of RSS Feeds

2. Tracks back to the ultimate source of the content (usually the content website like New York TImes, Washington Post, Engadget, etc.)

3) Collects

- Title & Caption

- All Article Images

- Full Article Text

4) Remove Ads or unwanted content (with pre specified tags)

5) Creates a new private RSS Feed with full article contents

Taidot: Perl, PHP, XML

Näytä lisää: washington times, new york post, remove rss article, full website script, access new york, rss, post article, cgi images, Caption, article summary, article news, article 3, source rss, feed full article, feeds ads, rss feed full content, access summary, rss full, unwanted, rss project server, rss full content, content feed script, full text feed php, php full text rss

About the Employer:
( 2 reviews ) New York, United States

Projektin tunnus: #425954