We require an application that can analyse and show which specific news articles appear in the first half of a news website's page, and in what order they appear. The application should provide updated data every 1-2 minutes, showing which news articles/URLs have been placed at the top of the page and which follow further down the first half of the page.
The application will index a set group of approximately ten news websites (URLs provided) and will analyse the top half of each page to show which news articles appear first and which follow down the page. This is to give an indication of the order in which news articles appear, and of the editorial decisions being made by a publisher.
The website [url removed, login to view] displays news articles. The main index page shows a main headline, followed by three or four other news articles in descending order. The application should be able to tell which news article/URL appears first, and so on for the second, third and fourth article. The "order" of the articles, together with their headlines/URLs, is the subject of the analysis.
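As a rough illustration of the "order" analysis described above, the sketch below collects article links from a page's HTML in the order they occur and numbers them. It is only a minimal sketch under stated assumptions: it assumes that the order of links in the HTML source approximates the visual order on the page, which may not hold for every site, and the names `LinkCollector` and `top_articles` are hypothetical. A real implementation would likely need per-site rules to pick out the headline area shown in Image 1.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects (url, headline) pairs for <a> tags in document order."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []      # (absolute_url, headline_text) in source order
        self._href = None    # href of the <a> currently being read, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace inside the headline text.
            headline = " ".join("".join(self._text).split())
            self.links.append((self._href, headline))
            self._href = None


def top_articles(html, base_url, limit=5):
    """Return [(position, url, headline), ...] for the first `limit` links.

    Assumption: source order stands in for on-page order. Links without
    text and repeated URLs are skipped.
    """
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    seen = set()
    ranked = []
    for url, headline in parser.links:
        if not headline or url in seen:
            continue
        seen.add(url)
        ranked.append((len(ranked) + 1, url, headline))
        if len(ranked) == limit:
            break
    return ranked
```

Calling `top_articles(page_html, "https://news.example/")` on a downloaded index page would give a numbered list of URLs and headlines; the address here is a placeholder, not one of the sites to be analysed.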
The application should then return this information to a database every 1-2 minutes, updating which news articles/URLs the website is displaying, and in what order (for the first half of the page only). Across up to 10 news websites, the database should show updated information at a glance, indicating which news article each website has promoted to the major headline and which articles follow in order. The contents of the database should be displayed in a simple webpage that shows which website is being analysed, and the order of ~1-5 news articles on its page.
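To make the database requirement more concrete, here is one possible way to store each 1-2 minute reading and read back the most recent order for a website. This is a sketch only: it assumes SQLite as the database (the brief does not specify one), and the table name `snapshot` and the functions `open_db`, `save_snapshot` and `latest_order` are hypothetical names chosen for illustration.

```python
import sqlite3
from datetime import datetime, timezone

# One row per article position per reading (assumed layout, not a requirement).
SCHEMA = """
CREATE TABLE IF NOT EXISTS snapshot (
    site        TEXT    NOT NULL,
    captured_at TEXT    NOT NULL,  -- ISO 8601 timestamp in UTC
    position    INTEGER NOT NULL,  -- 1 = major headline
    url         TEXT    NOT NULL,
    headline    TEXT    NOT NULL,
    PRIMARY KEY (site, captured_at, position)
)
"""


def open_db(path=":memory:"):
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def save_snapshot(conn, site, ranked, captured_at=None):
    """Store one reading: `ranked` is a list of (position, url, headline)."""
    if captured_at is None:
        captured_at = datetime.now(timezone.utc).isoformat(timespec="seconds")
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT INTO snapshot VALUES (?, ?, ?, ?, ?)",
            [(site, captured_at, pos, url, headline)
             for pos, url, headline in ranked],
        )
    return captured_at


def latest_order(conn, site):
    """Return the most recent reading for `site`, ordered by position."""
    return conn.execute(
        """
        SELECT position, url, headline FROM snapshot
        WHERE site = ? AND captured_at = (
            SELECT MAX(captured_at) FROM snapshot WHERE site = ?
        )
        ORDER BY position
        """,
        (site, site),
    ).fetchall()
```

A scheduler or simple loop could call `save_snapshot` for each of the ~10 websites every 1-2 minutes, and the webpage could call `latest_order` per website to show the current top ~1-5 articles. Keeping every reading, rather than overwriting, would also allow changes in order to be reviewed over time.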
Please see the attached image (Image 1), which indicates the part of the news website the application should focus on. A full list of the ~10 news websites for analysis will be provided. They are all established, well-programmed websites that use appropriate languages etc.
Please feel free to ask any questions you might have. The budget is flexible; please be honest in your quotations and ensure you are capable of completing this task. There is no set deadline, other than that the work must be delivered within the next 2 months.
We will provide all designs for the front end that displays the near-real-time data coming from the websites. We will attach the designs to this project ASAP.