I have a java library with me that downloads and parses through the HTML Source code of a web page. I want you to use this script, and download the source of 11 web pages at a atime, and then do some simple text operations on the extracted html tags' text. For example, for the operation "Use of keyword in document title" check for the existence of a particular phrase in the titles of all 11 documents. Also calculate keyword density (= % of number of words of keyphrase/total number of words in title ) There are a total of 30 such text operations. A huge-looking report which details all the results of these 30 operations, is to be prepared. Basically, there is a brief description of an operation, followed by the results of the operation, as well as table showing density of the keyphrase. In addition to these 30 operations, a few other operations also have to be done for which I have a php script with me: (1) Determine age of website (2) Determine number of back links to a website (3) Determine number of social network links to a website (4) [url removed, login to view] visitors to a site and alexa rank For the above 4 operations I have a php script which does the task, you have to take a look at the php code and convert it to the java program that you are developing. Finally, the report generated should be a PDF with color coded sections. I have a pdf library for java with me, which also has examples on usage of that library. You have to output the contents of text operations Plus above 4 operations to a PDF File which will be immediately made available to the user who has specified the uRLs of the 11 web pages and key phrase. A sample report is attached with this bid request. Note that the application is a regular java application with a simple admin screen for edit/delete/add users to this application. It should be able to work in Google App ENgine or Microsoft Azure (for both you can code the app just as a regular java application with very minor differences) Regards, Arvind.
1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).
2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment--Deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.
b) For all others including desktop software or software the employer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.
windows or linux with java