Description:
The goal of this assignment is to get familiar with the Java platform for network programming. You are required to write a simple HTTP client that gets as input a given URL, retrieves the corresponding page, and returns the list of all words and their occurrences in the requested page sorted by the number of occurrences.
You will use the URL class which is an abstraction of the URL identifier. It allows to open a connection to a specific URL, read data from it, and perform a set of operations on the URL. More details about this class can be found in this link: http://java.sun.com/products/jdk/1.1/docs/api/java.net.URL.html
All exceptions must be properly handled.
Extension:
A possible extension to the project is to include a depth level in the input, i.e. the user will input besides the URL a level and the output will be the sorted list of word/occurrences starting from the target page up to the set level (the target page and the pages it is pointing to up to the required level). Completion of this extension will result in a better grade; however, groups of more than one person MUST complete this extension to receive a full grade. It should also be noted that, if you have an idea for a different extension, ask either the instructor or TAs for approval.
Grading:
Grading will be split in the following way:
Submission:
The due date for this project is September 10th, 2002, 11:55pm EST. You should use the assignments drop-off box located at the course's WebCT page. Upload a JAR file containing all the relevant documented Java files, along with a README file describing the project and its usage.