CSCI.4962 Internet Computing

Fall, 2001

Programming Assignment 3.

You must work independently on this project. Do not show your code to other students and do not look at other students' code. Do not put your code in a public directory or otherwise make it public. However, you may get all the help you need from the TA or the instructor.  You are encouraged to use the WebCT Discussions page to post problems on installing the necessary software, or understanding the assignment, so that other students can also see the answers.


The goal of this assignment is to create an open, dynamically reconfigurable Web search engine using the SALSA programming language and the World Wide Computer.

The search engine should be started with a UAN, an initial UAL, an initial URL, and a depth search level.  Your engine should fetch the initial URL, indexing word occurrences in that document and producing a list of URLs to fetch in the next iteration.  It must keep indexing documents until the depth search level is reached.

Your search engine can be queried at any point, even while building the index.  A query contains a single word.  The search engine should return the URLs for the documents that contain the word sorted by the number of occurrences of the word in each document.  A search engine client may want to produce HTML to display those results.

To support fault-tolerance, your search engine can be requested to move to a different machine.  This can happen at any point, for example, while building its index or responding to a query.  Migration should be transparent to the clients, i.e., there should be no changes to the code of the search engine clients.

[Optional Extra 10 Points] Finally, to support load balancing, you need to create a meta-engine. The meta-engine is started with the names of two or more search engines (as defined above) and it is able to combine their query results into a single result. When you query the meta-engine, the meta-engine queries the search engines that it represents (independently of their location, since it is just a client from their perspective) and combines their results into a single result. A client of the meta-engine should not be able to tell the difference between a meta-engine and a normal search engine (the interface should be the same).

To learn more about SALSA/WWC, see OOPSLA 2001 paper. To download and install SALSA, see http://osl.cs.uiuc.edu/salsa/

Submission:

The due date for this project is November 8th, 2001, 11:55pm EST.  You should use the assignments drop-off box located at the course's WebCT page.  Upload a JAR file containing all the relevant documented  files, along with a README file containing instructions on running the program.