Preparing
for the midterm
Believe
it or not, we've actually covered a lot and, I hope, you've learned a lot about
the infrastructure of the World Wide Web.
Here's what I believe you should know based on the material we've gone
over in class (and if you are comfortable with these topics, you should do well
on the midterm):
1
- Architecture of the World Wide Web
Section 1
URIs, resources and
representation
Section 2 - URI best practices
2
- URIs and HTTP headers in Detail
The material covered in the slides at
http://www.cs.rpi.edu/academics/courses/spring08/websci/URI-HTTP-details.pdf
is all fair game
esp. URIs vs. URLs
The
parts of a URL
HTTP response types
(by class, not by number)
The HTTP request
methods
HTTP headers
3
- Web architecture in practice
What is a three-tiered web app (dynamic
content server) and how is it accessed.
4
- spidering
by now you're all experts in this
5
- Caching for efficiency
Server side caching
Conditional gets (the HTTP header fields
used and what they mean)
6
- page rank and search
The page rank formula (and being able
to apply it)
Relationship between crawler and search
engine
Reverse Index and what it comprises
(pages, words, word locations, special features)
I won't promise that I won't add something not directly on this list, but I repeat, if you're comfortable with this material, you should do well.