Homework 2 FAQ

  1. What kind of HTML link tags must we account for?
    Your program must accept any HTML link tags that are valid HTML. In particular, this means that neither too much nor too little whitespace should affect your program. Nor should the presence or absence of quotes around the link. Nor should the case of the tags
  2. What if the tag references stuff that's not text (eg: images, more HTML tags, etc)?
    Don't worry about them - print out the references as though they were plain text.
  3. I don't know much HTML. What do links look like?
    This is a sample HTML link:
    <a href="http://www.rpi.edu">RPI homepage</a>
    In this example, 'http://www.rpi.edu' is the link - the address you go to if you click the underlined words. 'RPI homepage' is the reference, the actual clickable words.
    Some things to note:
  4. Does "RCS Machine" mean a Unix machine on the RCS network, or a Windows box that can connect to RCS?
    For this assignment, and all future references in this class, "RCS Machine" refers to an RCS Unix Computer. This can be either one of the physical terminals in the CII, JEC, VCC, or Library, or one of the remote terminals you can SSH into - such as rcs.rpi.edu.
  5. Are "nested" links allowed in HTML? ie, does our program need to account for things like
    <a href="..."> ... <a href="..."> ... </a> ... </a>
    No. This is not legal HTML, and your program does not need to account for it.
  6. Can we assume that the entire link will be on one line?
    No. The text between <a... and </a> may include anything, including newlines. See the Sample HTML file for an example
  7. Do we need to search for HTML files in the subdirectories of the public_html directory?
    No. Only search the public_html subdirectory of the directory entered on the command line. Do not search farther than that.