anl.aida.reader
Class NYTimesArchiveReader

java.lang.Object
  extended by anl.aida.reader.NYTimesArchiveReader

public class NYTimesArchiveReader
extends java.lang.Object

Quick and dirty class to read nytimes_archive_index.txt and read / parse each article in the index using the NYTimesHTMLReader.


Field Summary
private static java.lang.String DELIMITER
           
 
Constructor Summary
NYTimesArchiveReader()
           
 
Method Summary
 void run(ReaderResultProcessor proc)
          Parses the lines in the nytimes archive index file for url, title, etc.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

DELIMITER

private static final java.lang.String DELIMITER
See Also:
Constant Field Values
Constructor Detail

NYTimesArchiveReader

public NYTimesArchiveReader()
Method Detail

run

public void run(ReaderResultProcessor proc)
         throws java.io.IOException,
                java.text.ParseException,
                AIDAException
Parses the lines in the nytimes archive index file for url, title, etc. These are then passed to the NYTimesHTMLReader. The result of that is then passed to the ReaderResultProcessor.

Parameters:
proc -
Throws:
java.io.IOException
java.text.ParseException
AIDAException