anl.aida.reader
Class ChicagoTribuneReader

java.lang.Object
  extended by anl.aida.reader.AbstractAIDAComponentReader
      extended by anl.aida.reader.ChicagoTribuneReader
All Implemented Interfaces:
AIDAComponentReader

public class ChicagoTribuneReader
extends AbstractAIDAComponentReader

AIDAComponentReader implementation for reading Chicago Tribune web articles. It uses ChicagoTribuneHTMLReader to read archived web entries and may not work with non-archived material. The list of links should be in a file in the "standard" index format, one entry per line: timestamp:::link:::title:::author. The author field (one or more authors) is optional.
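
To make the index format concrete, here is a minimal sketch of how one line of such a file could be split into its fields. It is not taken from the AIDA sources: the class IndexLineExample, the holder IndexEntry, and the method parse are hypothetical names, and the timestamp is kept as an opaque string because its exact format is not stated here. The sample URL and values are placeholders.

```java
/**
 * Hypothetical helper (not part of anl.aida.reader) that parses one line of
 * an index file in the form timestamp:::link:::title:::author.
 */
public class IndexLineExample {

    /** Field separator, as given in the class description. */
    static final String SEPARATOR = ":::";

    /** Illustrative holder for the parsed fields. */
    static final class IndexEntry {
        final String timestamp; // kept as text; the real format is an assumption
        final String link;
        final String title;
        final String author;    // null when the optional author field is absent

        IndexEntry(String timestamp, String link, String title, String author) {
            this.timestamp = timestamp;
            this.link = link;
            this.title = title;
            this.author = author;
        }
    }

    /** Splits a single index line; throws if it has fewer than 3 or more than 4 fields. */
    static IndexEntry parse(String line) {
        // A limit of -1 keeps trailing empty fields, so "a:::b:::c:::" yields four parts.
        String[] parts = line.split(SEPARATOR, -1);
        if (parts.length < 3 || parts.length > 4) {
            throw new IllegalArgumentException("Unexpected index line: " + line);
        }
        String author = null;
        if (parts.length == 4 && !parts[3].trim().isEmpty()) {
            author = parts[3].trim();
        }
        return new IndexEntry(parts[0].trim(), parts[1].trim(), parts[2].trim(), author);
    }

    public static void main(String[] args) {
        IndexEntry entry = parse(
            "2009-03-01:::http://example.com/story.html:::Sample headline:::Jane Doe");
        System.out.println(entry.link + " | " + entry.title + " | " + entry.author);
    }
}
```

Several authors, if present, are left as a single string in this sketch, since the description does not say how they are delimited.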


Field Summary
private  ContentReader ctReader
           
static java.lang.String INDEX_FILE
           
 
Fields inherited from class anl.aida.reader.AbstractAIDAComponentReader
indexIter, lineItems, location, processors, startDate
 
Fields inherited from interface anl.aida.reader.AIDAComponentReader
MESSAGE_DIGEST
 
Constructor Summary
ChicagoTribuneReader()
           
 
Method Summary
protected  java.lang.String getIndexFileKey()
          Gets the name of the parameter key for the index file.
protected  ReaderResult getNextResult()
          Gets the next ReaderResult.
 void initialize(org.apache.uima.resource.ConfigurableResource resource, java.util.Date startDate)
          Initializes this AIDAComponentReader, optionally using the resource.
 
Methods inherited from class anl.aida.reader.AbstractAIDAComponentReader
checkDate, close, getDocumentURL, getNext, hasNext, postNext
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

INDEX_FILE

public static final java.lang.String INDEX_FILE
See Also:
Constant Field Values

ctReader

private ContentReader ctReader
Constructor Detail

ChicagoTribuneReader

public ChicagoTribuneReader()
Method Detail

getNextResult

protected ReaderResult getNextResult()
                              throws java.io.IOException
Description copied from class: AbstractAIDAComponentReader
Gets the next ReaderResult.

Specified by:
getNextResult in class AbstractAIDAComponentReader
Returns:
the next ReaderResult.
Throws:
java.io.IOException - if there is an error getting the result

getIndexFileKey

protected java.lang.String getIndexFileKey()
Description copied from class: AbstractAIDAComponentReader
Gets the name of the parameter key for the index file. The index file lists the links, and their associated metadata, to read.

Specified by:
getIndexFileKey in class AbstractAIDAComponentReader
Returns:
the name of the parameter key for the index file.

initialize

public void initialize(org.apache.uima.resource.ConfigurableResource resource,
                       java.util.Date startDate)
                throws org.apache.uima.resource.ResourceInitializationException
Description copied from interface: AIDAComponentReader
Initializes this AIDAComponentReader, optionally using the resource.

Specified by:
initialize in interface AIDAComponentReader
Overrides:
initialize in class AbstractAIDAComponentReader
Parameters:
resource - the resource to use for configuration
Throws:
org.apache.uima.resource.ResourceInitializationException - if there is an error initializing the reader