org.supermind.crawl
Interface PostFetchProcessor

All Known Implementing Classes:
LoggingPostFetchProcessor, MapFileContentSeenFilter, NutchSegmentPostFetchProcessor, PostFetchProcessorChain

public interface PostFetchProcessor

Processes a fetched web page.


Field Summary
static PostFetchProcessor NO_OP
           
 
Method Summary
 void close()
          Cleanup.
 void process(FetcherOutput fo, org.apache.nutch.protocol.Content content, org.apache.nutch.parse.Parse parse)
          Processes a fetched page.
 

Field Detail

NO_OP

static final PostFetchProcessor NO_OP
Method Detail

close

void close()
           throws java.io.IOException
Cleanup.

Throws:
java.io.IOException

process

void process(FetcherOutput fo,
             org.apache.nutch.protocol.Content content,
             org.apache.nutch.parse.Parse parse)
             throws java.io.IOException
Processes a fetched page.

Parameters:
fo -
content -
parse - Parse data, can be null if parse failed
Throws:
java.io.IOException