org.supermind.crawl
Class NutchSegmentPagePersister

java.lang.Object
  extended by org.supermind.crawl.NutchSegmentPagePersister
All Implemented Interfaces:
PagePersister

public class NutchSegmentPagePersister
extends java.lang.Object
implements PagePersister

Persists a fetched page to a Nutch segment.


Field Summary
 
Fields inherited from interface org.supermind.crawl.PagePersister
NO_OP
 
Constructor Summary
NutchSegmentPagePersister(org.apache.nutch.fs.NutchFileSystem nfs, java.lang.String directory)
           
NutchSegmentPagePersister(java.lang.String directory)
           
NutchSegmentPagePersister(java.lang.String directory, boolean overwrite)
           
 
Method Summary
 void close()
          Cleans up resources held by the persister once no more pages will be written.
 void outputPage(FetcherOutput fo, org.apache.nutch.protocol.Content content, org.apache.nutch.parse.Parse parse)
          Persists a fetched page.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

NutchSegmentPagePersister

public NutchSegmentPagePersister(org.apache.nutch.fs.NutchFileSystem nfs,
                                 java.lang.String directory)

NutchSegmentPagePersister

public NutchSegmentPagePersister(java.lang.String directory)
                          throws java.io.IOException
Throws:
java.io.IOException

NutchSegmentPagePersister

public NutchSegmentPagePersister(java.lang.String directory,
                                 boolean overwrite)
                          throws java.io.IOException
Throws:
java.io.IOException

Method Detail

close

public void close()
           throws java.io.IOException
Description copied from interface: PagePersister
Cleans up resources held by the persister once no more pages will be written.

Specified by:
close in interface PagePersister
Throws:
java.io.IOException

outputPage

public void outputPage(FetcherOutput fo,
                       org.apache.nutch.protocol.Content content,
                       org.apache.nutch.parse.Parse parse)
                throws java.io.IOException
Description copied from interface: PagePersister
Persists a fetched page.

Specified by:
outputPage in interface PagePersister
Throws:
java.io.IOException
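
Usage sketch

The page above lists the members but not the order in which to call them. The sketch below shows one plausible lifecycle: call outputPage once per fetched page, then close exactly once, even when a write fails. It is written under stated assumptions and does not use the real classes. SketchPersister, RecordingPersister and persistAll are hypothetical stand-ins so that the example compiles without Nutch on the classpath, and a String replaces the documented (FetcherOutput, Content, Parse) arguments.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for PagePersister: only the two documented method
// names are kept, and the Nutch argument types are replaced by a String.
interface SketchPersister {
    void outputPage(String page) throws IOException;
    void close() throws IOException;
}

// Hypothetical in-memory implementation. A real caller would construct
// new NutchSegmentPagePersister(directory) instead (assumption: it is used
// through the same outputPage/close sequence).
class RecordingPersister implements SketchPersister {
    final List<String> pages = new ArrayList<>();
    boolean closed = false;

    @Override
    public void outputPage(String page) throws IOException {
        if (closed) {
            throw new IOException("persister already closed");
        }
        pages.add(page);
    }

    @Override
    public void close() {
        closed = true;
    }
}

class PersisterLifecycleSketch {
    // Writes every page, then closes the persister even if a write fails.
    static void persistAll(SketchPersister persister, List<String> pages) throws IOException {
        try {
            for (String page : pages) {
                persister.outputPage(page);
            }
        } finally {
            persister.close();
        }
    }

    public static void main(String[] args) throws IOException {
        RecordingPersister persister = new RecordingPersister();
        persistAll(persister, Arrays.asList("http://example.com/a", "http://example.com/b"));
        System.out.println(persister.pages.size() + " pages, closed=" + persister.closed);
    }
}
```

The try/finally block is the point of the sketch: both outputPage and close are declared to throw java.io.IOException, so the caller has to decide where that exception is handled while still guaranteeing the cleanup call.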