java.lang.Objectorg.supermind.crawl.NutchSegmentPagePersister
public class NutchSegmentPagePersister
Persists a fetched page to a Nutch segment.
Field Summary |
---|
Fields inherited from interface org.supermind.crawl.PagePersister |
---|
NO_OP |
Constructor Summary | |
---|---|
NutchSegmentPagePersister(org.apache.nutch.fs.NutchFileSystem nfs,
java.lang.String directory)
|
|
NutchSegmentPagePersister(java.lang.String directory)
|
|
NutchSegmentPagePersister(java.lang.String directory,
boolean overwrite)
|
Method Summary | |
---|---|
void |
close()
Cleanup. |
void |
outputPage(FetcherOutput fo,
org.apache.nutch.protocol.Content content,
org.apache.nutch.parse.Parse parse)
Persists a fetched page. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public NutchSegmentPagePersister(org.apache.nutch.fs.NutchFileSystem nfs, java.lang.String directory)
public NutchSegmentPagePersister(java.lang.String directory) throws java.io.IOException
java.io.IOException
public NutchSegmentPagePersister(java.lang.String directory, boolean overwrite) throws java.io.IOException
java.io.IOException
Method Detail |
---|
public void close() throws java.io.IOException
PagePersister
close
in interface PagePersister
java.io.IOException
public void outputPage(FetcherOutput fo, org.apache.nutch.protocol.Content content, org.apache.nutch.parse.Parse parse) throws java.io.IOException
PagePersister
outputPage
in interface PagePersister
java.io.IOException