org.supermind.crawl.scope
Class OneExternalLinkFLFilter

java.lang.Object
  extended by org.supermind.crawl.scope.OneExternalLinkFLFilter
All Implemented Interfaces:
ScopeFilter<FetchListScope.Input>

public class OneExternalLinkFLFilter
extends java.lang.Object
implements ScopeFilter<FetchListScope.Input>

Allows a URL if its parent has the same host as its seed. This effectively allows all URLs within the same host, as well as one link outside the seed's host.


Field Summary
 
Fields inherited from interface org.supermind.crawl.scope.ScopeFilter
ABSTAIN, ALLOW, REJECT
 
Constructor Summary
OneExternalLinkFLFilter()
           
 
Method Summary
 int filter(FetchListScope.Input input)
          Filter the input.
 void setSeedSource(CrawlSeedSource seedSource)
          Set CrawlSeedSource.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

OneExternalLinkFLFilter

public OneExternalLinkFLFilter()
Method Detail

filter

public int filter(FetchListScope.Input input)
Description copied from interface: ScopeFilter
Filter the input. Possible return values are ScopeFilter.ALLOW, ScopeFilter.REJECT and ScopeFilter.ABSTAIN.

Specified by:
filter in interface ScopeFilter<FetchListScope.Input>
Returns:

setSeedSource

public void setSeedSource(CrawlSeedSource seedSource)
Set CrawlSeedSource. Note: CrawlSeedSource implementation must support random access by seed id.

Parameters:
seedSource -
See Also:
CrawlSeedSource.getSeedURL(int)