liuslite.parser.html
Class HtmlParser

java.lang.Object
  extended by liuslite.parser.Parser
      extended by liuslite.parser.html.HtmlParser

public class HtmlParser
extends Parser

Html parser

Author:
Rida Benjelloun (ridabenjelloun@apache.org)

Constructor Summary
HtmlParser()
           
 
Method Summary
 Content getContent(java.lang.String name)
          Get a content object, this object is configured from the LiusConfig Xml.
 java.util.List<Content> getContents()
          Get a List of contents objects, this objects are configured from the LiusConfig Xml file.
 java.lang.String getStrContent()
          Get the string content of the document
 
Methods inherited from class liuslite.parser.Parser
configure, getInputStream, getMimeType, getParserConfig, setInputStream, setMimeType
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HtmlParser

public HtmlParser()
Method Detail

getContent

public Content getContent(java.lang.String name)
Description copied from class: Parser
Get a content object, this object is configured from the LiusConfig Xml. It could be a document metadata, XPath selection, regex selection or fulltext

Specified by:
getContent in class Parser

getContents

public java.util.List<Content> getContents()
Description copied from class: Parser
Get a List of contents objects, this objects are configured from the LiusConfig Xml file. It could be a document metadata, XPath selection, regex selection or fulltext

Specified by:
getContents in class Parser

getStrContent

public java.lang.String getStrContent()
Description copied from class: Parser
Get the string content of the document

Specified by:
getStrContent in class Parser