liuslite.utils
Class MSExtractor

java.lang.Object
  extended by liuslite.utils.MSExtractor

public abstract class MSExtractor
extends java.lang.Object

Defines a Microsoft document content extractor.

Author:
Jérôme Charron

Constructor Summary
MSExtractor()
          Constructs a new Microsoft document extractor.
 
Method Summary
 void extract(java.io.InputStream input)
          Extracts properties and text from an MS Document input stream
abstract  java.lang.String extractText(java.io.InputStream input)
          Extracts the text content from a Microsoft document input stream.
 void setContents(java.util.List<Content> contents)
           
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

MSExtractor

public MSExtractor()
Constructs a new Microsoft document extractor.

Method Detail

setContents

public void setContents(java.util.List<Content> contents)

extract

public void extract(java.io.InputStream input)
             throws java.lang.Exception
Extracts properties and text from an MS Document input stream

Throws:
java.lang.Exception

extractText

public abstract java.lang.String extractText(java.io.InputStream input)
                                      throws java.lang.Exception
Extracts the text content from a Microsoft document input stream.

Throws:
java.lang.Exception