com.unitesk.requality.documents
Class WordDocExtractor
java.lang.Object
WordExtractor
com.unitesk.requality.documents.WordDocExtractor
public class WordDocExtractor
- extends WordExtractor
Method Summary |
static TagAndStyle |
buildParagraphTagAndStyle(java.lang.String styleName,
boolean isTable)
|
protected void |
parse(POIFSFileSystem filesystem,
XHTMLContentHandler xhtml)
|
protected void |
parseWord6(POIFSFileSystem filesystem,
XHTMLContentHandler xhtml)
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TWIPS_PER_INCH
public static final float TWIPS_PER_INCH
- See Also:
- Constant Field Values
TWIPS_PER_PT
public static final int TWIPS_PER_PT
- See Also:
- Constant Field Values
WordDocExtractor
public WordDocExtractor(ParseContext context)
parse
protected void parse(POIFSFileSystem filesystem,
XHTMLContentHandler xhtml)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
- Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
parseWord6
protected void parseWord6(POIFSFileSystem filesystem,
XHTMLContentHandler xhtml)
throws java.io.IOException,
org.xml.sax.SAXException,
TikaException
- Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
buildParagraphTagAndStyle
public static TagAndStyle buildParagraphTagAndStyle(java.lang.String styleName,
boolean isTable)