com.unitesk.requality.documents
public class WordDocExtractor extends WordExtractor
Modifier and Type | Field and Description |
---|---|
protected static java.lang.String |
lateCssStyles |
static float |
TWIPS_PER_INCH |
static int |
TWIPS_PER_PIXEL |
Constructor and Description |
---|
WordDocExtractor(ParseContext context) |
Modifier and Type | Method and Description |
---|---|
static TagAndStyle |
buildParagraphTagAndStyle(java.lang.String styleName,
int hlevel,
boolean isTable)
Given a style name, return what tag should be used, and
what style should be applied to it.
|
java.lang.String |
getCssStyles() |
protected int |
getNumberColumnsSpanned(int[] tableCellEdges,
int currentEdgeIndex,
TableCell tableCell) |
protected int |
getNumberRowsSpanned(Table table,
int currentRowIndex,
int currentColumnIndex,
TableCell tableCell) |
protected int |
getTableCellEdgesIndexSkipCount(Table table,
int r,
int[] tableCellEdges,
int currentEdgeIndex,
int c,
TableCell tableCell) |
static java.lang.String |
ico24ToHtml(int ico24) |
protected void |
parse(DirectoryNode root,
XHTMLContentHandler xhtml) |
protected void |
parseWord6(DirectoryNode root,
XHTMLContentHandler xhtml)
Outputs a section of text if the given text is non-empty.
|
public static final float TWIPS_PER_INCH
public static final int TWIPS_PER_PIXEL
protected static java.lang.String lateCssStyles
protected void parse(DirectoryNode root, XHTMLContentHandler xhtml) throws java.io.IOException, org.xml.sax.SAXException, TikaException
java.io.IOException
org.xml.sax.SAXException
TikaException
protected int getNumberColumnsSpanned(int[] tableCellEdges, int currentEdgeIndex, TableCell tableCell)
protected int getNumberRowsSpanned(Table table, int currentRowIndex, int currentColumnIndex, TableCell tableCell)
protected int getTableCellEdgesIndexSkipCount(Table table, int r, int[] tableCellEdges, int currentEdgeIndex, int c, TableCell tableCell)
public static java.lang.String ico24ToHtml(int ico24)
public java.lang.String getCssStyles()
protected void parseWord6(DirectoryNode root, XHTMLContentHandler xhtml) throws java.io.IOException, org.xml.sax.SAXException, TikaException
xhtml
- XHTML content handlersection
- the class of the <div/> section emittedtext
- text to be emitted, if anyorg.xml.sax.SAXException
- if an error occursjava.io.IOException
TikaException
public static TagAndStyle buildParagraphTagAndStyle(java.lang.String styleName, int hlevel, boolean isTable)