Class | Description |
---|---|
ConllDocumentFormat |
Document format for handling CoNLL/IOB documents:
He PRP B-NP
accepted VBD B-VP
the DT B-NP
position NN I-NP
...
|
CorpusImpl |
Corpora are sets of Document.
|
DocumentContentImpl |
Represents the commonalities between all sorts of document contents.
|
DocumentData | |
DocumentImpl |
Represents the commonalities between all sorts of documents.
|
DocumentJsonUtils |
This class contains utility methods to output GATE documents in a
JSON format which is (deliberately) close to the format used by
Twitter to represent entities such as user mentions and hashtags in
Tweets.
|
DocumentStaxUtils |
This class provides support for reading and writing GATE XML format
using StAX (the Streaming API for XML).
|
DocumentXmlUtils |
This class is contains useful static methods for working with the GATE XML
format.
|
EmailDocumentFormat |
The format of Documents.
|
MimeType |
A very basic implementation for a MIME Type.
|
NekoHtmlDocumentFormat |
DocumentFormat that uses Andy Clark's NekoHTML
parser to parse HTML documents.
|
ObjectWrapper |
Class used to wrap arbitrary values prior to saving as GATE XML.
|
RepositioningInfo |
RepositioningInfo keep information about correspondence of positions
between the original and extracted document content.
|
SerialCorpusImpl | |
SgmlDocumentFormat |
The format of Documents.
|
TextualDocumentFormat |
The format of Documents.
|
TikaFormat | |
UimaDocumentFormat |
UIMA XCAS and XMICAS document formats.
|
XmlDocumentFormat |
The format of Documents.
|
Exception | Description |
---|---|
SynchronisationException |
Copyright © 2024 GATE. All rights reserved.