| Class | Description |
|---|---|
| ConllDocumentFormat |
Document format for handling CoNLL/IOB documents:
He PRP B-NP
accepted VBD B-VP
the DT B-NP
position NN I-NP
...
|
| CorpusImpl |
Corpora are sets of Document.
|
| DocumentContentImpl |
Represents the commonalities between all sorts of document contents.
|
| DocumentData | |
| DocumentImpl |
Represents the commonalities between all sorts of documents.
|
| DocumentJsonUtils |
This class contains utility methods to output GATE documents in a
JSON format which is (deliberately) close to the format used by
Twitter to represent entities such as user mentions and hashtags in
Tweets.
|
| DocumentStaxUtils |
This class provides support for reading and writing GATE XML format
using StAX (the Streaming API for XML).
|
| DocumentXmlUtils |
This class is contains useful static methods for working with the GATE XML
format.
|
| EmailDocumentFormat |
The format of Documents.
|
| MimeType |
A very basic implementation for a MIME Type.
|
| NekoHtmlDocumentFormat |
DocumentFormat that uses Andy Clark's NekoHTML
parser to parse HTML documents.
|
| ObjectWrapper |
Class used to wrap arbitrary values prior to saving as GATE XML.
|
| RepositioningInfo |
RepositioningInfo keep information about correspondence of positions
between the original and extracted document content.
|
| SerialCorpusImpl | |
| SgmlDocumentFormat |
The format of Documents.
|
| TextualDocumentFormat |
The format of Documents.
|
| TikaFormat | |
| UimaDocumentFormat |
UIMA XCAS and XMICAS document formats.
|
| XmlDocumentFormat |
The format of Documents.
|
| Exception | Description |
|---|---|
| SynchronisationException |
Copyright © 2024 GATE. All rights reserved.