GATE Python Plugin
This plugin provides a processing resource, PythonPr
which allows the editing and running of python code for processing
GATE documents. The Python API for processing documents is the Python gatenlp
package.
The plugin provides its own copy of a specific version of the gatenlp
package which is used by default, but it is possible to
instead use whatever version of the gatenlp
package is installed on the system.
Using the Python Plugin
- Requires GATE 8.6.1 or later
- In the Plugin manager, locate the Python plugin and load it
- Alternately, if you need a different version, in the plugin manager, click the “+” button and enter the Maven coordinates
- uk.ac.gate.plugins
- python
- (the version you need)
Installing / Setting up Python
Before the plugin can be used Python must be installed:
- Python version 3.6 or later (3.7 or later recommended) must be installed
- The python package sortedcontainers must be installed
- Detailed installation instructions
The plugin should be compatible with GATE 8.5 or later and should run on Windows, MacOS and Linux-like operating systems.
Reporting problems
If you encounter problems please:
- first check if your problem is described in the Frequent Problems list
- check github issue tracker
- also check if the problem is in the list of closed issues
- please give as much detail as possible about your OS, GATE version, plugin version, Java version etc.
- please use the issue tracker only to report bugs, other problems and feature requests, for questions about how to use the plugin or other general questions use the GATE mailing list:
- for more general questions the GATE mailing list
- please mention “Python plugin” in the subject
- please give as much detail as possible about your OS, GATE version, plugin version, Java version etc.
Please give as much details as possible about your operating system, GATE version, Java version, Python version and whatever else might be relevant.
Plugin components and help topics
Main topics:
- Detailed installation instructions
- Changes by version
- PythonPr: Processing Resource to process documents with Python, using the gatenlp package.
- Pipeline python-spacy: a ready made application that creates annotations for a document from the result of running Python spaCy on the text.
- Pipeline python-stanford-stanza: a ready made application that runs Stanford Stanza and creates annotations from the result.
- Python Editor
- Command Line Debugging of Python Scripts: how to debug the scripts outside of GATE from the command line
- Frequent Problems: if you encounter problems, please check here first!
- JavaDoc: API documentation
Other help topics:
- PythonPrResult: Language Resource to store over-the-corpus processing results as features
- Multiprocessing: Running the
PythonPr
processing in parallel and combining over-the-corpus results from all processes. - PythonWorkerLr: Language Resource that allows
gatenlp
to control GATE from Python and call the GATE API from Python.