Installation

Make sure you have Python 3.6 or later installed. Python version 3.7 or later is highly recommended!

The recommended way to install Python is to use Conda by installing one of

Alternately,

Then create an environment for working with gatenlp. This example creates an environment with the name gatenlp and activates it:

conda create -n gatenlp python==3.8
conda activate gatenlp

The gatenlp has a number of optional dependencies which are only needed if some special features of gatenlp are used.

To install gatenlp with the minimal set of dependencies run:

python -m pip install gatenlp 

To upgrade an already installed gatenlp package to the latest version run:

python -m pip install -U gatenlp 

To install gatenlp with all dependencies run:

python -m pip install gatenlp[all]

To upgrade to the latest version with all dependencies:

python -m pip install  -U gatenlp[all]

NOTE: if this fails because of a problem installing torch (this may happen on Windows), first install Pytorch separately according to the Pytorch installation instructions, see: https://pytorch.org/get-started/locally/ then run the gatenlp installation again.

The following specific dependencies included in ‘all’ can be chosen separately:

The following dependencies are not included in ‘all’ but in ‘alldev’ or can be chosen separately:

Example: to install gatenlp with support for stanza and spacy and serialization:

python -m pip install gatenlp[stanza,spacy,formats]

To install the latest gatenlp code from GitHub with all dependencies:

To also install everything needed for development use “alldev”:

python -m pip install -e .[alldev]

Requirements for using the GATE slave:

Requirements for running gatenlp in a Jupyter notebook:

To create a kernel for your conda environment run:

python -m ipykernel install --user --name gatenlp --display-name "Python gatenlp"

The available kernels can be listed with jupyter kernelspec list

To run and show a notebook run the following and use “Kernel - Change Kernel” in the notebook to choose the gatenlp environment speicific kernel:

jupyter notebook notebookname.ipynb

If you prefer Jupyter lab:

python -m pip install jupyterlab

and then start Jupyter lab with:

jupyter lab

In Jupyter lab, you can work on Jupyter notebooks but also use an interactive console which is also able to visualize documents interactively.

Requirements for development: