Metadata-Version: 2.1
Name: elit_tokenizer
Version: 1.0
Summary: English Tokenizer from ELIT
Home-page: https://github.com/emorynlp/elit-tokenizer
Author: Jinho D. Choi
Author-email: jinho.choi@emory.edu
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
License-File: LICENSE

# ELIT Tokenizer

[ELIT](https://github.com/emorynlp/elit/) (Emory Information and Language Technology) features several tokenizers
to split text into a sequence of tokens and segment them into sentences.
This project is led by the [Emory NLP Research Laboratory](https://www.emorynlp.edu) and under the [Apache 2.0](LICENSE) license.

* Latest release: [1.0](https://pypi.org/project/elit_tokenizer/) (10/15/2021)

## Installation

Python 3.7 or higher is recommended: 

```
pip install elit_tokenizer
```

## Documentations

* [Space Tokenizer](https://github.com/emorynlp/elit-tokenizer/blob/master/docs/SpaceTokenizer.md)
* [English Tokenizer](https://github.com/emorynlp/elit-tokenizer/blob/master/docs/EnglishTokenizer.md)

## Contact

* [Jinho D. Choi](http://www.cs.emory.edu/~choi)

