Metadata-Version: 2.1
Name: fast-sentence-segment
Version: 0.1.7
Summary: Fast and Efficient Sentence Segmentation
Home-page: https://github.com/craigtrim/fast-sentence-segment
License: None
Keywords: nlp,text,preprocess,segment
Author: Craig Trim
Author-email: craigtrim@gmail.com
Maintainer: Craig Trim
Maintainer-email: craigtrim@gmail.com
Requires-Python: >=3.8.5,<4.0.0
Classifier: Development Status :: 4 - Beta
Classifier: License :: Other/Proprietary License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.9
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Dist: baseblock
Requires-Dist: spacy (==3.5.0)
Project-URL: Bug Tracker, https://github.com/craigtrim/fast-sentence-segment/issues
Project-URL: Repository, https://github.com/craigtrim/fast-sentence-segment
Description-Content-Type: text/markdown

# Fast Sentence Segmentation (fast-sentence-segment)
Fast and Efficient Sentence Segmentation

Usage
```python
from fast_sentence_segment import segment_text

results = segment_text(
    'here is a dr. who says something.  and then again, what else?  i dont know.  Do you?')

assert results == [
    [
        'here is a dr. who says something.',
        'and then again, what else?',
        'i dont know.',
        'Do you?'
    ]
]
```

Why is the result a nested list?

Segmentation splits the input into paragraphs and then into sentences. A paragraph contains one or more sentences, so each inner list holds the sentences of one paragraph, and the outer list holds the paragraphs.
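
As a rough illustration of that shape, the sketch below walks a two-paragraph result. The `results` value is written by hand as an assumption of what a two-paragraph input might produce; it is not output captured from `segment_text`.

```python
# Hand-written stand-in for a segment_text() result (assumed shape:
# a list of paragraphs, each paragraph a list of sentence strings).
results = [
    ['here is a dr. who says something.', 'and then again, what else?'],
    ['i dont know.', 'Do you?'],
]

# Number of sentences in each paragraph.
sentence_counts = [len(paragraph) for paragraph in results]

# Visit every sentence while keeping track of its paragraph.
for paragraph_index, paragraph in enumerate(results):
    for sentence_index, sentence in enumerate(paragraph):
        print(f'paragraph {paragraph_index}, sentence {sentence_index}: {sentence}')
```

Keeping the paragraph level is useful when downstream code needs to know which sentences belong together.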

To get a single list of sentences instead, pass `flatten=True`:
```python
results = segment_text(input_text, flatten=True)
```
This returns a flat list of strings, ignoring paragraph boundaries.
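
The effect of flattening can be sketched without the library by collapsing a nested result with `itertools.chain`. The `nested` value below is a hand-written stand-in for a paragraph-level result, and whether `flatten=True` is implemented this way internally is an assumption; the sketch only shows the expected relationship between the two output shapes.

```python
from itertools import chain

# Hand-written stand-in for a nested (paragraph-level) result.
nested = [
    ['here is a dr. who says something.', 'and then again, what else?'],
    ['i dont know.', 'Do you?'],
]

# Concatenate the paragraphs into one list of sentences, preserving order.
flat = list(chain.from_iterable(nested))
```

Here `flat` holds the same four sentences in their original order, with the paragraph grouping removed.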

