Metadata-Version: 2.1
Name: nlpcleaner
Version: 0.3.0
Summary: Clean and prepare text for modeling with machine learning
Home-page: https://github.com/giovannelli/nlpcleaner
Author: Duccio Giovannelli
Author-email: giovannelli@extendi.it
License: MIT
Description: # Nlpcleaner [![Build Status](https://travis-ci.org/giovannelli/nlpcleaner.svg?branch=master)](https://travis-ci.org/giovannelli/nlpcleaner)
        
        Clean and prepare text for modeling with machine learning.
        - lower all
        - strip all
        - remove numbers
        - remove symbols
        - remove url
        - strip html tags
        - remove stopwords by detected language
        - lemming or stemming
        
        ## Usage
        
        ```
        from nlpcleaner import TextCleaner
        TextCleaner(txt).clean()
        ```
        
        ## Tests
        
        ```
        pipenv install .
        python setup.py test
        ```
        
        ## Push on PyPi
        
        ```
        python setup.py sdist
        pip install twine
        twine upload dist/*
        ```
        
        ## TODO
        * Add tests to cover different cases and languages;
        * check performances
        
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Description-Content-Type: text/markdown
