Metadata-Version: 2.1
Name: html_stripper
Version: 0.3
Summary: A simple package to extract text from (even broken/invalid) HTML
Home-page: https://gitlab.com/helb/html_stripper
Author: Jiri Helebrant
Author-email: helb@helb.cz
License: UNKNOWN
Description: A simple package to extract text from (even broken/invalid) HTML. No dependencies, it just uses Python's internal `HTMLParser` with a few tweaks.
        
        Usage:
        
        ```python
        from html_stripper import strip_tags
        text = strip_tags("<html>…")
        ```
        
        ```python
        from html_stripper import strip_tags
        import requests
        strip_tags(requests.get("https://foo.bar/").text)
        ```
        
        ```python
        from html_stripper import strip_tags, strip_multiple_newlines
        text = strip_multiple_newlines(strip_tags("<html>…")) # replaces chained newlines with a single \n
        ```
        
Keywords: html
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
