Metadata-Version: 2.1
Name: python-libthai
Version: 0.1.4
Summary: Python - libthai binding library.
Home-page: https://github.com/teerasit-ang
Author: Teerasit Angkhaprsertkul
Author-email: teerasit.ang@gmail.com
License: GNU
Description: PyThai
        ======
        
        Some basic python functions for working with the Thai language. For example:
        
        ```python
        import pythai
        
        pythai.split(u"การที่ได้ต้องแสดงว่างานดี")
        >>> u"การ ที่ ได้ ต้อง แสดง ว่า งาน ดี"
        
        pythai.word_count(u"การที่ได้ต้องแสดงว่างานดี")
        >>> 8
        
        pythai.contains_thai(u"hello")
        >>> False
        
        pythai.contains_thai(u"helloการที่ไ")
        >>> True
        ```
        
        It's meant to be fast and efficient enough to handle large documents without breaking a sweat.
        
        Includes
        ------------
        
        Currently the library supports these functions:
        
        - Word segmentation (`split`)
        - Word count (`word_count`) (faster than counting the result of `split`)
        - Whether a string contains Thai or not (`contains_thai`)
        
        
        Installation
        ------------
        
        PyThai requires `libthai-dev` to work. You can install it quite easily:
        
            sudo apt-get install libthai-dev
        
        And then you can simply install `pythai` through **pip**:
        
            pip install python-libthai
        
        More
        ------------
        
        Special thanks to Vee Satayamas for the original python bindings of libthai from C.
        
        
        
        
Keywords: thai language linguistics segmentation
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: GNU Lesser General Public License v3 (LGPLv3)
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Text Processing :: Linguistic
Description-Content-Type: text/markdown
