Metadata-Version: 2.1
Name: pyxpdf
Version: 0.2.3
Summary: Powerful and Pythonic PDF processing library based on xpdf-4.02
Home-page: https://github.com/ashutoshvarma/pyxpdf
Author: Ashutosh Varma
Author-email: ashutoshvarma11@live.com
Maintainer: Ashutosh Varma
Maintainer-email: ashutoshvarma11@live.com
License: GPL
Description: pyxpdf
        ======
        pyxpdf is a fast and memory efficient python module for parsing PDF documents based on xpdf reader sources.
        
        
        .. start-badges
        
        .. list-table::
            :stub-columns: 1
        
            * - docs
              - |docs|
            * - tests
              - |azure| |travis| |codecov| 
            * - package
              - |pypi| |pythonver| |wheel| |downloads|
            * - license
              - |license|
        
        .. end-badges
        
        Features
        --------
        - Almost x20 times faster than pure python based pdf parsers (see `Speed Comparison`_)
        - Extract text while maintaining original document layout (best possible)
        - Support almost all PDF encodings, CMaps and predefined CMaps.
        - Extract LZW, RLE, CCITTFax, DCT, JBIG2 and JPX compressed images and image masks along with their BBox.
        - Render PDF Pages as image with support of '1', 'L', 'LA', 'RGB', 'RGBA' and 'CMYK' color modes.
        - No explict dependencies (except optional ones, see `Installation`_)
        - Thread Safe
        
        More Information
        ----------------
        
        - `Documentation <https://pyxpdf.readthedocs.io/>`_
        
          - `Installation`_
          - `Quickstart <https://pyxpdf.readthedocs.io/en/latest/intro.html#quick-start>`_
        
        - `Contribute <https://github.com/ashutoshvarma/pyxpdf/blob/master/.github/CONTRIBUTING.md>`_
        
          - `Build <https://github.com/ashutoshvarma/pyxpdf/blob/master/BUILD.rst>`_
          - `Issues <https://github.com/ashutoshvarma/pyxpdf/issues>`_
          - `Pull requests <https://github.com/ashutoshvarma/pyxpdf/pulls>`_
        
        - `Speed Comparison`_
        
        - `Changelog <https://pyxpdf.readthedocs.io/en/latest/changelog.html>`_
        
        License
        -------
        ``pyxpdf`` is licensed under the GNU General Public License (GPL), version 3. See the `LICENSE <https://github.com/ashutoshvarma/pyxpdf/blob/master/LICENSE>`_
        
        Credits
        -------
        - `xpdf reader <https://www.xpdfreader.com/>`_ by Derek Noonburg
        - `lxml <https://www.github.com/lxml/lxml>`_ - project structure and build adapted from lxml
        - `poppler <https://poppler.freedesktop.org/>`_ project
        
        .. _`Speed Comparison`: https://pyxpdf.readthedocs.io/en/latest/compare.html
        .. _`Installation`: https://pyxpdf.readthedocs.io/en/latest/intro.html#installation
        
        .. |azure| image:: https://img.shields.io/azure-devops/build/ashutoshvarma/pyxpdf/1/master?label=Azure%20Pipelines&style=for-the-badge   
           :alt: Azure DevOps builds (branch)
           :target: https://ashutoshvarma.visualstudio.com/pyxpdf/_build
        .. |travis| image:: https://img.shields.io/travis/com/ashutoshvarma/pyxpdf?label=Travis&style=for-the-badge   
           :alt: Travis (.com)
           :target: https://travis-ci.com/github/ashutoshvarma/pyxpdf     
        .. |docs| image:: https://img.shields.io/readthedocs/pyxpdf?style=for-the-badge         
           :alt: Read the Docs
           :target: https://pyxpdf.readthedocs.io/en/latest/
                  
        .. |codecov| image:: https://img.shields.io/codecov/c/github/ashutoshvarma/pyxpdf?style=for-the-badge   
           :alt: Codecov
           :target: https://codecov.io/gh/ashutoshvarma/pyxpdf/
                     
        .. |license| image:: https://img.shields.io/github/license/ashutoshvarma/pyxpdf?style=for-the-badge   
           :alt: GitHub
           :target: https://github.com/ashutoshvarma/pyxpdf/blob/master/LICENSE
                     
        .. |pypi| image:: https://img.shields.io/pypi/v/pyxpdf?color=light&style=for-the-badge   
           :alt: PyPI
           :target: https://pypi.org/project/pyxpdf/
        
        .. |pythonver| image:: https://img.shields.io/pypi/pyversions/pyxpdf?style=for-the-badge   
           :alt: PyPI - Python Version
           :target: https://pypi.org/project/pyxpdf/
        
        .. |wheel| image:: https://img.shields.io/pypi/wheel/pyxpdf?style=for-the-badge   
           :alt: PyPI - Wheel
           :target: https://pypi.org/project/pyxpdf/
                   
        .. |downloads| image:: https://img.shields.io/pypi/dm/pyxpdf?label=PyPI%20Downloads&style=for-the-badge   
           :alt: PyPI - Downloads
           :target: https://pypi.org/project/pyxpdf/
        
Keywords: pdf parser,pdf converter,text mining,xpdf bindings
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Programming Language :: Cython
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: C++
Classifier: Operating System :: OS Independent
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Text Processing
Requires-Python: !=3.0.*, !=3.1.*, !=3.2.*, !=3.3.*, != 3.4.*
Description-Content-Type: text/x-rst
Provides-Extra: source
Provides-Extra: dev
Provides-Extra: encoding
Provides-Extra: imaging
