Metadata-Version: 2.1
Name: tableParser
Version: 1.0.1
Summary: It extract the table data from the pdf
Home-page: https://github.com/abhishekthegodfather/tableDataParser.git
Author: Abhishek Biswas
Author-email: abhishekbiswas772@gmail.com
Classifier: License :: OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Description-Content-Type: text/markdown

TableParser

tableParser is a python program using the concept of deep learning for extract the data from tables present in the PDF. It basically takes the file name of the pdf and extrcts the tables from the PDF. This is usefull when we need tables of the reserch papers or some thing like long table. This program uses the pytorch framework (made by facebook) and tesseract for parsing the table data.

This is an Intital stage of the project. It have some bugs hopefully later release will fix this issue. 😊
