Metadata-Version: 2.1
Name: toolwiki
Version: 0.1.1
Summary: A Python library to extract information from Wikipedia pages.
Home-page: https://github.com/santoshbs/toolwiki
Author: Santosh Srinivas
Author-email: santosh.b.srinivas@outlook.com
License: MIT
Keywords: python,html table rowspan colspan,scrape parse extract,dataframe
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
License-File: LICENSE

# Extract Information From Wikipedia Pages 

This is a Python library for extracting information from Wikipedia pages.

## Contents

- `tables.py` includes a function to extract information from a Wikipedia table as a pandas DataFrame.
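
Wikipedia tables often use `rowspan` and `colspan`, so a cell can cover several rows or columns. The sketch below illustrates one common way to flatten such a table into a rectangular grid using only the standard library. It is not toolwiki's implementation: the names `_CellCollector` and `expand_table` and the sample HTML are made up for illustration, and the input is assumed to be a single, non-nested table.

```python
from html.parser import HTMLParser


class _CellCollector(HTMLParser):
    """Collect [text, rowspan, colspan] for every cell in an HTML fragment.

    Hypothetical helper for illustration; assumes one non-nested table.
    """

    def __init__(self):
        super().__init__()
        self.rows = []     # one list of cells per <tr>
        self._cell = None  # the cell currently being read, if any

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("td", "th") and self.rows:
            a = dict(attrs)
            # A missing or empty span attribute counts as 1.
            self._cell = ["", int(a.get("rowspan") or 1), int(a.get("colspan") or 1)]

    def handle_data(self, data):
        if self._cell is not None:
            self._cell[0] += data

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._cell[0] = self._cell[0].strip()
            self.rows[-1].append(self._cell)
            self._cell = None


def expand_table(html):
    """Return a rectangular list of rows, repeating the text of spanned cells."""
    parser = _CellCollector()
    parser.feed(html)
    grid = {}  # (row, col) -> cell text
    for r, cells in enumerate(parser.rows):
        c = 0
        for text, rowspan, colspan in cells:
            while (r, c) in grid:  # skip slots already filled by a rowspan from above
                c += 1
            for dr in range(rowspan):
                for dc in range(colspan):
                    grid[(r + dr, c + dc)] = text
            c += colspan
    if not grid:
        return []
    n_rows = max(r for r, _ in grid) + 1
    n_cols = max(c for _, c in grid) + 1
    return [[grid.get((r, c), "") for c in range(n_cols)] for r in range(n_rows)]


# Made-up sample table with one rowspan and one colspan.
html = """
<table>
  <tr><th>Event</th><th>Date</th><th>Venue</th></tr>
  <tr><td>A</td><td rowspan="2">Jan 1</td><td>X</td></tr>
  <tr><td>B</td><td>Y</td></tr>
  <tr><td colspan="3">Cancelled</td></tr>
</table>
"""
rows = expand_table(html)
for row in rows:
    print(row)
```

Once the grid is rectangular, the first row can serve as column names and the remaining rows as data when building a pandas DataFrame.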

## Installation

You can install toolwiki from [PyPI](https://pypi.org/project/toolwiki/):

    python -m pip install toolwiki

The package is supported on Python 3.7 and above.

## How to use

You can use this package in your own Python code by importing from the `toolwiki` package:

    >>> from toolwiki import tables
    >>> dfs = tables.get_dataframes(url='https://en.wikipedia.org/wiki/List_of_UFC_events', by_class='wikitable', raw=False)
    >>> dfs
    [                                         Event  ...  Ref.
    1                                      UFC 276  ...   [9]
    2                          UFC Fight Night 211  ...  [10]
    ..                                         ...  ...   ...
    14                         UFC Fight Night 202  ...  [20]
    15            UFC Fight Night: Walker vs. Hill  ...  [21]
    [15 rows x 5 columns],        #                                       Event  ... Attendance   Ref.
    1    593          UFC 271: Adesanya vs. Whittaker II  ...     17,872   [22]
    2    592  UFC Fight Night: Hermansson vs. Strickland  ...        N/A   [23]
    ..   ...                                         ...  ...        ...    ...
    601  002                           UFC 2: No Way Out  ...      2,000  [558]
    602  001                        UFC 1: The Beginning  ...      7,800  [559]
    [602 rows x 7 columns]]
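
As the output above shows, the call returns a list of pandas DataFrames, one per matching table. The following is a minimal sketch of one way to work with that list: pick the table with the most rows and parse its `Attendance` column as numbers. It assumes the column holds strings such as `17,872` and `N/A`, as the printed output suggests. The helpers `largest_table` and `attendance_as_number` are hypothetical, and `dfs` here is a small stand-in built from rows visible above so the example runs without network access.

```python
import pandas as pd

# Stand-in for the list returned by tables.get_dataframes(...), built from a
# few rows visible in the printed output so the sketch runs offline.
dfs = [
    pd.DataFrame({
        "Event": ["UFC 276", "UFC Fight Night 211"],
        "Ref.": ["[9]", "[10]"],
    }),
    pd.DataFrame({
        "#": ["593", "592", "002", "001"],
        "Event": [
            "UFC 271: Adesanya vs. Whittaker II",
            "UFC Fight Night: Hermansson vs. Strickland",
            "UFC 2: No Way Out",
            "UFC 1: The Beginning",
        ],
        "Attendance": ["17,872", "N/A", "2,000", "7,800"],
    }),
]


def largest_table(frames):
    """Return the DataFrame with the most rows (hypothetical helper)."""
    return max(frames, key=len)


def attendance_as_number(frame, column="Attendance"):
    """Return a copy with `column` parsed as numbers (hypothetical helper).

    Thousands separators are removed; values that cannot be parsed,
    such as "N/A", become NaN.
    """
    out = frame.copy()
    cleaned = out[column].astype(str).str.replace(",", "", regex=False)
    out[column] = pd.to_numeric(cleaned, errors="coerce")
    return out


past_events = attendance_as_number(largest_table(dfs))
print(past_events[["Event", "Attendance"]])
```

Selecting by row count is only a convenience for this page; on pages where table sizes are similar, indexing the list directly (for example `dfs[1]`) is more predictable.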
    


