Metadata-Version: 2.1
Name: text_extensions_for_pandas
Version: 0.2.0
Summary: Natural language processing support for Pandas dataframes.
Home-page: https://github.com/CODAIT/text-extensions-for-pandas
Author: IBM
Author-email: frreiss@us.ibm.com
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Topic :: Scientific/Engineering
Requires-Python: >=3.6
Description-Content-Type: text/markdown
License-File: LICENSE.txt

<!-- Package description for PyPI -->

Natural language processing support for Pandas DataFrames.

[![Documentation Status](https://readthedocs.org/projects/text-extensions-for-pandas/badge/?version=latest)](https://text-extensions-for-pandas.readthedocs.io/en/latest/?badge=latest)


Text Extensions for Pandas adds [extension types](https://pandas.pydata.org/docs/development/extending.html) to Pandas DataFrames for representing natural
language data, plus a library of functions for working with these extension
types.

## Features

### SpanArray: A Pandas extension type for *spans* of text

* Connect features with regions of a document
* Visualize the internal data of your NLP application
* Analyze the accuracy of your models
* Combine the results of multiple models

### TensorArray: A Pandas extension type for tensors

* Represent BERT embeddings in a Pandas series
* Store logits and other feature vectors in a Pandas series
* Store an entire time series in each cell of a Pandas series

### Pandas front-ends for popular NLP toolkits

* [SpaCy](https://spacy.io/)
* [Transformers](https://github.com/huggingface/transformers)
* [IBM Watson Natural Language Understanding](https://www.ibm.com/cloud/watson-natural-language-understanding)
* [IBM Watson Discovry Table Understanding](https://cloud.ibm.com/docs/discovery-data?topic=discovery-data-understanding_tables)

## Documentation

For examples of how to use the library, take a look at the notebooks in 
[this directory](https://github.com/CODAIT/text-extensions-for-pandas/tree/master/notebooks).

API documentation can be found at [https://text-extensions-for-pandas.readthedocs.io/en/latest/](https://text-extensions-for-pandas.readthedocs.io/en/latest/)

## Source Code

The source code for Text Extensions for Pandas is available at [https://github.com/CODAIT/text-extensions-for-pandas](https://github.com/CODAIT/text-extensions-for-pandas).

We welcome code and documentation contributions!  See the [README file](https://github.com/CODAIT/text-extensions-for-pandas/blob/master/README.md#contributing) 
for more information on contributing.








