Metadata-Version: 2.1
Name: oagdedupe
Version: 0.2.0
Summary: oagdedupe is a Python library for scalable entity resolution, using active learning to learn blocking configurations, generate comparison pairs, then clasify matches.
Home-page: https://github.com/chansooligans/oagdedupe
Keywords: dedupe,entity resolution,record linkage,blocking
Author: Chansoo Song
Requires-Python: >=3.8,<3.11
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Provides-Extra: book
Requires-Dist: Faker (>=13.15.1,<14.0.0)
Requires-Dist: SQLAlchemy (>=1.4.39,<2.0.0)
Requires-Dist: Sphinx (>=5.1.1,<6.0.0,!=5.2.0.post0)
Requires-Dist: autodocsumm (>=0.2.9,<0.3.0); extra == "book"
Requires-Dist: diagrams (>=0.21.1,<0.22.0)
Requires-Dist: fastapi[all] (>=0.79.0,<0.80.0)
Requires-Dist: flake8 (>=4.0.1,<5.0.0)
Requires-Dist: graphviz (>=0.19.0,<0.20.0)
Requires-Dist: ipykernel (>=6.13.0,<7.0.0)
Requires-Dist: jellyfish (>=0.9.0,<0.10.0)
Requires-Dist: jupytext (>=1.14.1,<2.0.0); extra == "book"
Requires-Dist: matplotlib (>=3.5.1,<4.0.0)
Requires-Dist: modAL (>=0.4.1,<0.5.0)
Requires-Dist: myst-parser (>=0.18.0,<0.19.0)
Requires-Dist: nbconvert (>=6.5.1,<7.0.0)
Requires-Dist: networkx (>=2.8,<3.0)
Requires-Dist: numpy (>=1.22.1,<2.0.0)
Requires-Dist: pandas (>=1.4.2,<2.0.0)
Requires-Dist: pathos (>=0.2.9,<0.3.0)
Requires-Dist: protobuf (>=3.20.2,<4.0.0)
Requires-Dist: psycopg2-binary (>=2.9.3,<3.0.0)
Requires-Dist: pydantic (>=1.10.2,<2.0.0)
Requires-Dist: pytest (>=7.1.2,<8.0.0)
Requires-Dist: ray (>=1.13.0,<2.0.0)
Requires-Dist: scikit-learn (>=1.0.2,<2.0.0)
Requires-Dist: seaborn (>=0.11.2,<0.12.0)
Requires-Dist: sphinx-rtd-theme (>=1.0.0,<2.0.0)
Requires-Dist: streamlit (>=1.11.1,<2.0.0)
Requires-Dist: streamlit-aggrid (>=0.2.3,<0.3.0)
Requires-Dist: tqdm (>=4.58.0,<5.0.0)
Project-URL: Documentation, https://deduper.readthedocs.io/en/latest/
Project-URL: Repository, https://github.com/chansooligans/oagdedupe
Description-Content-Type: text/markdown

# oagdedupe  

oagdedupe is a Python library for scalable entity resolution, using active 
learning to learn blocking configurations, generate comparison pairs, 
then clasify matches. 

# installation<a name="#installation"></a>

```
# PyPI
pip install oagdedupe
```

# documentation<a name="#documentation"></a>

You can find the documentation of oagdedupe at https://deduper.readthedocs.io/en/latest/, 
where you can find: 

- [Installation and Getting Started](https://deduper.readthedocs.io/en/latest/usage/installation.html)
- [Examples](https://deduper.readthedocs.io/en/latest/examples/example_dedupe.html)
- the [API reference](https://deduper.readthedocs.io/en/latest/dedupe/api.html)
- [User Guide / Methodology](https://deduper.readthedocs.io/en/latest/userguide/intro.html)


