Metadata-Version: 2.1
Name: scandi-reddit
Version: 0.1.0
Summary: Construction of a Scandinavian Reddit dataset.
License: MIT
Author: Dan Saattrup Nielsen
Author-email: dan.nielsen@alexandra.dk
Requires-Python: >=3.8,<3.11
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Requires-Dist: datasets (>=2.7.1,<3.0.0)
Requires-Dist: datasketch (>=1.5.8,<2.0.0)
Requires-Dist: luga (>=0.2.6,<0.3.0)
Requires-Dist: more-itertools (>=9.0.0,<10.0.0)
Requires-Dist: nlp-dedup (>=0.1.0,<0.2.0)
Requires-Dist: pandas (>=1.5.2,<2.0.0)
Requires-Dist: scikit-learn (>=1.2.0,<2.0.0)
Requires-Dist: wget (>=3.2,<4.0)
Requires-Dist: zstandard (>=0.19.0,<0.20.0)
Description-Content-Type: text/markdown

# ScandiReddit

Construction of a Scandinavian Reddit dataset.

______________________________________________________________________
[![Documentation](https://img.shields.io/badge/docs-passing-green)](https://alexandrainst.github.io/ScandiReddit/scandi_reddit.html)
[![License](https://img.shields.io/github/license/alexandrainst/ScandiReddit)](https://github.com/alexandrainst/ScandiReddit/blob/main/LICENSE)
[![LastCommit](https://img.shields.io/github/last-commit/alexandrainst/ScandiReddit)](https://github.com/alexandrainst/ScandiReddit/commits/main)
[![Code Coverage](https://img.shields.io/badge/Coverage-0%25-red.svg)](https://github.com/alexandrainst/ScandiReddit/tree/main/tests)


Developers:

- Dan Saattrup Nielsen (dan.nielsen@alexandra.dk)


# Installation

The package is available on PyPI, so you can install the package using your favourite
package manager. For instance, `pip install scandi_reddit` or `poetry add
scandi_reddit`.


# Quick start

Simply run the command `build` in the terminal to build the dataset. This takes quite a
while! See `$ build --help` for more information on all the settings.

