# -*- coding: utf-8 -*-
#
# Author: Daniel Yang <daniel.yj.yang@gmail.com>
#
# License: BSD 3 clause


Disclaimer:

- The datasets are shared with the sole intention to provide the convenience of accessing publicly available datasets and reproducing/comparing results.
- They are shared under a good-faith understanding that they are widely viewed and accepted as public-domain datasets.
- If there is any misunderstanding, please contact the author.
- The author does not own any of these datasets.
- The readme in respective folder (or related Internet link) should be followed for citation/license requirements.

Datasets:

1. SMS_Spam_Collection

URL: https://archive.ics.uci.edu/ml/datasets/SMS+Spam+Collection
Download Date: 2020-09-23
Dataset description: "This corpus has been collected from free or free for research sources at the Internet"

2. Social_Network_Ads

URL: https://www.kaggle.com/akram24/social-network-ads/activity
Download Date: 2020-09-23
License: CC0: Public Domain

3. NLTK (Natural Language Toolkit) data

Download Date: 2020-09-25
Download commands:
    nltk.download('punkt', quiet=True)
    nltk.download('wordnet', quiet=True)
The original zip files (punkt.zip and wordnet.zip) were removed.

Here is the rule regarding redistributing of nltk data (see https://github.com/nltk/nltk):
- NLTK source code is distributed under the Apache 2.0 License.
- NLTK documentation is distributed under the Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States license.
- NLTK corpora are provided under the terms given in the README file for each corpus; all are redistributable and available for non-commercial use.
- NLTK may be freely redistributed, subject to the provisions of these licenses.

4. Scikit-Learn data

Download Date: 2020-09-25
Download commands:
    fetch_20newsgroups(subset='all')

5. Fashion MNIST

URL: https://github.com/zalandoresearch/fashion-mnist
Download Date: 2020-10-01
License: MIT

6. bank_note_authentication (more precisely, counterfeit)

URL: https://archive.ics.uci.edu/ml/datasets/banknote+authentication
Download Date: 2020-10-27

7. R_ISLR

URL: https://cran.r-project.org/web/packages/ISLR/index.html
Download Date: 2020-10-28
License: GPL-2

8. R_datarium

URL: https://cran.r-project.org/web/packages/datarium/index.html
Download Date: 2020-10-28
License: GPL-2

9. ML_for_Hackers

URL: https://github.com/johnmyleswhite/ML_for_Hackers
Download Date: 2020-11-02
License: FreeBSD
