Metadata-Version: 2.1
Name: cmtt
Version: 0.1.0
Summary: A library for processing Code Mixed Text. Still in development!
Home-page: https://cmtt.readthedocs.io/
Author: Reuben Devanesan
Author-email: reubendevanesan@gmail.com
License: MIT
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown

[![forthebadge made-with-python](http://ForTheBadge.com/images/badges/made-with-python.svg)](https://www.python.org/)



<div align = center>

<img width="600px" height="180px" src= "https://user-images.githubusercontent.com/76529011/185376373-787f65d5-b78b-4f11-a7fb-e9aa19dc3a04.png">

</div>



-----------------------------------------

[![code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)![Compatibility](https://img.shields.io/badge/compatible%20with-python3.9.x-blue.svg)



CMTT is a wrapper library that makes code-mixed text processing more efficient than ever. More documentation incoming!



### Installation

```

pip install cmtt

```



### Get started

How to use this library:



```Python

from cmtt.data import *



# Loading json files

result_json = load_url('https://world.openfoodfacts.org/api/v0/product/5060292302201.json')



# Loading csv files

result_csv = load_url('https://gist.githubusercontent.com/rnirmal/e01acfdaf54a6f9b24e91ba4cae63518/raw/b589a5c5a851711e20c5eb28f9d54742d1fe2dc/datasets.csv')



# List the key properties available for the datasets provided by the cmtt library

keys = list_dataset_keys()



# List all datasets provided by cmtt

# Specifying the 'key' property allows to return the dataset names with the respective 'key' value

# Specifying the 'key' as 'all' returns all the information pertaining to all the datasets

data = list_cmtt_datasets()



# Download multiple datasets provided by cmtt, returning a list of paths where the datasets get downloaded

# The Datasets are downloaded into a new 'cmtt' directory inside the root directory of the operating system

lst = download_cmtt_datasets(["linc_ner_hineng", "L3Cube_HingLID_all", "linc_lid_spaeng"])



# Download a dataset from a url, returning the path where the dataset gets downloaded

# The Dataset is downloaded into a new directory 'datasets' inside the current working directory

path = download_dataset_url('https://world.openfoodfacts.org/api/v0/product/5060292302201.json')

```



### Contributors

 - [Paras Gupta](https://github.com/paras-gupt)

 - [Tarun Sharma](https://github.com/tarun2001sharma)

 - [Reuben Devanesan](https://github.com/Reuben27)
