Metadata-Version: 2.1
Name: eda-and-clean
Version: 0.0.1
Summary: A package of automation tools for EDA and cleaning
Home-page: https://github.com/aravindganesan88/eda_and_clean
Author: Aravind Ganesan
Author-email: 1988.aravind@gmail.com
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.7
Description-Content-Type: text/markdown
License-File: LICENCE.txt

--
## Project Description

The project provides a collection of automation tools for commonly used EDA and cleaning techniques. Instead of reinventing the wheel, the project also attempts to bring in other similar open source automation tools. For example, the library imports from klib and eda_and_beyond.

The package consists of three modules, 

1. EDA - Contains a class which provides automation for detecting datat types, listing null values, identifying redundant entries, basic data analysis among others

2. clean - Contains a class which provides options for the user to perform cleaning tasks such as cleaning string columns, downcasting data types, assigning proper data types, making certain entries as na as necessary, dropping duplicates. More importantly it also tracks all the changes made in a easy to read format and the rows and columns it impacted so that the user can easily keep track of changes. 

3. chart - Contains functions for plotting purposes with embedded formatting.

This is an open-source project. Contributors are welcome. :) 

