Metadata-Version: 2.1
Name: data-algebra
Version: 1.6.3
Summary: data_algebra is a data manipulation language that can both generate SQL queries and work on Pandas DataFrames.
Home-page: https://github.com/WinVector/data_algebra
Author: John Mount
Author-email: jmount@win-vector.com
License: License :: OSI Approved :: BSD 3-clause License
Platform: any
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: License :: OSI Approved :: BSD License
Requires-Python: >=3.7
Description-Content-Type: text/markdown
Provides-Extra: pretty_python
Provides-Extra: BigQuery
Provides-Extra: PostgreSQL
Provides-Extra: MySQL
Provides-Extra: Spark
Provides-Extra: Polars
Provides-Extra: all
License-File: LICENSE


[data_algebra](https://github.com/WinVector/data_algebra) is a piped data wrangling system
based on Codd's relational algebra and experience working with data manipulation languages at scale.  
The primary purpose of the package is to support an easy-to-compose and easy-to-maintain 
grammar of data processing steps that can in turn be used to generate
database-specific SQL. The package also implements the same transforms for Pandas and Polars DataFrames.
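
The idea of describing a pipeline once and then either rendering it as SQL or applying it to in-memory data can be sketched with the standard library alone. The example below is a toy illustration and is not the data_algebra API: the `Extend` and `SelectRows` classes and the `q`, `to_sql`, and `transform` helpers are hypothetical names invented for this sketch, and lists of dictionaries stand in for DataFrames.

```python
import sqlite3
from dataclasses import dataclass
from typing import Dict, List, Sequence, Union


@dataclass(frozen=True)
class Extend:
    """Hypothetical step: add column `new_col` = `left` + `right`."""
    new_col: str
    left: str
    right: str


@dataclass(frozen=True)
class SelectRows:
    """Hypothetical step: keep rows where `col` > `threshold`."""
    col: str
    threshold: int


Step = Union[Extend, SelectRows]


def q(name: str) -> str:
    """Double-quote a SQL identifier."""
    return '"' + name.replace('"', '""') + '"'


def to_sql(steps: Sequence[Step], table: str, columns: Sequence[str]) -> str:
    """Render the pipeline as nested sub-queries, one per step."""
    sql = f"SELECT {', '.join(q(c) for c in columns)} FROM {q(table)}"
    for i, step in enumerate(steps):
        alias = q(f"s{i}")
        if isinstance(step, Extend):
            expr = f"{q(step.left)} + {q(step.right)} AS {q(step.new_col)}"
            sql = f"SELECT *, {expr} FROM ({sql}) AS {alias}"
        else:
            cond = f"{q(step.col)} > {int(step.threshold)}"
            sql = f"SELECT * FROM ({sql}) AS {alias} WHERE {cond}"
    return sql


def transform(steps: Sequence[Step], rows: List[Dict[str, int]]) -> List[Dict[str, int]]:
    """Apply the same pipeline to in-memory rows."""
    out = [dict(r) for r in rows]
    for step in steps:
        if isinstance(step, Extend):
            out = [{**r, step.new_col: r[step.left] + r[step.right]} for r in out]
        else:
            out = [r for r in out if r[step.col] > step.threshold]
    return out


# One pipeline description, two execution targets.
steps = [Extend("z", "x", "y"), SelectRows("z", 5)]
rows = [{"x": 1, "y": 2}, {"x": 3, "y": 4}]

# Target 1: in-memory data.
local_result = transform(steps, rows)

# Target 2: generated SQL, run here against an in-memory SQLite table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE d (x INTEGER, y INTEGER)")
conn.executemany("INSERT INTO d VALUES (?, ?)", [(r["x"], r["y"]) for r in rows])
sql_result = conn.execute(to_sql(steps, "d", ["x", "y"])).fetchall()
conn.close()

print(local_result)
print(sql_result)
```

Both targets keep only the row where `x + y` exceeds 5. Keeping the steps as plain data is what lets a single description drive more than one backend; the package's own operators and SQL generation are considerably richer than this sketch.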

Currently the system is primarily adapted and tested for Pandas, Polars, Google BigQuery, PostgreSQL, SQLite, Spark, and
MySQL.

[R](https://www.r-project.org) versions of the system are available as 
the [rquery](https://github.com/WinVector/rquery) and 
[rqdatatable](https://github.com/WinVector/rqdatatable) packages.
