Metadata-Version: 2.1
Name: datalake-bundle
Version: 0.5.0a11
Summary: DataLake tables management bundle for the Bricksflow Framework
Home-page: https://github.com/bricksflow/datalake-bundle
License: MIT
Author: Jiri Koutny
Author-email: jiri.koutny@datasentics.com
Requires-Python: >=3.7,<4.0
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Requires-Dist: console-bundle (>=0.3.1,<0.4.0)
Requires-Dist: databricks-bundle (>=0.7.0a1)
Requires-Dist: injecta (>=0.9.1,<0.10.0)
Requires-Dist: pyfony-bundles (>=0.3.2,<0.4.0)
Requires-Dist: simpleeval (>=0.9.10,<1.0.0)
Project-URL: Repository, https://github.com/bricksflow/datalake-bundle
Description-Content-Type: text/markdown

# Datalake bundle

![Notebook functions](./docs/notebook-functions.png)

This bundle provides everything you need to create and manage a Databricks-based DataLake(House):

* Tools to simplify and automate table creation, updates, and migrations
* Explicit table schema enforcement for Hive tables, CSVs, and more
* Decorators for writing maintainable, self-documenting function-based notebooks
* Rich configuration options to customize naming standards, paths, and almost anything else to match your needs

## Installation

Install the bundle via Poetry:

```
$ poetry add datalake-bundle
```
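
After installing, you may want to confirm that the package is visible to the interpreter your notebooks or jobs run on. The following is a minimal sketch using only the standard library; `installed_version` is a hypothetical helper written for this example and is not part of the bundle. It assumes Python 3.8+, where `importlib.metadata` is in the standard library (on Python 3.7 the `importlib_metadata` backport provides the same functions).

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper for illustration only; not provided by datalake-bundle.
    """
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "datalake-bundle" is the distribution name used in the install command.
    found = installed_version("datalake-bundle")
    if found is None:
        print("datalake-bundle is not installed in this environment")
    else:
        print(f"datalake-bundle {found} is installed")
```

Returning `None` instead of raising keeps the check usable in setup scripts that need to branch on whether the bundle is present.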

## Usage

1. [Recommended notebook structure](docs/structure.md)
1. [Defining DataLake tables](docs/tables.md)
1. [Using datalake-specific notebook functions](docs/notebook-functions.md)
1. [Using table-specific configuration](docs/configuration.md)
1. [Tables management](docs/tables-management.md)
1. [Parsing fields from a table identifier](docs/parsing-fields.md)

