Metadata-Version: 2.1
Name: skyplane-dev
Version: 0.3.0
Summary: Skyplane efficiently transports data between cloud regions and providers.
Home-page: https://skyplane.org/
License: Apache-2.0
Author: Skyplane authors
Author-email: skyplaneproject@gmail.com
Requires-Python: >=3.7.1,<3.12
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Provides-Extra: all
Provides-Extra: aws
Provides-Extra: azure
Provides-Extra: gateway
Provides-Extra: gcp
Provides-Extra: solver
Requires-Dist: azure-identity (>=1.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-mgmt-authorization (>=1.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-mgmt-compute (>=24.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-mgmt-network (>=10.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-mgmt-resource (>=11.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-mgmt-storage (>=11.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-mgmt-subscription (>=1.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: azure-storage-blob (>=12.0.0) ; extra == "azure" or extra == "all"
Requires-Dist: boto3 (>=1.16.0) ; extra == "aws" or extra == "all"
Requires-Dist: cachetools (>=4.1.0)
Requires-Dist: cryptography (>=1.4.0)
Requires-Dist: cvxpy[cvxopt] (>=1.1.0) ; extra == "solver"
Requires-Dist: flask (>=2.1.2,<3.0.0) ; extra == "gateway"
Requires-Dist: google-api-python-client (>=2.0.2) ; extra == "gcp" or extra == "all"
Requires-Dist: google-auth (>=2.0.0) ; extra == "gcp" or extra == "all"
Requires-Dist: google-cloud-compute (>=1.0.0) ; extra == "gcp" or extra == "all"
Requires-Dist: google-cloud-dataproc (>=5.4.0,<6.0.0)
Requires-Dist: google-cloud-storage (>=1.30.0) ; extra == "gcp" or extra == "all"
Requires-Dist: graphviz (>=0.15) ; extra == "solver"
Requires-Dist: lz4 (>=4.0.0,<5.0.0) ; extra == "gateway"
Requires-Dist: matplotlib (>=3.0.0) ; extra == "solver"
Requires-Dist: numpy (>=1.19.0) ; extra == "solver"
Requires-Dist: pandas (>=1.0.0)
Requires-Dist: paramiko (>=2.7.2)
Requires-Dist: pyarrow (>=10.0.1,<11.0.0)
Requires-Dist: pynacl (>=1.5.0,<2.0.0) ; extra == "gateway"
Requires-Dist: pyopenssl (>=22.0.0,<23.0.0) ; extra == "gateway"
Requires-Dist: questionary (>=1.8.0)
Requires-Dist: requests (>=2.23.0)
Requires-Dist: rich (>=9.0.0)
Requires-Dist: sshtunnel (>=0.3.0)
Requires-Dist: typer (>=0.4.0)
Requires-Dist: werkzeug (>=2.1.2,<3.0.0) ; extra == "gateway"
Project-URL: Documentation, https://skyplane.org/
Project-URL: Repository, https://github.com/skyplane-project/skyplane
Description-Content-Type: text/markdown

<picture>
    <source srcset="docs/_static/logo-dark-mode.png" media="(prefers-color-scheme: dark)">
    <img src="docs/_static/logo-light-mode.png" width="300" />
</picture>

[![Join Slack](https://img.shields.io/badge/-Join%20Skyplane%20Slack-blue?logo=slack)](https://join.slack.com/t/skyplaneworkspace/shared_invite/zt-1cxmedcuc-GwIXLGyHTyOYELq7KoOl6Q)
[![integration-test](https://github.com/skyplane-project/skyplane/actions/workflows/integration-test.yml/badge.svg)](https://github.com/skyplane-project/skyplane/actions/workflows/integration-test.yml)
[![docker](https://github.com/skyplane-project/skyplane/actions/workflows/docker-publish.yml/badge.svg)](https://github.com/skyplane-project/skyplane/actions/workflows/docker-publish.yml)
[![docs](https://readthedocs.org/projects/skyplane/badge/?version=latest)](https://skyplane.readthedocs.io/en/latest/?badge=latest)

**🔥 Blazing fast bulk data transfers between any cloud 🔥**

Skyplane is a tool for blazingly fast bulk data transfers between object stores in the cloud. It provisions a fleet of VMs in the cloud to transfer data in parallel while using compression and bandwidth tiering to reduce cost.

Skyplane is:
1. 🔥 Blazing fast ([110x faster than AWS DataSync](https://skyplane.org/en/latest/benchmark.html))
2. 🤑 Cheap (4x cheaper than rsync)
3. 🌐 Universal (AWS, Azure and GCP)

You can use Skyplane to transfer data: 
* between object stores within a cloud provider (e.g. AWS us-east-1 to AWS us-west-2)
* between object stores across multiple cloud providers (e.g. AWS us-east-1 to GCP us-central1)
* between local storage and cloud object stores (experimental)

Skyplane currently supports the following source and destination endpoints (any source and destination can be combined): 

| Endpoint           | Source             | Destination        |
|--------------------|--------------------|--------------------|
| AWS S3             | :white_check_mark: | :white_check_mark: |
| Google Storage     | :white_check_mark: | :white_check_mark: |
| Azure Blob Storage | :white_check_mark: | :white_check_mark: |
| Local Disk         | :white_check_mark: | (in progress)      |

Skyplane is an actively developed project. It will have 🔪 SHARP EDGES 🔪. Please file an issue or ask the contributors via [the #help channel on our Slack](https://join.slack.com/t/skyplaneworkspace/shared_invite/zt-1cxmedcuc-GwIXLGyHTyOYELq7KoOl6Q) if you encounter bugs.

# Resources 
- [Quickstart](#quickstart)
- [Contributing](https://skyplane.org/en/latest/contributing.html)
- [Roadmap](https://skyplane.org/en/latest/roadmap.html)
- [Slack Community](https://join.slack.com/t/skyplaneworkspace/shared_invite/zt-1cxmedcuc-GwIXLGyHTyOYELq7KoOl6Q)

# Quickstart

## 1. Installation
We recommend installation from PyPi:
```
$ pip install "skyplane[aws]"

# install support for other clouds as needed:
#   $ pip install "skyplane[azure]"
#   $ pip install "skyplane[gcp]"
#   $ pip install "skyplane[all]"
```

Skyplane supports AWS, Azure, and GCP. You can install Skyplane with support for one or more of these clouds by specifying the corresponding extras. To install two out of three clouds, you can run `pip install "skyplane[aws,azure]"`.

*GCP support on the M1 Mac*: If you are using an M1 Mac with the arm64 architecture and want to install GCP support for Skyplane, you will need to install as follows
`GRPC_PYTHON_BUILD_SYSTEM_OPENSSL=1 GRPC_PYTHON_BUILD_SYSTEM_ZLIB=1 pip install "skyplane[aws,gcp]"`

## 2. Setup Cloud Credentials 

Skyplane needs access to cloud credentials to perform transfers. To get started with setting up credentials, make sure you have cloud provider CLI tools installed:

```
---> For AWS:
$ pip install awscli

---> For Google Cloud:
$ pip install gcloud

---> For Azure:
$ pip install azure
```
Once you have the CLI tools setup, log into each cloud provider's CLI:
```
---> For AWS:
$ aws configure

---> For Google Cloud:
$ gcloud auth application-default login

---> For Azure:
$ az login
```
After authenticating with each cloud provider, you can run `skyplane init` to create a configuration file for Skyplane.

```bash
$ skyplane init
```
<details>
<summary>skyplane init output</summary>
<br>

```
$ skyplane init

====================================================
 _____ _   ____   _______ _       ___   _   _  _____
/  ___| | / /\ \ / / ___ \ |     / _ \ | \ | ||  ___|
\ `--.| |/ /  \ V /| |_/ / |    / /_\ \|  \| || |__
 `--. \    \   \ / |  __/| |    |  _  || . ` ||  __|
/\__/ / |\  \  | | | |   | |____| | | || |\  || |___
\____/\_| \_/  \_/ \_|   \_____/\_| |_/\_| \_/\____/
====================================================


(1) Configuring AWS:
    Loaded AWS credentials from the AWS CLI [IAM access key ID: ...XXXXXX]
    AWS region config file saved to /home/ubuntu/.skyplane/aws_config

(2) Configuring Azure:
    Azure credentials found in Azure CLI
    Azure credentials found, do you want to enable Azure support in Skyplane? [Y/n]: Y
    Enter the Azure subscription ID: [XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX]:
    Azure region config file saved to /home/ubuntu/.skyplane/azure_config
    Querying for SKU availbility in regions
    Azure SKU availability cached in /home/ubuntu/.skyplane/azure_sku_mapping

(3) Configuring GCP:
    GCP credentials found in GCP CLI
    GCP credentials found, do you want to enable GCP support in Skyplane? [Y/n]: Y
    Enter the GCP project ID [XXXXXXX]:
    GCP region config file saved to /home/ubuntu/.skyplane/gcp_config

Config file saved to /home/ubuntu/.skyplane/config
```

</details>

## 3. Run Transfers 

We’re ready to use Skyplane! Let’s use `skyplane cp` to copy files from AWS to GCP:
```
skyplane cp s3://... gs://...
```
To transfer only new objects, you can instead use `skyplane sync`:
```
$ skyplane sync s3://... gs://...
```

You can configure Skyplane to use more VMs per region with the `-n` flag. For example, to double the transfer speed with two VMs, run: 
```
$ skyplane cp -r s3://... s3://... -n 2
```

## 4. Clean Up 
Skyplane will automatically attempt to terminate VMs that it starts, but to double check and forcefuly terminate all VMs, run `skyplane deprovision`.

# Technical Details
Skyplane is based on research at UC Berkeley into accelerated networks between cloud providers. Under the hood, Skyplane starts a fleet of VMs in the source and destination regions. It then uses a custom TCP protocol to accelerate the transfer between the VMs. Skyplane may use a L7 overlay network to route traffic around congested network hot spots. 

<img src="docs/_static/skyplane-data-plane.png" width="384" />

For more details on Skyplane, see: 
- [Technical Talk](https://skyplane.org/en/latest/architecture.html)
- [NSDI '23 Paper](https://arxiv.org/abs/2210.07259)



