Metadata-Version: 2.1
Name: sciviewer
Version: 0.11
Summary: Tool for interactive exploration of 2D embeddings in Jupyter
Home-page: https://github.com/colabobio/sciviewer
Author: Andres Colubri, Dylan Kotliar
Author-email: andres@broadinstitute.org, dylan_kotliar@hms.harvard.edu
License: MIT
Description: # Single Cell Interactive Viewer (sciviewer)
        
        This is an interactive viewer for 2D embeddings such as [UMAP](https://umap-learn.readthedocs.io/en/latest/) or [tSNE](https://lvdmaaten.github.io/tsne/) of high dimensional single-cell RNA-seq data that is run directly out of the [Jupyter Notebook](https://jupyter.org/) environment. The user can select cells in the 2D plane and the viewer will calculate the differential expression between the selected and the unselected cells. Alternatively, the user can select a group of cells and a direction and the viewer will identify the genes with the greatest variation (Pearson correlation) along that direction. See snapshots of how this works below as well as the examples included in this repository (a small example of [3000 PBMCs](./examples/sciviewer_example_3K_PBMC.ipynb) and a larger one of [50,000 circulating T-cells](./examples/sciviewer_example_50K_Tcell.ipynb)).
        
        ## Installation
        
        The main requirement for sciviewer is [py5](http://py5.ixora.io/install/) which in turn requires Python 3.8. We recommend using the conda package manager to install the necessary dependencies fo sciviewer. Conda can be installed following the instructions [here](https://docs.conda.io/en/latest/miniconda.html). Then follow the steps below to install sciviewer.
        
        1. Prepare and activate the conda environment containing dependencies for py5:
        
        ```
        conda env create -n sciviewer -f https://raw.githubusercontent.com/colabobio/sciviewer/master/sciviewer-env.yml
        conda activate sciviewer
        ```
        Alternatively, if you want to append the needed dependencies to an existing conda environment, instead of creating a new one, you can do the following:
        
        ```
        conda env update -n your_existing_environment -f https://raw.githubusercontent.com/colabobio/sciviewer/master/sciviewer-env.yml
        conda activate your_existing_environment
        ```
        
        2. Next, install the sciviewer package using pip:
        ```
        pip install sciviewer
        ```
        
        And that is it, the module is now installed and ready to be used.
        
        To uninstall, use:
        ```
        pip uninstall sciviewer
        ```
        
        3. Now launch jupyter from within the activated conda environment and you are good to go.
        
        ```
        jupyter lab
        ```
        
        ## Quick start
        
        Sciviewer is executed from a Jupyter notebook such as in the examples directory. It is run by initializing a SCIViewer object with the 2D embedding (# cells X 2) and the expression data (# cells X # genes) and then running the explore_data method. E.g.
        
        ```
        from sciviewer import SCIViewer
        svobj = SCIViewer(umap, expr)
        svobj.explore_data()
        ```
        
        Running the code above will cause the visualizer to appear. Click the video link below for a ~3 minute tutorial on how to use the visualizer:
        
        [![Watch the video](https://img.youtube.com/vi/YgvMmvgFFE0/maxresdefault.jpg)](https://youtu.be/YgvMmvgFFE0)
        
        ## Some key usage points:
         - The expression data can be provided as a Pandas DataFrame, a Numpy ndarray, or as a scipy sparse [csc_matrix](https://docs.scipy.org/doc/scipy/reference/generated/scipy.sparse.csr_matrix.html).
         - Providing the data as a csc_matrix is recommended for large datasets as it can lead to a considerable (1-2 order or magnitude) performance speedup. See [this notebook](./examples/sciviewer_example_50K_Tcell.ipynb) as an example.
         - If the expression data is provided as a Pandas DataFrame, the cell names are inferred from the index and the gene names are inferred from the columns. Otherwise, the gene names and cell names can be provided when initializing the SCIViewer class with the gene_names and cell_names arguments.
         - The selected_cells attribute of the sciviewer object is updated whenever a new set of cells are selected, regardless of the mode, and contains information about the selected cells. The results_proj_correlation attribute of the sciviewer object is updated whenever a new selection is made in the "directional" mode and contains the Pearson correlation and P-values of all genes for the selected direction and cells. The results_diffexpr attribute is updated when a new selection is made in the "differential" mode and contains the T-statistic and P-value for the differential expression test (simple Welch's T-test). These are updated in real time as the visualizer is in use.
        
        See the example notebooks for more details
        
        ## Development / debugging
        
        For development purposes, it can be helpful to import sciviewer directly rather than installing the package. See the extras/debugging directory for notebooks with examples of how to do this e.g. [debug_example_3K_PBMC.ipynb](extras/debugging/debug_example_3K_PBMC.ipynb).
        
        
Platform: UNKNOWN
Description-Content-Type: text/markdown
