Metadata-Version: 2.1
Name: tasmanian-mismatch
Version: 0.1.3
Summary: Tasmanian tool to analyze mismatches by read and position in high-throughput sequencing data
Home-page: https://github.com/nebiolabs/tasmanian-mismatch
Author: Ariel Erijman and Brad Langhorst
Author-email: aerijman@neb.com
License: GNU
Description: <!--[![Build Status](https://travis-ci.org/fulcrumgenomics/fgbio.svg?branch=master)](https://travis-ci.org/fulcrumgenomics/fgbio)-->
        [![CircleCI](https://circleci.com/gh/nebiolabs/tasmanian/tree/master.svg?style=svg)](https://circleci.com/gh/nebiolabs/tasmanian/tree/master)
        [![Bioconda](https://img.shields.io/conda/dn/bioconda/fgbio.svg?label=Bioconda)](http://bioconda.github.io/recipes/fgbio/README.html)
        [![License](http://img.shields.io/badge/license-GNU-blue.svg)](https://github.com/nebiolabs/tasmanian/blob/master/LICENSE)
        [![Language](https://img.shields.io/badge/Made%20with-Python-1f425f.svg)](https://www.python.org)
        
        ![Image of Tasmanian Devil](https://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Sarcophilus_harrisii_taranna.jpg/512px-Sarcophilus_harrisii_taranna.jpg)
        
        Tasmanian
        ====
            
        A tool for analyzing reference mismatches in high-throughput sequencing data from DNA samples. Unlike other tools, it can evaluate the portions of reads that overlap specified regions (e.g. repeats).
        
        
        <!---toc start-->
          * [Goals](#goals)
          * [Contributing](#contributing)
          * [Authors](#authors)
          * [License](#license)
        
        <!---toc end-->
        
        # Goals
        
        The main goal is to identify systematic mismatches that might confound SNPs or other variations that should or should not be associated with biological outcomes. We noticed that a set of regions, which are not necessarily misplaced in the reference genome, has a dramatic effect on this analysis. We therefore provide a way of splitting the reads that intersect these regions and reporting the information in separate tables, so that neither intersecting nor non-intersecting reads are filtered out. This also gives the researcher a more accurate picture of how these regions influence the observed artifacts.
        
        ### Overview of Tasmanian use:
        
        ```bash
        samtools view bam | run_intersections [OPTIONS] | run_tasmanian [OPTIONS]
        ```
        
        1. Classification of each base of the read as overlapping (either contained or boundary; see figure below) or non-overlapping with the regions of interest listed in a bed/bedgraph file.
        2. Positional analysis of artifacts, split by read 1 and read 2.
        
        <img src="https://github.com/nebiolabs/tasmanian/raw/master/figures/intersections_tasmanian.jpg" alt="drawing" width="600"/>
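        
        The per-base classification in step 1 can be pictured with a minimal Python sketch. This is not Tasmanian's code: `classify_bases` is a hypothetical helper, reads are assumed to be ungapped, and the sketch assumes that "contained" means the whole read lies inside a region while "boundary" means the read crosses a region edge.
        
        ```python
        from typing import List, Tuple

        Region = Tuple[int, int]  # half-open [start, end) interval, as in BED


        def classify_bases(read_start: int, read_len: int, regions: List[Region]) -> List[str]:
            """Label each base of an ungapped read (hypothetical helper, not Tasmanian's API)."""
            read_end = read_start + read_len
            labels = ["non_overlapping"] * read_len
            for start, end in regions:
                lo, hi = max(read_start, start), min(read_end, end)
                if lo >= hi:
                    continue  # the read does not touch this region
                # Assumption: a read fully inside the region is "contained";
                # a read that crosses a region edge is "boundary".
                fully_inside = start <= read_start and read_end <= end
                kind = "contained" if fully_inside else "boundary"
                for pos in range(lo, hi):
                    labels[pos - read_start] = kind
            return labels


        # A 10-base read starting at 95 against a region spanning [100, 200):
        # the first 5 bases fall outside the region, the last 5 inside it.
        labels = classify_bases(95, 10, [(100, 200)])
        ```
        
        Keeping a label per base, rather than per read, is what allows the non-overlapping part of a boundary read to be counted instead of being discarded with the rest of the read.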
        
        ---
        
        ### The output includes tables to manipulate and plot the data, and a built-in report for fast access to the data (see figure below).
        
        <img src="https://github.com/nebiolabs/tasmanian/raw/master/figures/snapshot_good.jpg" alt="drawing" width="800"/>
        <!--![snapshot of results html file](./figures/snapshot.jpg =100x20)-->
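        
        As a rough illustration of the kind of table the positional analysis could feed, the sketch below tallies mismatches by read number and position. It is a simplified assumption rather than Tasmanian's implementation: `tally_mismatches` is a hypothetical function, alignments are taken to be ungapped with the read and reference sequences already paired up, and reverse-strand handling is left out. Only the SAM flag bits `0x40` (first in pair) and `0x80` (second in pair) come from the SAM specification.
        
        ```python
        from collections import Counter

        FIRST_IN_PAIR = 0x40   # SAM flag bit set on read 1
        SECOND_IN_PAIR = 0x80  # SAM flag bit set on read 2


        def tally_mismatches(alignments):
            """Count mismatches keyed by (read_number, position, ref_base, read_base).

            `alignments` yields (flag, read_seq, ref_seq) tuples whose two
            sequences have equal length (hypothetical input format).
            """
            counts = Counter()
            for flag, read_seq, ref_seq in alignments:
                if flag & FIRST_IN_PAIR:
                    read_number = 1
                elif flag & SECOND_IN_PAIR:
                    read_number = 2
                else:
                    continue  # skip reads that are not part of a pair
                # Positions are 0-based offsets along the aligned sequence.
                for pos, (ref_base, read_base) in enumerate(zip(ref_seq, read_seq)):
                    if ref_base != read_base:
                        counts[(read_number, pos, ref_base, read_base)] += 1
            return counts


        counts = tally_mismatches([
            (0x43, "ACGT", "ACGA"),  # read 1, A>T at position 3
            (0x83, "TTGT", "TCGT"),  # read 2, C>T at position 1
            (0x43, "ACGT", "ACGA"),  # read 1, same mismatch again
            (0x00, "AAAA", "CCCC"),  # unpaired, ignored
        ])
        ```
        
        A counter keyed this way can be reshaped into one table per read number, with positions as rows and reference-to-read base changes as columns.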
        
        
        
        * Easy-to-use command-line and [nextflow](https://www.nextflow.io/) implementation.
        * Includes a [Galaxy](https://usegalaxy.org/) wrapper.
        
        
        ## Contributing
        
        Contributions are welcome and encouraged.
        
        ## Authors
        
        * [Brad Langhorst](https://github.com/bwlang)
        * [Ariel Erijman](https://github.com/aerijman)
        
        ## License
        
        `tasmanian artifact metrics tool` is open source software released under the [GNU License](https://github.com/nebiolabs/tasmanian/blob/master/LICENCE.txt).
        
        
Platform: UNKNOWN
Requires-Python: >=3.6
Description-Content-Type: text/markdown
