dandelion.tools.find_clones

dandelion.tools.find_clones(self, identity=0.85, clustering_by=None, by_alleles=None, key_added=None, calculate_junction_length=True)[source]

Find clones based on heavy chain and light chain CDR3 junction hamming distance.

Parameters
  • self (Dandelion, DataFrame, str) – Dandelion object, pandas DataFrame in changeo/airr format, or file path to changeo/airr file after clones have been determined.

  • identity (float) – Junction similarity parameter. Default 0.85

  • clustering_by (str, optional) – modes for clustering: ‘nt’ or ‘aa’. None defaults to ‘aa’.

  • by_alleles (bool, optional) – Whether or not to collapse alleles to genes. None defaults to True.

  • key_added (str, optional) – If specified, this will be the column name for clones. None defaults to ‘clone_id’

  • calculate_junction_length (bool) – Whether or not to re-calculate junction length, rather than rely on parsed assignment (which occasionally is wrong). Default is True

Returns

Return type

Dandelion object with clone_id annotated in .data slot and .metadata initialized.