cluster¶
A cluster is an ordered set of hits related to a model which satisfy the model distance constraints.
cluster API reference¶
cluster¶
-
class
macsypy.cluster.Cluster(hits, model, hit_weights)[source]¶ Handle hits relative to a model which collocates
-
__contains__(v_hit)[source]¶ - Parameters
v_hit (
macsypy.hit.ModelHitobject) – The hit to test- Returns
True if the hit is in the cluster hits, False otherwise
-
__init__(hits, model, hit_weights)[source]¶ - Parameters
hits ([
macsypy.hit.CoreHit|macsypy.hit.ModelHit, … ]) – the hits constituting this clustermodel (
macsypy.model.Model) – the model associated to this cluster
-
__weakref__¶ list of weak references to the object (if defined)
-
_check_replicon_consistency()[source]¶ - Raise
MacsypyError if all hits of a cluster are NOT related to the same replicon
-
fulfilled_function(*genes)[source]¶ - Parameters
gene – The genes which must be tested.
- Returns
the common functions between genes and this cluster.
- Return type
set of string
-
property
hit_weights¶ - Returns
the different weight for the hits used to compute the score
- Return type
-
property
loner¶ - Returns
True if this cluster is made of only some hits representing the same gene and this gene is tag as loner False otherwise: - contains several hits coding for different genes - contains one hit but gene is not tag as loner (max_gene_required = 1)
-
merge(cluster, before=False)[source]¶ merge the cluster in this one. (do it in place)
- Parameters
cluster (
macsypy.cluster.Clusterobject) –before (bool) – If False the hits of the cluster will be add at the end of this one, Otherwise the cluster hits will be inserted before the hits of this one.
- Returns
None
- Raises
MacsypyError – if the two clusters have not the same model
-
property
multi_system¶ - Returns
True if this cluster is made of only one hit representing a multi_system gene False otherwise: - contains several hits - contains one hit but gene is not tag as loner (max_gene_required = 1)
-
replace(old, new)[source]¶ replace hit old in this cluster by new one. (work in place)
- Parameters
old (
macsypy.hit.ModelHitobject.) – the hit to replacenew (
macsypy.hit.ModelHitobject.) – the new hit
- Returns
None
-
property
replicon_name¶ - Returns
The name of the replicon where this cluster is located
- Return type
str
-
property
score¶ - Returns
The score for this cluster
- Return type
float
-
build_clusters¶
-
macsypy.cluster.build_clusters(hits, rep_info, model, hit_weights)[source]¶ From a list of filtered hits, and replicon information (topology, length), build all lists of hits that satisfied the constraints:
max_gene_inter_space
loner
multi_system
If Yes create a cluster A cluster contains at least two hits separated by less or equal than max_gene_inter_space Except for loner genes which are allowed to be alone in a cluster
- Parameters
hits (list of
macsypy.hit.ModelHitobjects) – list of filtered hitsrep_info (
macsypy.Indexes.RepliconInfoobject) – the replicon to analysemodel (
macsypy.model.Modelobject) – the model to study
- Returns
list of regular clusters, the special clusters (loners not in cluster and multi systems)
- Return type
tuple with 2 elements
true_clusters which is list of
Clusterobjectstrue_loners: a dict { str function: :class:macsypy.hit.Loner | :class:macsypy.hit.LonerMultiSystem object}