bpp-popgen  2.2.0
bpp::PolymorphismSequenceContainerTools Class Reference

Utilitary function to manipulate PolymorphismSequenceContainer. More...

#include <Bpp/PopGen/PolymorphismSequenceContainerTools.h>

Public Member Functions

 ~PolymorphismSequenceContainerTools ()
 

Static Public Member Functions

static PolymorphismSequenceContainerread (const std::string &path, const Alphabet *alpha) throw (Exception)
 Read a Mase+ file and return a PolymorphismSequenceContainer. Toggle Sequence when selection tag begin with OUTGROUP (see Polymorphix) More...
 
static PolymorphismSequenceContainerextractIngroup (const PolymorphismSequenceContainer &psc) throw (Exception)
 Extract ingroup sequences from a PolymorphismSequenceContainer and create a new one. More...
 
static PolymorphismSequenceContainerextractOutgroup (const PolymorphismSequenceContainer &psc) throw (Exception)
 Extract outgroup sequences from a PolymorphismSequenceContainer and create a new one. More...
 
static PolymorphismSequenceContainerextractGroup (const PolymorphismSequenceContainer &psc, size_t group_id) throw (Exception)
 Extract a special group from the PolymorphismSequenceContainer. More...
 
static PolymorphismSequenceContainergetSelectedSequences (const PolymorphismSequenceContainer &psc, const SequenceSelection &ss)
 Extract selected sequences. More...
 
static PolymorphismSequenceContainersample (const PolymorphismSequenceContainer &psc, size_t n, bool replace=true)
 Get a random set of sequences. More...
 
static PolymorphismSequenceContainergetSitesWithoutGaps (const PolymorphismSequenceContainer &psc)
 Retrieves sites without gaps from PolymorphismSequenceContainer. More...
 
static size_t getNumberOfNonGapSites (const PolymorphismSequenceContainer &psc, bool ingroup) throw (Exception)
 Return number of sites without gaps in a PolymorphismSequenceContainer. More...
 
static size_t getNumberOfCompleteSites (const PolymorphismSequenceContainer &psc, bool ingroup) throw (Exception)
 Return number of completely resolved sites in a PolymorphismSequenceContainer. More...
 
static PolymorphismSequenceContainergetCompleteSites (const PolymorphismSequenceContainer &psc)
 Retrieves complete sites from a PolymorphismSequenceContainer. More...
 
static PolymorphismSequenceContainerexcludeFlankingGap (const PolymorphismSequenceContainer &psc)
 exclude flanking sites with gap but keep gap sites within the alignment More...
 
static PolymorphismSequenceContainergetSelectedSites (const PolymorphismSequenceContainer &psc, const std::string &setName, bool phase)
 Get a PolymorphismSequenceContainer corresponding to a site selection annotated in the mase comments. More...
 
static PolymorphismSequenceContainergetNonCodingSites (const PolymorphismSequenceContainer &psc, const std::string &setName)
 Retrieve non-coding sites defined in the mase file header. More...
 
static PolymorphismSequenceContainergetOnePosition (const PolymorphismSequenceContainer &psc, const std::string &setName, size_t pos)
 Retrieve sites at one codon position (1,2,3) More...
 
static PolymorphismSequenceContainergetIntrons (const PolymorphismSequenceContainer &psc, const std::string &setName, const GeneticCode *gCode)
 Retrieve intron sites. More...
 
static PolymorphismSequenceContainerget5Prime (const PolymorphismSequenceContainer &psc, const std::string &setName)
 Retrieve 5' sites. More...
 
static PolymorphismSequenceContainerget3Prime (const PolymorphismSequenceContainer &psc, const std::string &setName, const GeneticCode *gCode)
 Retrieve 3' sites. More...
 
static std::string getIngroupSpeciesName (const PolymorphismSequenceContainer &psc)
 Get the species name of the ingroup. More...
 

Detailed Description

Utilitary function to manipulate PolymorphismSequenceContainer.

Author
Sylvain Gaillard

Definition at line 71 of file PolymorphismSequenceContainerTools.h.

Constructor & Destructor Documentation

◆ ~PolymorphismSequenceContainerTools()

PolymorphismSequenceContainerTools::~PolymorphismSequenceContainerTools ( )

Definition at line 46 of file PolymorphismSequenceContainerTools.cpp.

Member Function Documentation

◆ excludeFlankingGap()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::excludeFlankingGap ( const PolymorphismSequenceContainer psc)
static

exclude flanking sites with gap but keep gap sites within the alignment

Parameters
psca PolymorphismSequenceContainer reference

Definition at line 318 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::clone().

◆ extractGroup()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::extractGroup ( const PolymorphismSequenceContainer psc,
size_t  group_id 
)
throw (Exception
)
static

Extract a special group from the PolymorphismSequenceContainer.

Parameters
psca PolymorphismSequenceContainer reference.
group_idthe group identifier as an size_t.
Exceptions
GroupNotFoundExceptionif group_id is not found.

Definition at line 149 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::deleteSequence().

◆ extractIngroup()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::extractIngroup ( const PolymorphismSequenceContainer psc)
throw (Exception
)
static

Extract ingroup sequences from a PolymorphismSequenceContainer and create a new one.

Parameters
psca PolymorphismSequenceContainer reference
Exceptions
Exceptionif there is no ingroup sequence

Definition at line 103 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::deleteSequence().

◆ extractOutgroup()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::extractOutgroup ( const PolymorphismSequenceContainer psc)
throw (Exception
)
static

Extract outgroup sequences from a PolymorphismSequenceContainer and create a new one.

Parameters
psca PolymorphismSequenceContainer reference
Exceptions
Exceptionif there is no outgroup sequence

Definition at line 126 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::deleteSequence().

◆ get3Prime()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::get3Prime ( const PolymorphismSequenceContainer psc,
const std::string &  setName,
const GeneticCode *  gCode 
)
static

◆ get5Prime()

◆ getCompleteSites()

◆ getIngroupSpeciesName()

string PolymorphismSequenceContainerTools::getIngroupSpeciesName ( const PolymorphismSequenceContainer psc)
static

Get the species name of the ingroup.

Parameters
psca PolymorphismSequenceContainer.

Definition at line 568 of file PolymorphismSequenceContainerTools.cpp.

◆ getIntrons()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::getIntrons ( const PolymorphismSequenceContainer psc,
const std::string &  setName,
const GeneticCode *  gCode 
)
static

Retrieve intron sites.

Same as getNonCodgingSites but exclude 5' and 3' flanking regions if there are

Parameters
psca PolymorphismSequenceContainer
setNamename of the CDS site selection
gCodeThe genetic code to use

Definition at line 433 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::getGroupId(), bpp::PolymorphismSequenceContainer::isIngroupMember(), bpp::PolymorphismSequenceContainer::setAsIngroupMember(), bpp::PolymorphismSequenceContainer::setAsOutgroupMember(), and bpp::PolymorphismSequenceContainer::setGroupId().

◆ getNonCodingSites()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::getNonCodingSites ( const PolymorphismSequenceContainer psc,
const std::string &  setName 
)
static

◆ getNumberOfCompleteSites()

size_t PolymorphismSequenceContainerTools::getNumberOfCompleteSites ( const PolymorphismSequenceContainer psc,
bool  ingroup 
)
throw (Exception
)
static

Return number of completely resolved sites in a PolymorphismSequenceContainer.

Parameters
psca PolymorphismSequenceContainer reference
ingroupa boolean set to true if you want to take only ingroup sequences into account
Exceptions
Exceptionif there is no ingroup sequence

Definition at line 263 of file PolymorphismSequenceContainerTools.cpp.

◆ getNumberOfNonGapSites()

size_t PolymorphismSequenceContainerTools::getNumberOfNonGapSites ( const PolymorphismSequenceContainer psc,
bool  ingroup 
)
throw (Exception
)
static

Return number of sites without gaps in a PolymorphismSequenceContainer.

Parameters
psca PolymorphismSequenceContainer reference
ingroupa boolean set to true if you want to take only ingroup sequences into account
Exceptions
Exceptionif there is no ingroup sequence

Definition at line 233 of file PolymorphismSequenceContainerTools.cpp.

◆ getOnePosition()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::getOnePosition ( const PolymorphismSequenceContainer psc,
const std::string &  setName,
size_t  pos 
)
static

Retrieve sites at one codon position (1,2,3)

Be carefull: to use before excluding gap Be careful: if there is no phase information, the method catch an exception and set the phase to 1 This allows to use this method for PolymorphismSequenceContainer generated by getSelectedSequence

Parameters
psca PolymorphismSequenceContainer reference
setNamename of the CDS site selection
posposition index.

Definition at line 392 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::getGroupId(), bpp::PolymorphismSequenceContainer::isIngroupMember(), bpp::PolymorphismSequenceContainer::setAsIngroupMember(), bpp::PolymorphismSequenceContainer::setAsOutgroupMember(), and bpp::PolymorphismSequenceContainer::setGroupId().

◆ getSelectedSequences()

◆ getSelectedSites()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::getSelectedSites ( const PolymorphismSequenceContainer psc,
const std::string &  setName,
bool  phase 
)
static

Get a PolymorphismSequenceContainer corresponding to a site selection annotated in the mase comments.

Be carefull : in the new PolymorphismSequenceContainer the mase comments are lost Information about cds positions and start codon is no more available

Parameters
psca PolymorphismSequenceContainer.
setNameThe name of the set to retrieve.
phasea boolean set to true if you want to take the phase into account during the extraction. It removes the useless sites.

Definition at line 335 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::getGroupId(), bpp::PolymorphismSequenceContainer::isIngroupMember(), bpp::PolymorphismSequenceContainer::setAsIngroupMember(), bpp::PolymorphismSequenceContainer::setAsOutgroupMember(), and bpp::PolymorphismSequenceContainer::setGroupId().

◆ getSitesWithoutGaps()

◆ read()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::read ( const std::string &  path,
const Alphabet *  alpha 
)
throw (Exception
)
static

Read a Mase+ file and return a PolymorphismSequenceContainer. Toggle Sequence when selection tag begin with OUTGROUP (see Polymorphix)

Parameters
pathPath to the Mase+ file
alphaSequence Alphabet
Exceptions
Exceptionif the file is not in the specified format

Definition at line 50 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::setAsOutgroupMember().

◆ sample()

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::sample ( const PolymorphismSequenceContainer psc,
size_t  n,
bool  replace = true 
)
static

Get a random set of sequences.

Parameters
psca PolymorphismSequenceContainer reference
nthe number of sequence to get
replacea boolean flag true for sampling with replacement

Definition at line 192 of file PolymorphismSequenceContainerTools.cpp.

References getSelectedSequences().


The documentation for this class was generated from the following files: