.. _history:
Software Background
====================
Canu is derived from `Celera Assembler `_, which is no longer maintained.
Celera Assembler [`Myers 2000 `_] was designed to reconstruct mammalian chromosomal DNA
sequences from the short fragments of a whole genome shotgun sequencing project.
Celera Assembler was used to produce reconstructions of several large genomes, namely
those of Homo sapiens [`Venter 2001 `_], Mus musculus [`Mural 2002 `_], Rattus norvegicus
[`unpublished data `_], Canis familiaris [`Kirkness 2003 `_], Drosophila melanogaster [`Adams
2000 `_], and Anopheles gambiae [`Holt 2001 `_]. Celera Assembler was shown to be very
accurate when its reconstruction of the human genome was compared to independent
reconstructions completed later [`Istrail 2004 `_]. It was used to reconstructing one of the
first large-scale metagenomic projects [`Venter 2004 `_, `Rusch 2007 `_] and a diploid human
reference [`Levy 2007 `_, `Denisov 2008 `_]. It was adapted to 454 Pyrosequencing [`Miller 2008 `_]
and PacBio sequencing [`Koren 2012 `_], demonstrating finished bacterial genomes [`Koren 2013 `_]
and efficient algorithms for eukaryotic assembly [`Berlin 2015 `_].
Celera Assembler was released under the GNU General Public License, version 2 as a
supplement to [`Istrail 2004 `_].
Canu [`Koren and Walenz 2017 `_] was branched from
Celera Assembler in 2015, and specialized for single-molecule high-noise sequences.
For the most recent license information on Canu,
see `README.licences `_.
References
--------------------
- Adams et al. (2000) `The Genome Sequence of Drosophila melanogaster `_. Science 287 2185-2195.
- Myers et al. (2000) `A Whole-Genome Assembly of Drosophila `_. Science 287 2196-2204.
- Venter et al. (2001) `The Sequence of the Human Genome `_. Science 291 1304-1351.
- Mural et al. (2002) `A Comparison of Whole-Genome Shotgun-Derived Mouse Chromosome 16 and the Human Genome `_. Science 296 1661-1671.
- Holt et al. (2002) `The Genome Sequence of the Malaria Mosquito Anophelesd gambiae `_. Science 298 129-149.
- Istrail et al. (2004) `Whole Genome Shotgun Assembly and Comparison of Human Genome Assemblies `_. PNAS 101 1916-1921.
- Kirkness et al. (2003) `The Dog Genome: Survey Sequencing and Comparative Analysis `_. Science 301 1898-1903.
- Venter et al. (2004) `Environmental genome shotgun sequencing of the Sargasso Sea `_. Science 304 66-74.
- Levy et al. (2007) `The Diploid Genome Sequence of an Individual Human `_. PLoS Biology 0050254
- Rusch et al. (2007) `The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific `_. PLoS Biology 1821060.
- Denisov et al. (2008) `Consensus Generation and Variant Detection by Celera Assembler `_. Bioinformatics 24(8):1035-40
- Miller et al. (2008) `Aggressive Assembly of Pyrosequencing Reads with Mates `_. Bioinformatics 24(24):2818-2824
- Koren et al. (2012) `Hybrid error correction and de novo assembly of single-molecule sequencing reads `_. Nature Biotechnology, July 2012.
- Koren et al. (2013) `Reducing assembly complexity of microbial genomes with single-molecule sequencing `_. Genome Biology 14:R101.
- Berlin et. al. (2015) `Assembling Large Genomes with Single-Molecule Sequencing and Locality Sensitive Hashing `_. Nature Biotechnology. (2015).
- Koren and Walenz et al. (2017) `Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation `_. Genome Research. (2017).