DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data. The key features are:
Keep posted about new developments by following me on Twitter.
This demonstrates a quick example for setting up and using the program on Linux. Installing the software on your system may be done by downloading it in binary format for immediate use:
wget http://github.com/bbuchfink/diamond/releases/download/v2.0.4/diamond-linux64.tar.gz
tar xzf diamond-linux64.tar.gzThe extracted diamond binary file should be moved to a directory contained in your executable search path (PATH environment variable).
To now run an alignment task, we assume to have a protein database file in FASTA format named nr.faa and a file of DNA reads that we want to align named reads.fna.
In order to set up a reference database for DIAMOND, the makedb command needs to be executed with the following command line:
$ diamond makedb --in nr.faa -d nrThis will create a binary DIAMOND database file with the specified name (nr.dmnd). The alignment task may then be initiated using the blastx command like this:
$ diamond blastx -d nr -q reads.fna -o matches.m8The output file here is specified with the –o option and named matches.m8. By default, it is generated in BLAST tabular format.
Note:
--masking 0.-b.--mid-sensitive, --sensitive, --more-sensitive, --very-sensitive and --ultra-sensitive.The online documentation is located at http://www.diamondsearch.org. For the markdown source code and an offline, text-based documentation, see: https://github.com/bbuchfink/diamond_docs.
DIAMOND is developed by Benjamin Buchfink at the Drost lab, Max Planck Institute for Developmental Biology, Tübingen, Germany.
[Email] [Twitter] [Google Scholar] [Drost lab] [MPI-EBIO]
Publication: