lambda2_mkindexn - the Local Aligner for Massive Biological DatA
Contents
Description
Lambda is a local aligner optimized for many query sequences and searches in protein space. It is
compatible to BLAST, but much faster than BLAST and many other comparable tools.
Detailed information is available in the wiki: <https://github.com/seqan/lambda/wiki>
This is the indexer_binary for creating lambda-compatible databases.
Legal
lambda2mkindexnCopyright: 2013-2019 Hannes Hauswedell, released under the GNU AGPL v3 (or later);
2016-2019 Knut Reinert and Freie Universität Berlin, released under the 3-clause-BSDL
SeqAnCopyright: 2006-2015 Knut Reinert, FU-Berlin; released under the 3-clause BSDL.
Inyouracademicworkspleasecite: Hauswedell et al (2014); doi: 10.1093/bioinformatics/btu439
For full copyright and/or warranty information see --copyright.
lambda2 mkindexn 2.0.1 Nov 28 2024 LAMBDA2_MKINDEXN(1)
Name
lambda2_mkindexn - the Local Aligner for Massive Biological DatA
Options
-h, --help
Display the help message.
-hh, --full-help
Display the help message with advanced options.
--version
Display version information.
--copyright
Display long copyright information.
-v, --verbosityINTEGER
Display more/less diagnostic output during operation: 0 [only errors]; 1 [default]; 2 [+run-time,
options and statistics]. In range [0..2]. Default: 1.
InputOptions:-d, --databaseINPUT_FILE
Database sequences. Valid filetypes are: .sam[.*], .raw[.*], .gbk[.*], .frn[.*], .fq[.*],
.fna[.*], .ffn[.*], .fastq[.*], .fasta[.*], .faa[.*], .fa[.*], .embl[.*], and .bam, where * is any
of the following extensions: gz, bz2, and bgzf for transparent (de)compression.
-m, --acc-tax-mapINPUT_FILE
An NCBI or UniProt accession-to-taxid mapping file. Download from
ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/accession2taxid/ or
ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/ . Valid
filetypes are: .dat[.*] and .accession2taxid[.*], where * is any of the following extensions: gz,
bz2, and bgzf for transparent (de)compression.
-x, --tax-dump-dirINPUT_DIRECTORY
A directory that contains nodes.dmp and names.dmp; unzipped from
ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/taxdump.tar.gzOutputOptions:-i, --indexOUTPUT_DIRECTORY
The output directory for the index files (defaults to "DATABASE.lambda"). Valid filetype is:
.lambda.
--db-index-typeSTRING
Suffix array or full-text minute space. One of fm and bifm. Default: fm.
--truncate-idsBOOL
Truncate IDs at first whitespace. This saves a lot of space and is irrelevant for all LAMBDA
output formats other than BLAST Pairwise (.m0). One of 1, ON, TRUE, T, YES, 0, OFF, FALSE, F, and
NO. Default: on.
Algorithm:--algorithmSTRING
Algorithm for SA construction (also used for FM; see Memory Requirements below!). One of
mergesort, quicksortbuckets, quicksort, radixsort, and skew7ext. Default: radixsort.
-t, --threadsINTEGER
number of threads to run concurrently. Default: autodetected.
--tmp-dirOUTPUT_DIRECTORY
temporary directory used by skew, defaults to working directory.
Remarks
Please see the wiki (<https://github.com/seqan/lambda/wiki>) for more information on which indexes to
chose and which algorithms to pick.
Note that the indexes created are binary and not compatible between different CPU endiannesses. Also the
on-disk format is still subject to change between Lambda versions.
Synopsis
lambda2mkindexn [OPTIONS] -d DATABASE.fasta [-i INDEX.lambda]
