logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

lambda2_mkindexn - the Local Aligner for Massive Biological DatA

Description

       Lambda  is  a  local  aligner  optimized  for  many  query sequences and searches in protein space. It is
       compatible to BLAST, but much faster than BLAST and many other comparable tools.

       Detailed information is available in the wiki: <https://github.com/seqan/lambda/wiki>

       This is the indexer_binary for creating lambda-compatible databases.

Name

       lambda2_mkindexn - the Local Aligner for Massive Biological DatA

Options

-h, --help
              Display the help message.

       -hh, --full-help
              Display the help message with advanced options.

       --version
              Display version information.

       --copyright
              Display long copyright information.

       -v, --verbosityINTEGER
              Display more/less diagnostic output during operation: 0 [only errors]; 1 [default]; 2  [+run-time,
              options and statistics]. In range [0..2]. Default: 1.

   InputOptions:-d, --databaseINPUT_FILE
              Database  sequences.  Valid  filetypes  are:  .sam[.*],  .raw[.*],  .gbk[.*],  .frn[.*],  .fq[.*],
              .fna[.*], .ffn[.*], .fastq[.*], .fasta[.*], .faa[.*], .fa[.*], .embl[.*], and .bam, where * is any
              of the following extensions: gz, bz2, and bgzf for transparent (de)compression.

       -m, --acc-tax-mapINPUT_FILE
              An     NCBI     or     UniProt     accession-to-taxid     mapping     file.     Download      from
              ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/accession2taxid/                                        or
              ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/    .    Valid
              filetypes  are: .dat[.*] and .accession2taxid[.*], where * is any of the following extensions: gz,
              bz2, and bgzf for transparent (de)compression.

       -x, --tax-dump-dirINPUT_DIRECTORY
              A     directory     that     contains     nodes.dmp     and     names.dmp;      unzipped      from
              ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/taxdump.tar.gzOutputOptions:-i, --indexOUTPUT_DIRECTORY
              The  output  directory  for  the  index  files (defaults to "DATABASE.lambda"). Valid filetype is:
              .lambda.

       --db-index-typeSTRING
              Suffix array or full-text minute space. One of fm and bifm. Default: fm.

       --truncate-idsBOOL
              Truncate IDs at first whitespace. This saves a lot of space  and  is  irrelevant  for  all  LAMBDA
              output  formats other than BLAST Pairwise (.m0). One of 1, ON, TRUE, T, YES, 0, OFF, FALSE, F, and
              NO. Default: on.

   Algorithm:--algorithmSTRING
              Algorithm for SA construction (also  used  for  FM;  see  Memory   Requirements  below!).  One  of
              mergesort, quicksortbuckets, quicksort, radixsort, and skew7ext. Default: radixsort.

       -t, --threadsINTEGER
              number of threads to run concurrently. Default: autodetected.

       --tmp-dirOUTPUT_DIRECTORY
              temporary directory used by skew, defaults to working directory.

Remarks

       Please  see  the  wiki  (<https://github.com/seqan/lambda/wiki>) for more information on which indexes to
       chose and which algorithms to pick.

       Note that the indexes created are binary and not compatible between different CPU endiannesses. Also  the
       on-disk format is still subject to change between Lambda versions.

Synopsis

lambda2mkindexn [OPTIONS] -d DATABASE.fasta [-i INDEX.lambda]

See Also