logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

QTLtools extract - Data extraction mode.

Authors

       Olivier Delaneau (olivier.delaneau@gmail.com), Halit Ongen (halitongen@gmail.com)

QTLtools-v1.3                                      06 May 2020                               QTLtools-extract(1)

Bugs

       o Versions  up  to  and  including  1.2, suffer from a bug in reading missing genotypes in VCF/BCF files.
         This bug affects variants with a DS field in their genotype's FORMAT and have a  missing  genotype  (DS
         field  is  .)  in  one  of the samples, in which case genotypes for all the samples are set to missing,
         effectively removing this variant from the analyses.

       Please submit bugs to <https://github.com/qtltools/qtltools>

Citation

       Delaneau, O., Ongen, H., Brown, A. et al. A complete tool set for molecular QTL discovery  and  analysis.
       NatCommun8, 15452 (2017).  <https://doi.org/10.1038/ncomms15452>

Description

       Extract  all  the data from the provided files into one flat file.  This will also impute all the missing
       data.

Example

       o Extract genotypes, gene quantifications, and covariates from a certain region:

         QTLtools   extract    --vcf    genotypes.chr22.vcf.gz    --bed    genes.50percent.chr22.bed.gz    --cov
         genes.covariates.pc50.txt.gz --region chr22:17000000-18000000 --out extract_output

Name

       QTLtools extract - Data extraction mode.

Options

--vcf[in.vcf|in.bcf|in.vcf.gz]
              Genotypes in VCF/BCF format.  At least one of --vcf--bed--cov is REQUIRED.

       --bedquantifications.bed.gz
              Molecular phenotype quantifications in BED format.  At least one of --vcf--bed--cov is REQUIRED.

       --covcovariates.txt
              Covariates data.  At least one of --vcf--bed--cov is REQUIRED.

       --outfilename_prefix
              Output file name prefix.  REQUIRED.

       --regionchr:start-end
              Genomic region to be processed.  E.g. chr4:12334456-16334456, or chr5.

Output Files

.content.txt.gz
        Flat file with samples in rows and all the extracted variables in columns.

       .header.txt
        Genomic locations of the extracted variables.

See Also

QTLtools(1)

       QTLtools website: <https://qtltools.github.io/qtltools>

Synopsis

QTLtoolsextract[--vcf--bed--cov]relevant_file--outoutput_prefix[OPTIONS]

See Also