phase_common
{: .no_toc .text-center }
Table of contents
{: .no_toc .text-delta }
- TOC {:toc}
Description
Tool to phase common sites, typically SNP array data, or the first step of WES/WGS data.
Usage
Simple run
SHAPEIT5_phase_common --input 10k/msprime.nodup.bcf --filter-maf 0.001 --output 10k/msprime.common.phased.bcf --region 1 --thread 8
The program phases common variants (--filter-maf 0.001) from the input file (--input 10k/msprime.nodup.bcf) using 8 threads (--thread 8) on the full chromosome 1 (--region 1) and saves the results in the output file (--output 10k/msprime.common.phased.bcf).
Command line options
Basic options
--help |
NA |
NA |
Produces help message |
--seed |
INT |
15052011 |
Seed of the random number generator |
-T [ --thread ] |
INT |
1 |
Number of thread used |
-I [--input ] |
STRING |
NA |
Genotypes to be phased in VCF/BCF format |
-H [--reference ] |
STRING |
NA |
Reference panel of haplotypes in VCF/BCF format |
-S [--scaffold ] |
STRING |
NA |
Scaffold of haplotypes in VCF/BCF format |
-M [--map ] |
STRING |
NA |
Genetic map |
--pedigree |
STRING |
NA |
Pedigree information (chile father mother) |
-R [--region ] |
STRING |
NA |
Target region |
Filter parameters
--filter-snp |
NA |
NA |
If specified, the program only consider SNPs |
--filter-maf |
FLOAT |
0 |
[Expert option] Only consider variants with MAF above the specifeed value. It requires AC/AN tags in VCF/BCF file. |
MCMC parameters
--mcmc-iterations |
STRING |
5b,1p,1b,1p,1b,1p,5m |
Iteration scheme of the MCMC (burnin=b, pruning=p, main=m) |
--mcmc-prune |
FLOAT |
0.999 |
Pruning threshold for genotype graphs |
--mcmc-noinit |
NA |
NA |
If specified, phasing initialization by PBWT sweep is disabled |
PBWT parameters
--pbwt-modulo |
FLOAT |
0.1 |
Storage frequency of PBWT indexes in cM |
--pbwt-depth |
INT |
4 |
Depth of PBWT indexes to condition on |
--pbwt-mac |
INT |
5 |
Minimal Minor Allele Count at which PBWT is evaluated |
--pbwt-mdr |
FLOAT |
0.1 |
Maximal Missing Data Rate at which PBWT is evaluated |
--pbwt-window |
INT |
4 |
Run PBWT selection in windows of this size |
HMM parameters
--hmm-window |
INT |
4 |
Minimal size of the phasing window in cM |
--hmm-ne |
INT |
15000 |
Effective size of the population |
Output files
-O [--output ] |
STRING |
NA |
Phased haplotypes in VCF/BCF format |
--output-graph |
STRING |
NA |
Phased haplotypes in BIN format (Useful to sample multiple likely haplotype configurations per sample) |
--log |
STRING |
NA |
Log file |