phase_common
{: .no_toc .text-center }
Table of contents
{: .no_toc .text-delta }
- TOC {:toc}
Description
Tool to phase common sites, typically SNP array data, or the first step of WES/WGS data.
Usage
Simple run
SHAPEIT5_phase_common --input 10k/msprime.nodup.bcf --filter-maf 0.001  --output 10k/msprime.common.phased.bcf --region 1 --thread 8
 
The program phases common variants (--filter-maf 0.001) from the input file (--input 10k/msprime.nodup.bcf) using 8 threads (--thread 8) on the full chromosome 1 (--region 1) and saves the results in the output file (--output 10k/msprime.common.phased.bcf).
Command line options
Basic options
| --help | NA | NA | Produces help message | 
| --seed | INT | 15052011 | Seed of the random number generator | 
| -T [ --thread ] | INT | 1 | Number of thread used | 
| -I [--input ] | STRING | NA | Genotypes to be phased in VCF/BCF format | 
| -H [--reference ] | STRING | NA | Reference panel of haplotypes in VCF/BCF format | 
| -S [--scaffold ] | STRING | NA | Scaffold of haplotypes in VCF/BCF format | 
| -M [--map ] | STRING | NA | Genetic map | 
| --pedigree | STRING | NA | Pedigree information (chile father mother) | 
| -R [--region ] | STRING | NA | Target region | 
Filter parameters
| --filter-snp | NA | NA | If specified, the program only consider SNPs | 
| --filter-maf | FLOAT | 0 | [Expert option] Only consider variants with MAF above the specifeed value. It requires AC/AN tags in VCF/BCF file. | 
MCMC parameters
| --mcmc-iterations | STRING | 5b,1p,1b,1p,1b,1p,5m | Iteration scheme of the MCMC (burnin=b, pruning=p, main=m) | 
| --mcmc-prune | FLOAT | 0.999 | Pruning threshold for genotype graphs | 
| --mcmc-noinit | NA | NA | If specified, phasing initialization by PBWT sweep is disabled | 
PBWT parameters
| --pbwt-modulo | FLOAT | 0.1 | Storage frequency of PBWT indexes in cM | 
| --pbwt-depth | INT | 4 | Depth of PBWT indexes to condition on | 
| --pbwt-mac | INT | 5 | Minimal Minor Allele Count at which PBWT is evaluated | 
| --pbwt-mdr | FLOAT | 0.1 | Maximal Missing Data Rate at which PBWT is evaluated | 
| --pbwt-window | INT | 4 | Run PBWT selection in windows of this size | 
HMM parameters
| --hmm-window | INT | 4 | Minimal size of the phasing window in cM | 
| --hmm-ne | INT | 15000 | Effective size of the population | 
Output files
| -O [--output ] | STRING | NA | Phased haplotypes in VCF/BCF format | 
| --output-graph | STRING | NA | Phased haplotypes in BIN format (Useful to sample multiple likely haplotype configurations per sample) | 
| --log | STRING | NA | Log file |