{: .no_toc .text-center }
{: .no_toc .text-delta }
Tool to phase rare variants onto a scaffold of common variants (output of phase_common / ligate).
Simple run
#chunk1
SHAPEIT5_phase_rare --input-plain 10k/msprime.nodup.bcf --scaffold 10k/msprime.common.truth.bcf --output 10k/msprime.rare.chunk1.bcf --scaffold-region 1:1000000-3000000 --input-region 1:1500000-2500000 --thread 8
#chunk2
SHAPEIT5_phase_rare --input-plain 10k/msprime.nodup.bcf --scaffold 10k/msprime.common.truth.bcf --output 10k/msprime.rare.chunk2.bcf --scaffold-region 1:2000000-4000000 --input-region 1:2500001-3500000 --thread 8The first command phases rare variants from the input file (--input-plain 10k/msprime.nodup.bcf) using 8 threads (--thread 8) on the region 1500000-2500000 of chromosome 1 (--input-region 1:1500000-2500000) using a haplotype scaffold phased for the full chromosome 1 obtained from the phase_common program in the region 1000000-3000000 of chromosome 1 (--scaffold-region 1:1000000-3000000) and saves the results in the output file (--output 10k/msprime.rare.chunk1.bcf).
The obtained files can be quickly concatenated to generate chromosome-wide files using bcftools concat –naive.
| Option name | Argument | Default | Description |
|---|---|---|---|
| --help | NA | NA | Produces help message |
| --seed | INT | 15052011 | Seed of the random number generator |
| -T [ --thread ] | INT | 1 | Number of thread used |
| Option name | Argument | Default | Description |
|---|---|---|---|
| --input-plain | STRING | NA | Genotypes to be phased in plain VCF/BCF format |
| --input-region | STRING | NA | Region to be considered in --input-plain |
| --input-maf | FLOAT | 0.001 | Threshold for sparse genotype representation in –input-plain |
| --scaffold | STRING | NA | Scaffold of haplotypes in VCF/BCF format |
| --scaffold-region | STRING | NA | Region to be considered in --scaffold |
| --map | STRING | NA | Genetic map |
| --pedigree | STRING | NA | Pedigree information (chile father mother) |
| Option name | Argument | Default | Description |
|---|---|---|---|
| --pbwt-modulo | FLOAT | 0.1 | Storage frequency of PBWT indexes in cM |
| --pbwt-depth-common | INT | 2 | Depth of PBWT indexes at common sites to condition on |
| --pbwt-depth-rare | INT | 2 | Depth of PBWT indexes at rare sites to condition on |
| --pbwt-mac | INT | 2 | Minimal Minor Allele Count at which PBWT is evaluated |
| --pbwt-mdr | FLOAT | 0.1 | Maximal Missing Data Rate at which PBWT is evaluated |
| Option name | Argument | Default | Description |
|---|---|---|---|
| --effective-size | INT | 15000 | Effective size of the population |
| Option name | Argument | Default | Description |
|---|---|---|---|
| -O [--output ] | STRING | NA | Phased haplotypes in VCF/BCF format |
| --output-buffer | STRING | NA | If specified, right and left buffers are printed in output |
| --log | STRING | NA | Log file |