Imputation

Figure 1 shows the tab page for the imputation job. For imputation, you need a file in VCF format or VCF.GZ format (a compressed VCF file). The following settings are required for the imputation job:

(1) Missing ID %: Removes samples with a missing rate higher than the set value (0-1).

(2) Missing SNP %: Removes SNP markers with a missing rate higher than the set value (0-1).

(3) HWE: This refers to the Threshold value for Hardy-Weinberg equilibrium filtering. If there are many zeros after the decimal point, you can use “e” for notation. For example, a value of 0.0000000001 can be expressed as 1e-10.

(4) MAF: Minor allele frequency.

(5) Burn-in  

(6) Iterations

(Figure 1)

After entering all the settings and checking the “Show progress” box, clicking the [Execute] button will bring up a pop-up terminal, displaying the imputation process as it is being performed (Figure 2).

(Figure 2)

(Figure 3)

Shown in Figure 3 is the imputation result files, which are of three types:

  1. PLINK format (*.PED, *.MAP)
  2. PLINK binary format (*.bed, *.bim, *.fam)
  3. Compressed VCF (*.vcf.gz)

Among the three genomic data formats above, the format required by QTLmax is #1. All three file formats contain the same information and can be interconverted.

One thought on “Imputation

Leave a Reply

Your email address will not be published. Required fields are marked *

PHP Code Snippets Powered By : XYZScripts.com