create_1000G downloads the 1000 Genomes Phase III panel (hg19) in PLINK format and applies quality control so that it can be used to compute PGS with rapidopgs_multi.
Given the size of the files, running this function can take a long time, depending
on connection speed and server status. We also recommend ensuring that at least
60GB of free disk space is available.
create_1000G(
directory = "ref-data",
remove.related = TRUE,
qc.maf = 0.01,
qc.hwe = 1e-10,
qc.geno = 0,
autosomes.only = TRUE
)
directory: a string indicating the directory to download the panel to.
remove.related: a logical stating whether related individuals should be removed. Default TRUE.
qc.maf: a numeric setting the MAF threshold below which variants will be removed. Default 0.01.
qc.hwe: a numeric indicating the threshold for the Hardy-Weinberg exact test p-value, below which variants will be removed. Default 1e-10.
qc.geno: a numeric setting the maximum missing call rate allowed for variants. Default 0.
autosomes.only: a logical. If FALSE, the X and Y chromosomes are included too. Default TRUE.
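Because the download is long, it can be worth checking the QC settings before starting. The following is a minimal sketch, not part of the package: check_qc_args is a hypothetical helper, the accepted ranges are assumptions based on what each quantity means (a MAF cannot exceed 0.5; a p-value and a missing call rate lie between 0 and 1), and the non-default values are purely illustrative. The call assumes create_1000G is available from an installed RapidoPGS package and is guarded so that nothing is downloaded unless run_download is set to TRUE.

```r
# Hypothetical helper (not part of RapidoPGS): sanity-check QC thresholds
# before launching a long download.
check_qc_args <- function(qc.maf = 0.01, qc.hwe = 1e-10, qc.geno = 0) {
  # TRUE if x is a single non-missing number in [0, upper]
  in_range <- function(x, upper) {
    is.numeric(x) && length(x) == 1 && !is.na(x) && x >= 0 && x <= upper
  }
  if (!in_range(qc.maf, 0.5)) stop("qc.maf must be a single number in [0, 0.5]")
  if (!in_range(qc.hwe, 1))   stop("qc.hwe must be a single p-value in [0, 1]")
  if (!in_range(qc.geno, 1))  stop("qc.geno must be a single rate in [0, 1]")
  invisible(TRUE)
}

# Illustrative, non-default thresholds
check_qc_args(qc.maf = 0.05, qc.hwe = 1e-6, qc.geno = 0.01)

# Guarded call: set run_download to TRUE only when ready to fetch the panel.
run_download <- FALSE
if (run_download && requireNamespace("RapidoPGS", quietly = TRUE)) {
  RapidoPGS::create_1000G(
    directory = "ref-data",
    remove.related = TRUE,
    qc.maf = 0.05,
    qc.hwe = 1e-6,
    qc.geno = 0.01,
    autosomes.only = TRUE
  )
}
```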
bed, fam and bim files for each chromosome in the chosen directory.
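After a long or interrupted run, one way to confirm that every file set is complete is to check that each file stem has all three PLINK files. The sketch below uses base R only; find_incomplete_plink_sets is a hypothetical helper, and it makes no assumption about how the function names the per-chromosome files. The demonstration uses empty placeholder files in a temporary directory rather than real panel data.

```r
# Hypothetical helper: return the file stems in `directory` that are missing
# at least one of the three PLINK files (.bed, .bim, .fam).
find_incomplete_plink_sets <- function(directory) {
  files <- list.files(directory, pattern = "\\.(bed|bim|fam)$")
  stems <- unique(tools::file_path_sans_ext(files))
  complete <- vapply(stems, function(s) {
    all(paste0(s, c(".bed", ".bim", ".fam")) %in% files)
  }, logical(1))
  stems[!complete]
}

# Demonstration with empty placeholder files: set "a" is complete,
# set "b" lacks its .fam file.
demo_dir <- tempfile("plink-demo")
dir.create(demo_dir)
invisible(file.create(
  file.path(demo_dir, c("a.bed", "a.bim", "a.fam", "b.bed", "b.bim"))
))
find_incomplete_plink_sets(demo_dir)
# returns "b"
```

On a real run, pass the same directory that was given to create_1000G; an empty character vector means every stem has its bed, bim and fam files.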
if (FALSE) { # \dontrun{
create_1000G()
} # }