create_1000G downloads and gets 1000 Genomes Phase III panel (hg19) in PLINK format, and apply quality control for being used to compute PGS using rapidopgs_multi. Given the size of the files, running this function can take long, depending on broadband speed and server status. We also recommend to ensure that there is at least 60GB free space available in disk.

create_1000G(
  directory = "ref-data",
  remove.related = TRUE,
  qc.maf = 0.01,
  qc.hwe = 1e-10,
  qc.geno = 0,
  autosomes.only = TRUE
)

Arguments

directory

a string indicating the directory to download the panel

a logical stating if related individuals should be removed. Default TRUE.

qc.maf

a numeric to set the MAF threshold for variants to be removed. DEFAULT 0.01

qc.hwe

a numeric indicating the threshold for Hardy-Weinberg exact test p-value, below which variants will be removed. DEFAULT 1e-10.

qc.geno

a numeric to set maximum missing call rates for variants. DEFAULT = 0.

autosomes.only

If FALSE, it will include X and Y chromosomes, too.

Value

bed, fam and bim files for each chromosome in the chosen directory.

Author

Guillermo Reales

Examples

if (FALSE) { # \dontrun{
create_1000G()
} # }