A data.table containing a subset of Michailidou et al., 2017 breast cancer summary statistic dataset, in hg19 build. This dataset is freely available in GWAS catalog (see link below). We used "chromosome", "base_pair_location columns", removed unnecessary and all-missing columns, and took a random sample of 100,000 SNPs without replacement.

michailidou19

Format

A data.table object containing 100,000 SNPs

SNPID, CHR, BP, REF, ALT, ALT_FREQ, BETA, SE, P
SNPID

rsids, or SNP ids

CHR

chromosome

BP

base position, in hg38

REF

reference, or non-effect allele

ALT

alternative, or effect allele

ALT_FREQ

effect allele frequency

BETA

beta, log(OR), or effect size

SE

standard error of beta

P

p-value