Issue: I have a large summary-statistics data set of 15 million SNPs with Z-scores that was shared by a former colleague (whom I unfortunately cannot reach by email). I would like to reduce the 15 million rows to just the statistically significant SNPs so that I can cross-check them against a set of two thousand SNPs of interest.
I would really appreciate any suggestions or guidance on the following:
Is there a package that can convert Z-scores to P-values?
Are there packages that would help me efficiently filter the data down to the significant SNPs and compare two columns from two different data.frames?