Hi, community! I have a dataset I'm trying to work with where my main goal is to compare rates of positive test results between different regions. I've already calculated the positive rate for each region on my own, but I'd like to be able to compare regions in a way that takes into account sample size and such. Here's what my dataset looks like:
Species..........ID#..............Site........Status
KAEL.........1821-21283......HPK.......positive
KAEL.........1821-21284......HPK.......negative
JAWE........1821-21285......HPK.......negative
KAAM........1821-21286......UUK.......negative
And so on and so forth. How should I proceed with comparing frequency of "positive" by region?