I have data on bee viruses that I want to compare between groups of bees from two site types. There are nine sites: four of one type and five of the other.

The data are closer to normal when log transformed, so a log transformation seems appropriate. However, there are lots of zeros in the data, and when I log transform them, they become "-Inf". This becomes a problem when I try to fit a GLM to the viral data, with virus ~ site type, which was one idea about how to analyze it.
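
To make the zero problem concrete, here is a minimal sketch in Python. The values are made up for illustration and are not my real data, and `safe_log` is a hypothetical helper name. It shows zeros turning into -inf under a plain log, next to log(1 + x), which I understand is one commonly suggested workaround; I am not sure it is the right choice here.

```python
import math

# Hypothetical viral loads (NOT real data); zeros mean no virus was detected.
loads = [0.0, 0.0, 120.0, 3500.0]

def safe_log(x):
    """Natural log that returns -inf for zero instead of raising ValueError."""
    return math.log(x) if x > 0 else float("-inf")

# Plain log transform: every zero becomes -inf, which a model cannot use.
logged = [safe_log(x) for x in loads]

# log(1 + x): zeros map to 0 and all values stay finite.
# Assumption: adding 1 is sensible on the scale of the measurements.
shifted = [math.log1p(x) for x in loads]

print(logged)
print(shifted)
```

The shifted values are all finite, so the model would run, but the size of the added constant is arbitrary and may affect the result.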

The other idea was to run an ANOVA with a linear contrast. I'm looking for input on using a GLM vs. an ANOVA with a linear contrast. How do we decide which is appropriate? And how do you deal with zeros in log-transformed data?