Hi community!!!
I have a table showing relative abundance of many different bacteria. I wanted to create a correlation (Spearman) matrix with significance level (P-value) from this table using "Hmisc" package. I have imported the table as .xlsx format and created a data frame from it. After that I have created the correlation matrix using the following codes:
res2 <- rcorr(my_data, type = c("spearman"))
But in my input data table, there are many bacteria with zero (0) relative abundance. I have keep them as it is (i.e. 0). Should this result into a wrong calculation and wrong output?
Here's a short portion from my data table:
clade_name | ERR1_profile | ERR2_profile | ERR3_profile | ERR4_profile | ERR5_profile | ERR6_profile |
---|---|---|---|---|---|---|
Actinobaculum_sp_oral_taxon_183 | 0 | 0 | 0 | 0 | 0 | 0 |
Actinomyces_graevenitzii | 0 | 0 | 0 | 0 | 0 | 0 |
Actinomyces_naeslundii | 0 | 0 | 0 | 0 | 0.00269 | 0 |
Actinomyces_odontolyticus | 0 | 0 | 0.00341 | 0 | 0.03155 | 0 |
Actinomyces_oris | 0 | 0 | 0 | 0.00155 | 0.00186 | 0 |
Actinomyces_sp_HMSC035G02 | 0 | 0 | 0 | 0 | 0.0066 | 0 |
Actinomyces_sp_HPA0247 | 0 | 0 | 0 | 0 | 0 | 0 |
Actinomyces_sp_ICM47 | 0 | 0 | 0.0042 | 0 | 0 | 0 |
Actinomyces_sp_S6_Spd3 | 0 | 0 | 0 | 0 | 0 | 0 |
Actinomyces_sp_oral_taxon_181 | 0 | 0 | 0 | 0 | 0 | 0 |
Actinomyces_sp_oral_taxon_414 | 0 | 0 | 0 | 0 | 0 | 0 |
Actinomyces_turicensis | 0 | 0 | 0 | 0 | 0 | 0 |
Varibaculum_cambriense | 0 | 0 | 0 | 0 | 0 | 0 |
Aeriscardovia_aeriphila | 0.00454 | 0 | 0 | 0.00593 | 0.00257 | 0 |
Alloscardovia_omnicolens | 0 | 0 | 0 | 0 | 0 | 0 |
Thanks,
dc7