We all know that data science is (at least) 1/2 analysis, and 1/2 communicating the analysis (and this is one of the reasons R + RStudio is such a power combo in DS). Sometimes my problem is not writing the code for a certain plot, but choosing among different plots. Of course I'm not talking about relatively trivial aspects such as which colors to use in the plot, but thing like:
- would this be more readable using
- should I
dodgethe bars of my bar plot? Or should I use transparency?
- I have counts of the levels of a factor variable, for two different populations with different sample size: should I show the raw data (counts), so that it's clear which population has more individuals? Or should I normalize the data and only show the proportions?
In other words, these are data visualization questions, but they're not about (or not mainly about)
ggplot2 code. I may already know how to code all the alternative plots I have in mind, but I'd like someone to give me their opinion on which plot is more clear and informative.
Can I ask such questions in this community, and if so, which is the correct category? Thanks