Hi All,
I wanted to conduct some exploratory analysis of my data. In particular, I require to group my data according to a label (which I have labelled 'distribution'), but I wondered how this was carried out?
For example, would anyone be able to show me would to group observations which have a score that fall into a distribution/label of two?
And thereafter, when it is grouped, would it be possible to explain how it should be stored to conduct some exploratory analysis on it, such as simple descriptive stats etc?
Your help would be very much appreciated. Example data is given:
Score <- c("0.125", "0.678", "0.999", "0.342", "0.621", "0.912", "0.888", "0.755", "0.722", "0.545")
Distribution <- c("1", "2", "3", "2", "2", "3", "3","2", "2", "2")
df <- data.frame(Score, Distribution)
print(df)
Score Distribution
1 0.125 1
2 0.678 2
3 0.999 3
4 0.342 2
5 0.621 2
6 0.912 3
7 0.888 3
8 0.755 2
9 0.722 2
10 0.545 2