I have a simple histogram of the frequency of gene expression values for a dataset of 100 datapoints.
I want to find the expression value (x axis) that corresponds to the top 20% of the data points. Or in this case, I want the break number that is associated with datapoint 80 when organized in the histogram
Ive been doing this manually by adding the breaks starting with the last break until I hit reach a count of 20, and then looking to see which break this is in. Sometimes because of the breaks my count is not exactly 20 but thats fine.
Does this make sense? Is there an easy way to ask this hist function this question? Even if it was as simple as "Which break has #80"