I have a long data set with about 400k rows, and I am trying to extracting values based on multiple criteria.
On each day of the year, checklists with unique Ids are used to capture species presence.
How can I create a column with values that describe how many checklists did Species Aa appear on for each day of the year?
Species ChecklistID DayofYear Aa xyz 1 Bb xyz 1 Cc xyz 1 Aa swa 1 Bb swa 1 Ee dew 2 Aa gre 3 Cc gre 3 Ee fgv 4 Aa hyt 5
The output I am expecting would look like this:
DayofYear ChecklistsWithSpeciesAa 1 2 2 0 3 1 4 0 5 1
Thanks for your help!