I have a data-frame with 300k rows i wish to dedup.
A duplicate is considered based on a pair. So for example in the below, I would only want the first instance of the duplicate. In this case A-C is the same thing as C-A. Does anyone know a way of identifying this?
library(tidyverse) mydf <- tibble::tribble( ~Col_A, ~Col_B, "A", "C", "B", "B", "A", "C", "A", "C", "C", "A" )
My final result here should be something like
Thanks very much for your time