I have a problem during the selecting the rows of a dataframe (composed by 138 milions of rows). I want to select all that rows that satisfy a particular condition:
tcga_tumors_copy <- tcga_tumors_copy[(tcga_tumors_copy$source %in% edges_conjugateABC$A & tcga_tumors_copy$dest %in% edges_conjugateABC$B),]
the number of rows in edges_conjugate ABC is around 5 milions of rows. After applying this condition, the tcga_tumors_copy is composed by 8 milions of rows.... that isn't possible..... How can I select the rows of a dataframes, based on a multiple conditions?
Thx in advance,