# compare datasets

Hi all!
I have several data sets that I want to compare with each other (`df0` with `df1`, `df1` with `df2` and `df2` with `df3`). When a change is made, a variable identifying the source of the change should be entered for each column in a new column.

Here are my sample data sets:

``````df0 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2"), "x10" = c(0,NA,2,3,4), "x11" = c(NA,0,1,2,3), "x12" = c(3,4,0,NA,NA))
df1 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2"), "x10" = c(0,NA,1,3,3), "x11" = c(NA,0,1,3,3), "x12" = c(3,2,0,NA,NA))
df2 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2", "D1"), "x10" = c(0,NA,1,3,3,NA), "x11" = c(NA,0,1,3,3,NA), "x12" = c(3,2,0,NA,NA,NA))
df3 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2", "D1"), "x10" = c(0,1,1,3,3,NA), "x11" = c(4,0,1,3,3,1), "x12" = c(3,2,0,2,1,NA), "x13" = c(0,1,NA,2,1,1))
``````

The result for the comparison of `df0` and `df1` should look like (`x` means, that there was some change): In the next step, `df1` would be compared with `df2`. If there are changes, the variable `y` should be written into the `change_x1.`-column, which would have the following result: The final comparison of `df2` with `df3` would lead to the following result (`z` indicates a change between the two data sets): Do you have any idea, how I could solve that problem in R?

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.

If you have a query related to it or one of the replies, start a new topic and refer back with a link.