Hi all,
I just finished a round of data collection for a new study, and I'm having trouble joining the data from two surveys so that each participant's responses are linked. The first survey collected participant demographics and some other info; the second collected the study data itself, which (depending on condition) may have been completed days later. De-identified toy versions are below:
part_1 <- tibble::tribble(
~end_date_1, ~consent_name, ~consent_email, ~diff_check, ~age, ~race, ~sex, ~ethnicity, ~time,
"11/12/2020 12:06", "Addam Marbrand", "Lannister", 2L, 18L, 6L, 2L, 1L, 2L,
"11/14/2020 9:36", "Aegon Frey (Jinglebell)", "None", 2L, 18L, 2L, 2L, 2L, 1L,
"11/14/2020 15:10", "Aegon Targaryen", "House Targaryen", 2L, 18L, 6L, 1L, 2L, 2L,
"11/14/2020 18:56", "Adrack Humble", "House Greyjoy", 2L, 21L, 1L, 1L, 1L, 2L,
"11/14/2020 19:29", "Aemon Costayne", "Lannister", 2L, 19L, 6L, 2L, 1L, 2L,
"11/14/2020 20:24", "Aemon Estermont", "Baratheon", 1L, 19L, 1L, 1L, 2L, 1L,
"11/15/2020 10:27", "Aemon Targaryen (son of Maekar I)", "Night's Watch", 1L, 21L, 1L, 2L, 1L, 2L,
"11/15/2020 12:00", "Aenys Frey", "None", 2L, 18L, 2L, 2L, 2L, 1L,
"11/15/2020 12:41", "Aeron Greyjoy", "House Greyjoy", 2L, 23L, 1L, 2L, 2L, 2L,
"11/15/2020 17:06", "Aethan", "Night's Watch", 1L, 18L, 6L, 1L, 1L, 1L,
"11/15/2020 18:43", "Aggar", "House Greyjoy", 2L, 21L, 6L, 1L, 1L, 2L,
"11/15/2020 18:58", "Aggo", "House Targaryen", 2L, 19L, 1L, 2L, 1L, 2L,
"11/15/2020 19:28", "Alan of Rosby", "Night's Watch", 1L, 20L, 1L, 2L, 2L, 2L,
"11/15/2020 19:40", "Alayaya", "None", 1L, 18L, 1L, 2L, 1L, 2L,
"11/15/2020 21:21", "Albar Royce", "Arryn", 1L, 18L, 6L, 2L, 1L, 1L,
"11/16/2020 3:36", "Albett", "Night's Watch", 2L, 19L, 6L, 2L, 1L, 2L,
"11/16/2020 10:51", "Alebelly", "House Stark", 1L, 18L, 1L, 2L, 2L, 2L,
"11/16/2020 11:12", "Alerie Hightower", "House Tyrell", 2L, 18L, 1L, 2L, 2L, 1L,
"11/16/2020 12:49", "Alesander Staedmon", "Baratheon", 2L, 18L, 1L, 2L, 2L, 2L,
"11/16/2020 12:22", "Alester Florent", "Baratheon", 1L, 20L, 1L, 2L, 2L, 2L,
"11/16/2020 13:32", "Alia of Braavos", "None", 3L, 18L, 1L, 2L, 2L, 1L,
"11/16/2020 15:13", "Alla Tyrell", "House Tyrell", 2L, 18L, 1L, 2L, 1L, 2L
)
part_2 <- tibble::tribble(
~start_date, ~end_date, ~duration_part2, ~variable_1, ~variable_2, ~variable_3, ~variable_4, ~variable_5, ~variable_6, ~variable_7, ~variable_8, ~variable_9, ~variable_10, ~variable_11, ~variable_12, ~variable_13, ~variable_14, ~variable_15, ~variable_16, ~variable_17, ~variable_18, ~variable_19, ~variable_20, ~variable_21, ~variable_22, ~variable_23, ~variable_24, ~variable_25, ~variable_26, ~variable_27, ~variable_28, ~variable_29, ~variable_30, ~variable_31, ~name_validation, ~email_validation, ~time,
"11/12/2020", "11/12/2020", 73L, 0L, 2L, 50L, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Aenys Frey", "None", NA,
"11/14/2020", "11/14/2020", 97L, 0L, 2L, 40L, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Aeron Greyjoy", "House Greyjoy", NA,
"11/14/2020", "11/14/2020", 108L, 1L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Aethan", "Night's Watch", NA,
"11/14/2020", "11/14/2020", 61L, 0L, 2L, 40L, 1L, 1L, 2L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Aggar", "House Greyjoy", NA,
"11/15/2020", "11/15/2020", 39L, 0L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Aggo", "House Targaryen", NA,
"11/15/2020", "11/15/2020", 22L, 0L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Alan of Rosby", "Night's Watch", NA,
"11/15/2020", "11/15/2020", 66L, 1L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Alayaya", "None", NA,
"11/15/2020", "11/15/2020", 61L, 1L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Albar Royce", "Arryn", NA,
"11/15/2020", "11/15/2020", 796L, 0L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Albett", "Night's Watch", NA,
"11/15/2020", "11/15/2020", 141L, 1L, 1L, NA, 1L, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Alebelly", "House Stark", NA,
"11/15/2020", "11/15/2020", 65L, 1L, 2L, 60L, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Alerie Hightower", "House Tyrell", NA,
"11/15/2020", "11/15/2020", 58L, 1L, 1L, NA, 0L, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, "Alesander Staedmon", "Baratheon", NA
)
You can see that part 1 has more responses than part 2, because not everyone who signed up came back to finish the study. I need a way to merge these datasets so that the result has every column from both, with rows matched on name and email (consent_name/consent_email in part 1, name_validation/email_validation in part 2), and with any name/email combo from part 1 that does not appear in part 2 dropped.
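My best guess is that what I'm describing is an inner join on the two key pairs. Below is a minimal sketch of that idea, assuming dplyr is available. The tables `p1` and `p2` are hypothetical trimmed-down copies of the toy data (a few rows and columns only), and the `_part1`/`_part2` suffixes are just my choice for telling apart the `time` column that exists in both surveys. It also assumes names and emails were typed identically in both surveys, which may not hold in the real data.

```r
library(dplyr)

# Hypothetical trimmed-down copies of the toy data (subset of rows/columns)
p1 <- tibble::tribble(
  ~consent_name,    ~consent_email,  ~age, ~time,
  "Addam Marbrand", "Lannister",     18L,  2L,
  "Aenys Frey",     "None",          18L,  1L,
  "Aeron Greyjoy",  "House Greyjoy", 23L,  2L,
  "Aethan",         "Night's Watch", 18L,  1L
)
p2 <- tibble::tribble(
  ~duration_part2, ~variable_1, ~name_validation, ~email_validation, ~time,
  73L,             0L,          "Aenys Frey",     "None",            NA,
  97L,             0L,          "Aeron Greyjoy",  "House Greyjoy",   NA,
  108L,            1L,          "Aethan",         "Night's Watch",   NA
)

# Map each part 1 key column to its part 2 counterpart
keys <- c("consent_name" = "name_validation",
          "consent_email" = "email_validation")

# inner_join keeps only rows whose name/email pair appears in both tables;
# suffix disambiguates the shared non-key column `time`
merged <- inner_join(p1, p2, by = keys, suffix = c("_part1", "_part2"))

# anti_join lists the part 1 participants with no match in part 2,
# which is a handy check on who gets dropped
dropped <- anti_join(p1, p2, by = keys)
```

If this is the right direction, the same call should carry over to the full tables by swapping in `part_1` and `part_2`. Exact matching would silently drop anyone who typed their name or email differently the second time, so inspecting `dropped` seems worthwhile before trusting the result.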
Anyone have any suggestions?