Hi all,
My dataset has been delivered as several tables.
Each observation has a unique identifier that has been randomly hashed for anonmity, however is conistent between tables - however not all identifiers are on all tables. The order is also different.
What is the best way of combining just the variables of interest from seperate tables, while aligining them with their corresponding variables as appropriate.
For example, I may want to use 3 variables from casepatient, 1 from skillsets, 4 from vss but make sure that the same observation is not produced as two different observations.
And finally, because they are random the hashed identifiers are... random. They are the variable called ï..derived_patient_id
on each table.
Some table examples: