I have a dummy dataset that I need to test against before our database moves into production. The data comes from a survey we are sending to 3,000 people, with a gift card offered for completing it.
Is there an R package that can help me find invalid responses (people who just picked answers at random to get to the gift card) and identify possible duplicates (people trying to get more than one gift card)? The questions range from Y/N to 5-point scales to free text. I'm pretty sure I could write a SQL query to find some of these, but I want the highest possible confidence that at least 98% of the participants on the list I provide are distinct and valid.
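To make the two checks concrete, here is a minimal sketch of the simple rules I could already write myself. It is in Python only for illustration, and the rows and field names (`id`, `email`, `scale`) are made up, not our real schema. It flags respondents who gave the same answer to every scale item and groups rows that share an email address after trimming whitespace and lowercasing.

```python
from collections import defaultdict

# Hypothetical dummy rows; the field names are assumptions for illustration.
responses = [
    {"id": 1, "email": "Ann@Example.com ", "scale": [3, 3, 3, 3, 3]},
    {"id": 2, "email": "bob@example.com", "scale": [1, 4, 2, 5, 3]},
    {"id": 3, "email": "ann@example.com", "scale": [2, 5, 4, 1, 3]},
]

def straight_liners(rows):
    """Return ids of respondents who gave one identical answer to every scale item."""
    return [r["id"] for r in rows if len(set(r["scale"])) == 1]

def duplicate_groups(rows):
    """Group ids by normalized email and return the groups with more than one id."""
    groups = defaultdict(list)
    for r in rows:
        # Normalize so that case and stray whitespace do not hide a match.
        groups[r["email"].strip().lower()].append(r["id"])
    return [ids for ids in groups.values() if len(ids) > 1]

print(straight_liners(responses))   # ids with no variation across scale items
print(duplicate_groups(responses))  # ids that share a normalized email
```

Rules like these only catch the obvious cases: a random clicker whose answers vary, or a duplicate using a different email, would pass both checks, which is why I am hoping a package can do better.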
Thanks, and have a great day.