Hello everyone!
I'm trying to use Rstudio to analyze the reviews of a certain product.
I have already broken down the sentences into words in R, so each word of a sentence is in a separate column right now.
Problem is that it is filled with stop words ( of, in, a, etc), so I tried to use the anti join feature to get rid of them.
Here is how I tried to put the useful words of the reviews into a new dataset:
Cleanwords <- reviews %>% anti_join(my_stop_words)
When I tried to do this, I got this error message:
Error: by must be supplied when x and y have no common variables.
i use by = character()` to perform a cross-join.
So I tried it like this:
Cleanwords <- reviews %>% anti_join(my_stop_words, by =c("word", "stopword"))
But then I got another error message:
Error: Join columns must be present in data.
x Problem with stopword.
Can you help me with this?
Thanks in advance!