If you already have data frames with the words, there is no need to make new ones; I only made mine to have something to work with. If you cannot get the method to work, please post a small sample of your data. You can use the output of the command
dput(head(reviews))
Please place lines containing only three backticks just before and after the pasted output, like this:
```
your output here
```
Using the data you posted and a stop_words data frame that I made by hand, I can filter out the words "to" and "a" with this code. I believe you have a stop_words data frame already, so you should be able to use it by changing the column name in the by argument of anti_join from stopword to whatever it is in your data frame.
library(dplyr)  # provides anti_join()
stop_words <- data.frame(stopword = c("to", "a"))
review <- structure(list(word = c("would", "be", "nice", "to", "have",
"a")), row.names = c(NA, 6L), class = c("tbl_df", "tbl", "data.frame"))
anti_join(review, stop_words, by = c(word = "stopword"))
# A tibble: 4 x 1
word
<chr>
1 would
2 be
3 nice
4 have
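If the column in your own stop word data frame has a different name, only the right-hand side of the by argument needs to change. Here is a minimal sketch, assuming a hand-made data frame called my_stops with a column called term; both names are made up for illustration and are not from your data.

```r
library(dplyr)

review <- data.frame(
  word = c("would", "be", "nice", "to", "have", "a"),
  stringsAsFactors = FALSE
)

# Hypothetical stop word table whose column is named "term" instead of "stopword"
my_stops <- data.frame(term = c("to", "a"), stringsAsFactors = FALSE)

# Left side of the pair is the column in review, right side is the column in my_stops
filtered <- anti_join(review, my_stops, by = c("word" = "term"))
filtered$word
# [1] "would" "be"    "nice"  "have"
```

If both data frames happen to use the same column name, by = "word" on its own should be enough.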
My next step would be to analyze the text by the first 2 or 3 words of each sentence, just to get a clear picture of the nature of these reviews (sad, disappointed, satisfied, etc.).
I'll try to do that on my own tomorrow, but I have a feeling that I'll be back with another question.
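For pulling out the first 2 or 3 words of each sentence, one possible starting point in base R is sketched below. It assumes the sentences are stored as plain character strings and that words are separated by whitespace; the sentences vector and the first_n_words helper are made up for illustration, so treat this as a rough idea and not a finished method.

```r
# Hypothetical review sentences; replace with your own text column
sentences <- c("would be nice to have a",
               "really disappointed with the battery")

# Return the first n whitespace-separated words of each sentence as one string
first_n_words <- function(x, n) {
  vapply(
    strsplit(x, "\\s+"),
    function(w) paste(head(w, n), collapse = " "),
    character(1)
  )
}

first_n_words(sentences, 2)
# [1] "would be"            "really disappointed"
first_n_words(sentences, 3)
# [1] "would be nice"           "really disappointed with"
```

Sentences shorter than n words are returned whole, since head() simply stops at the end of the vector.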