Hi, I'm trying to do some analysis on my Facebook Messages data, however I'm running into some encoding issues. Facebook allows you to download your data as .json files. I want to analyze some of the emojis and text used, but am having difficulties converting them. For example, in the raw .json file, I see:
{
"messages": [
{
"sender_name": "Person 1",
"timestamp_ms": 1572480435138,
"content": "I\u00e2\u0080\u0099m assuming..",
"type": "Generic"
}
]
}
When reading it into R, using jsonlite::fromJSON, it appears in the console like:
library(jsonlite)
dat <- fromJSON('example.json')
dat$messages$content
> dat$messages$content
[1] "Iâ\u0080\u0099m assuming.."
Is there an easy way to convert these weird characters. This one for example looks like it should be an apostrophe
I'm on a Windows machine using Rstudio if that matters at all