I scraped a number of webpages and stored them in a list-column of a tibble.
How can I save that data so that I can work on it later without re-scraping everything? I couldn't find a way to do it: the usual write_rds doesn't seem to save the underlying xml objects stored in the dataframe. When I load the RDS, the list-col contains only empty values, "list(node = <pointer: (nil)>, doc = <pointer: (nil)>)", instead of the actual html code.
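Here is a minimal sketch that reproduces what I'm seeing. It assumes the pages are xml2 documents (as returned by read_html); the inline HTML string, the example.com URL, and the column names are placeholders I made up for illustration, not my real data.

```r
library(tibble)
library(readr)
library(xml2)

# Stand-in for one scraped page; the real column was built from read_html(url).
# The URL and the markup below are placeholders.
pages <- tibble(
  url  = "https://example.com",
  html = list(read_html("<html><body><p>hello</p></body></html>"))
)

# Round-trip the tibble through an RDS file
path <- tempfile(fileext = ".rds")
write_rds(pages, path)
pages_back <- read_rds(path)

# The list-column is still there, but each element now only holds
# two external pointers that no longer point at a parsed document
str(unclass(pages_back$html[[1]]))
```

The tibble itself comes back with the right shape; only the contents of the html column are lost.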
Is there an alternative method that would let me save this data properly?
Unfortunately, the result is the same.
When I load back the dataframe I stored using save(), the list-col is filled with "list(node = <pointer: (nil)>, doc = <pointer: (nil)>)" and the actual data is missing.