I actually managed to get somewhere using tidyr's nest() and purrr. Instead of my RAM usage killing the server, it now only goes up to 35 GB.
Is there a way to reduce RAM usage after a processing step? I can see that the data I produced is around 10 GB in size, but my session is using around 35 GB of RAM.
Using gc() doesn't really help. The only thing I've found that reduces RAM usage is to save the data to a file on the server, restart the session, load the previous datasets, run the next step, and repeat.
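For anyone who wants to try the same workaround, here is a rough sketch of one save/drop/reload cycle in base R. The function `step_one`, the object names, and the file path are all made up for illustration; swap in your own processing step. Note that rm() followed by gc() only frees objects nothing else references, and the OS may still not get the memory back, which is probably why the restart is what actually helps.

```r
# Hypothetical processing step; stands in for the real nest()/purrr work.
step_one <- function(df) {
  df$y <- df$x * 2
  df
}

input_data <- data.frame(x = 1:10)
result <- step_one(input_data)

# Save the intermediate result so it survives a session restart.
# The path is an assumption; use a real location on your server.
path <- file.path(tempdir(), "step_one.rds")
saveRDS(result, path)

# Drop every reference to the large objects, then trigger garbage collection.
rm(input_data, result)
invisible(gc())

# After restarting the session (or right away), reload only what
# the next step needs.
result <- readRDS(path)
```

Saving with saveRDS() instead of save() means you reload a single object under whatever name you pick, so nothing else gets pulled back into memory by accident.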