Does RStudio need to be installed on all EMR nodes?

I have an AWS EMR Spark cluster with 6 nodes. I installed RStudio on the master node; do I need to install it on the rest of the nodes as well?

It depends on what you want to do. You almost certainly do not need RStudio on the other nodes. You might need R there, though, if you want R code itself to run on the nodes. If you only want to control the Spark cluster from R with a package like sparklyr, then I don't think the nodes need R, because the local R session just sends native commands to the Spark cluster.
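To make the distinction concrete, here is a rough sketch of the two cases with sparklyr. The function names, the `mtcars_demo` table name, and the `master = "yarn"` connection are my assumptions for illustration, not something from your setup. As far as I know, the first case only needs R on the node where the script runs, while `spark_apply()` ships an R function to the workers and so needs an R runtime on every node.

```r
# Sketch only: assumes sparklyr and dplyr are installed on the master node.
# Both functions take an existing sparklyr connection `sc`.

# Case 1: dplyr verbs are translated to Spark SQL and executed by Spark.
# R is only needed where this script runs (the master node).
count_by_cyl <- function(sc) {
  cars_tbl <- dplyr::copy_to(sc, mtcars, "mtcars_demo", overwrite = TRUE)
  dplyr::collect(dplyr::count(cars_tbl, cyl))
}

# Case 2: spark_apply() runs an arbitrary R function on each partition,
# so the worker nodes need R installed as well.
scale_on_workers <- function(sc) {
  cars_tbl <- dplyr::copy_to(sc, mtcars, "mtcars_demo", overwrite = TRUE)
  sparklyr::spark_apply(cars_tbl, function(df) df * 10)
}

# Hypothetical usage on the EMR master node (master = "yarn" is an assumption):
# sc <- sparklyr::spark_connect(master = "yarn")
# count_by_cyl(sc)
# sparklyr::spark_disconnect(sc)
```

If everything you do looks like the first function, installing R and RStudio on the master node alone should be enough.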

If you provide some context around what you are trying to do, others may be able to give you more color.
