I am new to Spark and I wondered if a company that owns a datacenter can easily implement HDFS storage system and Spark framework on it instead of using cloud services like AWS.
In that case, does someone know any tutorial or have any tips to achieve it ?
(By the way, I use R programming language and would be interested to use R sparklyr package)
Thanks for your help