Help, i am new to R

hey. i hope you guys can help me, i have had limited training in r, but have an semester project, where i have to cluster and forecast a dataset.

the information i have are
Productname
Revenue pr product
product unit, --> one to three katagories per porduct
sales date
nummer of items

all of these are pr product.

  • i hope sombody can help me, with how to setup my data and inport it the right way from excel and how to start the further analysis

best regards Nicolai

Hi Nicolai,
Welcome to the RStudio Community!

As a general rule, you will probably get your data in an excel file or a csv file, and the readr package is an excellent tool for data importing.

Once you have the data in R, you will want to check for missing data with the naniar package, and explore your data with the DataExplorer and skimr packages.

You may find that you need to make your data more tidy (https://vita.had.co.nz/papers/tidy-data.pdf ), in which case you may need to use the tidyr and dplyr packages to rearrange your data.

For forecasting, there are several choices, but if it is a time series, the tsibble and fable packages are popular.

By the way, specific questions about homework can not be answered here, but general inquiries (like this) are welcome.
Your professor would likely also appreciate it if you asked some of these questions at office hours. Give it a try.

1 Like

Wow, this sentence was gold! I have not come across these excellent packages before.
I would add janitor for initial data cleaning.

1 Like

Agreed on janitor. It is just assumed here in Ann Arbor. Sort of our local package by Sam Firke.