Tidymodel (Recipe package) - Number to factor

Well, I've read the function reference for step_num2factor and didn't figured it out how to use it properly, honestly.

price_recipe <-
     recipe(SalePrice ~ . , data = train_raw) %>%
step_num2factor(MSSubClass, levels  = names_vector)

temp_rec <- prep(price_recipe, training = train_raw, strings_as_factors = FALSE) # temporary recipe
temp_data <- bake(temp_rec, new_data = all_raw) # temporary data

class(all_raw$MSSubClass)
# > col_double() 

MSSubClass: Identifies the type of dwelling involved in the sale.

    20	1-STORY 1946 & NEWER ALL STYLES
    30	1-STORY 1945 & OLDER
    40	1-STORY W/FINISHED ATTIC ALL AGES
    45	1-1/2 STORY - UNFINISHED ALL AGES
    50	1-1/2 STORY FINISHED ALL AGES
    60	2-STORY 1946 & NEWER
    70	2-STORY 1945 & OLDER
    75	2-1/2 STORY ALL AGES
    80	SPLIT OR MULTI-LEVEL
    85	SPLIT FOYER
    90	DUPLEX - ALL STYLES AND AGES
   120	1-STORY PUD (Planned Unit Development) - 1946 & NEWER
   150	1-1/2 STORY PUD - ALL AGES
   160	2-STORY PUD - 1946 & NEWER
   180	PUD - MULTILEVEL - INCL SPLIT LEV/FOYER
   190	2 FAMILY CONVERSION - ALL STYLES AND AGES

The obs are saved as 20,30,40.... 190 and I want to transform to names ( or even the same numbers but as unordered factors)

The complete dataset is provided by kaggle at:

Before putting the data into a recipe, I suggest that you use case_when() inside of a mutate().

There is a clean version of these data in the AmesHousing data.

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.