I’m teaching a data science course to complete R novices (first year undergraduate students) and primarily using the tidyverse toolkit. If loading an external dataset, we user
readr so the result object is a tibble. We also use
dplyr heavily so datasets loaded from other packages that might not have been tibbles get converted to tibbles along the way. I’m not interested in going through the details of how an object of class
tibble differs from a
data.frame object (but if you think one should, I’d love to hear your thoughts). However I catch myself using the term “data frame” sometimes in class, and I think I really just mean a dataset/data matrix as opposed to the
data.frame class, and I’m using this term out of habit.
What term do others teaching the tidyverse use to refer to “data rectangles”? Do you exclusively use the word tibble? I’m trying to train myself to say “dataset” when I just want to talk about a data rectangle, and tibble only when it’s necessary to discuss the class of the object, but I’d love to hear others’ thoughts on this.
(As an aside, I’m still trying to learn to not say "we subset with
filter" and instead say “we filter the dataset”, because I don’t want to use a word that’s also the name of another function that works differently, but old habits die hard… I find that precise wording helps students google things better, hence my semi-obsession with it.)