Summing certain parts of columns in RStudio

JennaMurray · March 23, 2023, 7:55pm

How is the number of cases / number of controls distributed with respect to the age categories?

JennaMurray · March 23, 2023, 7:56pm

How would I add up rows 1-15 of the "ncontrols" column?

jrkrideau · March 23, 2023, 8:30pm

Welcome to the forum,

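Screenshots are not very useful, but here is a simple example.

# Build a 20-row example data frame: "aa" is numeric, "bb" holds letters
dat1 <- data.frame(aa = sample(1:50, 20), bb = LETTERS[1:20])
# Sum rows 1 to 15 of column "aa"
sum(dat1[1:15, "aa"])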

You might find this useful: FAQ

JennaMurray · March 23, 2023, 8:37pm

What do the "aa" and "bb" represent?

JennaMurray · March 23, 2023, 8:40pm

I have named my data set Q1.data, as it is question 1's data.
I want to sum the first 15 rows of data in the "ncontrols" column, which is the 5th column.

So should my code read:
dat1 <- data.frame(aa = Q1.data(1:15, 5), bb = LETTERS[1:20] (???)) (???)
sum(dat1[1:15, "aa"])

jrkrideau · March 23, 2023, 10:59pm

They are column names, corresponding to "agegp" or "ncontrols", etc.

No, let's say your data set is a data.frame called "mydata". All you need to do is

sum(mydata[1:15, "ncontrols"])

In English, this means we are selecting the column "ncontrols" and taking rows from 1 to 15 and summing those 15 numbers.
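To try that indexing without the real data, here is a small sketch on a made-up data frame. The name mydata, the column values and the column order are all assumptions for illustration; only the column name "ncontrols" comes from the question. It also shows selecting the column by position, since the question mentions the column number.

```r
# Hypothetical stand-in for the poster's data; all values are made up.
mydata <- data.frame(
  agegp     = rep(c("25-34", "35-44"), times = c(15, 5)),
  ncases    = rep(0:1, times = c(15, 5)),
  ncontrols = 1:20
)

# Sum rows 1 to 15 of the "ncontrols" column, selected by name.
by_name <- sum(mydata[1:15, "ncontrols"])

# Same rows, selecting the column by position instead.
# "ncontrols" is the 3rd column in this toy data; in the poster's
# data it is reported to be the 5th, so the index would be 5 there.
by_position <- sum(mydata[1:15, 3])

# Equivalent form: extract the column with $ first, then take the rows.
by_dollar <- sum(mydata$ncontrols[1:15])

print(by_name)  # 120, because 1 + 2 + ... + 15 = 120
```

If the column could contain missing values, sum(..., na.rm = TRUE) skips them instead of returning NA.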

You might find the Wrangling and Data chapters of Preceptor’s Primer a handy intro.

williaml · March 23, 2023, 11:12pm

Hi, for a reproducible example instead of a screenshot:

FAQ: How to do a minimal reproducible example ( reprex ) for beginners Guides & FAQs

A minimal reproducible example consists of the following items:

A minimal dataset, necessary to reproduce the issue
The minimal runnable code necessary to reproduce the issue, which can be run on the given dataset, and including the necessary information on the used packages.

Let's quickly go over each one of these with examples:

Minimal Dataset (Sample Data)

You need to provide a data frame that is small enough to be (reasonably) pasted on a post, but big enough to reproduce your issue. Let's say, as an example, that you are working with the iris data frame

head(iris)
#> Sepal.Length Sepal.Width Petal.Length Petal.Width Species
#> 1 5.1 3.5 1.4 0.…

For grouping and summarising:

Summarise each group down to one row — summarise • dplyr (tidyverse.org)
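As a rough sketch of what "grouping and summarising" could look like for the original question (cases and controls per age category), here is a base R version using aggregate() rather than dplyr, so it runs without extra packages. The data frame toy and its values are made up; the column names agegp, ncases and ncontrols are assumed from the thread.

```r
# Hypothetical data: several rows per age group, values made up.
toy <- data.frame(
  agegp     = c("25-34", "25-34", "35-44", "35-44", "45-54"),
  ncases    = c(0, 1, 2, 0, 4),
  ncontrols = c(10, 20, 30, 40, 50)
)

# Total cases and controls within each age group.
totals <- aggregate(cbind(ncases, ncontrols) ~ agegp, data = toy, FUN = sum)
print(totals)

# dplyr equivalent (assuming the dplyr package is installed):
# toy |>
#   dplyr::group_by(agegp) |>
#   dplyr::summarise(ncases = sum(ncases), ncontrols = sum(ncontrols))
```

This returns one row per age group, so there is no need to work out by hand which row numbers belong to which group.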
