Data Mining of Drug Safety Reports

fsr91 · August 18, 2020, 1:07pm

Hello,

I am working on a large dataset for a research project. The project includes safety case reports (Adverse Event) and I am trying to apply an inclusion criteria which is:

the adverse event must come from two different sources. the sources are coded in an excel sheet as (1) and (0). To be included the adverse event must come from both sources (1 & 0) to enter into the final analysis. The data is available in excel.

What would be the appropriate codes to run on R in order to apply the criteria?

Thanks!

nirgrahamuk · August 18, 2020, 2:16pm

How do you identify an adverse event. There is an ID for that ?

fsr91 · August 18, 2020, 2:39pm

data has three variables:

Case number
Source: 0 & 1
Adverse event: for example headache, infection etc. Some adverse events are repeated many times.

I am not interested in the case number. I need to identify adverse events that came from two sources (0&1) regardless of the frequency.

Thanks!

nirgrahamuk · August 18, 2020, 2:58pm

library(tidyverse)
set.seed(42) # for reproducible random data
(example_data <- data.frame(
  case_num =1:100,
  source = c(rep(0,50),rep(1,50)),
  adverse_event = factor(sample(c(letters,LETTERS),size=100,replace=TRUE))
))

(
  result_df <- group_by(example_data,
                        adverse_event) %>%
    summarise(total_cases  = n(),
              both_sources = sum(source==0) > 0 & sum(source==1) > 0 )
)

system · September 8, 2020, 2:58pm

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.

If you have a query related to it or one of the replies, start a new topic and refer back with a link.