How do I run a regression when looking at a specific subset of my sample?
For example, I want to study the trends of diabetes in my sample given that they have chronic kidney disease (CKD). Because the majority of my sample that has CKD also has diabetes (compared to any other other traditional risk factors for CKD), I want to study this subset of people and see what are potential risk factors for them. I want to examine what the trends are associated with (diabetes | CKD).
I want to assess how variables like wealth, education level, residence, sex, etc impact the prevalence of the diabetes prevalence (given that they have CKD).
How can I amend this such that my regression is only assessing these variables for those who have diabetes and CKD? Am I going to have to make another data frame?