big data code making

Hello to everyone. I need to help. I make big data document and I must analyze it by R studio.

lm2<-lm(PFS_1yr~Age + Sex+ PFS_1yr + IDVoxelNum_ktrans_e + VolumeNum_ktrans_e + Elongation_ktrans_e + Flatness_ktrans_e + LeastAxisLength_ktrans_e + MajorAxisLength_ktrans_e + Maximum2DDiameterColumn_ktrans_e + Maximum2DDiameterRow_ktrans_e + Maximum2DDiameterSlice_ktrans_e + Maximum3DDiameter_ktrans_e + MeshVolume_ktrans_e + MinorAxisLength_ktrans_e + Sphericity_ktrans_e + SurfaceArea_ktrans_e + SurfaceVolumeRatio_ktrans_e + VoxelVolume_ktrans_e + 10Percentile_ktrans_e + RootMeanSquared_ktrans_e + Skewness_ktrans_e + TotalEnergy_ktrans_e + Uniformity_ktrans_e + Variance_ktrans_e + Autocorrelation_ktrans_e + ClusterProminence_ktrans_e + ClusterShade_ktrans_e + ClusterTendency_ktrans_e + Contrast_ktrans_e + Correlation_ktrans_e + DifferenceAverage_ktrans_e + DifferenceEntropy_ktrans_e + DifferenceVariance_ktrans_e + Id_ktrans_e + Idm_ktrans_e + Idmn_ktrans_e + Idn_ktrans_e + Imc1_ktrans_e + Imc2_ktrans_e + InverseVariance_ktrans_e + JointAverage_ktrans_e + JointEnergy_ktrans_e + JointEntropy_ktrans_e + MCC_ktrans_e + MaximumProbability_ktrans_e + SumAverage_ktrans_e + SumEntropy_ktrans_e + SumSquares_ktrans_e + DependenceEntropy_ktrans_e + DependenceNonUniformity_ktrans_e + DependenceNonUniformityNormalized_ktrans_e + DependenceVariance_ktrans_e + GrayLevelNonUniformity_ktrans_e + GrayLevelVariance_ktrans_e + HighGrayLevelEmphasis_ktrans_e + LargeDependenceEmphasis_ktrans_e + LargeDependenceHighGrayLevelEmphasis_ktrans_e + LargeDependenceLowGrayLevelEmphasis_ktrans_e + LowGrayLevelEmphasis_ktrans_e + SmallDependenceEmphasis_ktrans_e + SmallDependenceHighGrayLevelEmphasis_ktrans_e + SmallDependenceLowGrayLevelEmphasis_ktrans_e + GrayLevelNonUniformity_ktrans_e + HighGrayLevelEmphasis_Vp_necrosis + LargeDependenceEmphasis_Vp_necrosis + GrayLevelNonUniformity_Vp_necrosis + GrayLevelNonUniformityNormalized_Vp_necrosis + , data=test1)
summary(lm2)

My data consist of 100obs. of 7683 variables. Help me to create some simple code. Multiple regression.

What is your hypothesis?

The aim of my research is find what kind of features influence on progression

Hi @Lena,

You mention this is a research project. Given my experience, the way you have stated your question and the code you supply (E.g. the variable PFS_1yr appears both as response and predictor), I would strongly advice to seek technical supervision.

The question you pose is really quite complex and requires careful thinking in terms of hypothesis definition and hypothesis generation.

So, as I said, get someone experienced to guide you and then I'd recommend the following learning ressources:

We all have to start somewhere and there is no quick-fix or short cut, so you have some hours to spend on a learning curve to climb, so happy learning :slightly_smiling_face:

1 Like

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.