I have a dataset of around 3,000 observations and am running into a problem I've never encountered before when fitting a regression with lm().
For some reason, the regression output shows a coefficient for every distinct value of some of the independent variables instead of one coefficient per variable. Could this be a problem with the way my data is formatted?
My code:
mod1 <- lm(growth ~ connected + emp2010 + pop2010 + perc_bachelors + med_hh_income + perc_poverty, data=VA)
Output:
summary(mod1)
Call:
lm(formula = growth ~ connected + emp2010 + pop2010 + perc_bachelors +
med_hh_income + perc_poverty, data = VA)
Residuals:
Min 1Q Median 3Q Max
-114.99 -11.30 -4.18 1.16 1148.42
Coefficients: (329 not defined because of singularities)
Estimate Std. Error t value Pr(>|t|)
(Intercept) 5.377e+00 6.916e+00 0.777 0.43697
connected 2.013e+00 1.789e+00 1.125 0.26074
emp2010 -1.905e-04 2.286e-04 -0.833 0.40467
pop201010831 -6.804e+00 1.337e+01 -0.509 0.61098
pop201010901 -7.080e+00 1.410e+01 -0.502 0.61563
pop201011465 -5.283e+00 1.336e+01 -0.396 0.69248
pop201011478 -7.444e+00 1.209e+01 -0.616 0.53798
pop2010120212 2.416e-01 1.026e+01 0.024 0.98121
pop201012099 -5.148e+00 1.562e+01 -0.330 0.74168
pop201012167 2.836e+01 1.411e+01 2.011 0.04446 *
pop201012181 -6.744e+00 1.562e+01 -0.432 0.66591
pop201012419 -5.237e+00 1.305e+01 -0.401 0.68830
pop2010124587 7.550e+00 1.136e+01 0.665 0.50641
pop201012517 -7.451e+00 1.372e+01 -0.543 0.58702
pop201012572 -3.262e+00 1.370e+01 -0.238 0.81183
pop201012644 -5.182e+00 1.504e+01 -0.345 0.73038
pop201012968 -5.660e-01 1.713e+01 -0.033 0.97364
pop201013195 -5.402e+00 1.503e+01 -0.359 0.71932
pop201013342 -2.107e-01 1.305e+01 -0.016 0.98711
pop2010133647 -5.139e+00 1.061e+01 -0.485 0.62806
pop201013421 -8.256e+00 1.710e+01 -0.483 0.62931
pop2010139046 -6.279e+00 1.083e+01 -0.580 0.56226
pop201014013 5.562e+00 1.409e+01 0.395 0.69307
pop201014029 2.329e+00 1.172e+01 0.199 0.84246
pop201014653 -3.992e+00 1.336e+01 -0.299 0.76513
pop201014989 -1.613e+00 1.306e+01 -0.123 0.90172
pop201015030 -3.111e+00 1.277e+01 -0.244 0.80755
pop201015509 -2.107e+00 1.411e+01 -0.149 0.88136
pop201015819 3.358e-01 1.337e+01 0.025 0.97996
pop201015855 7.621e+00 1.307e+01 0.583 0.55979
pop201015966 -1.172e+01 1.711e+01 -0.685 0.49329
pop201016318 3.994e+01 1.372e+01 2.911 0.00364 **
pop201016406 -4.533e+00 1.372e+01 -0.331 0.74103
pop201016874 1.005e+01 1.455e+01 0.691 0.48972
pop201017205 2.383e+00 1.305e+01 0.183 0.85510
pop201017237 -8.304e+00 1.503e+01 -0.553 0.58061
pop201017472 2.163e+00 1.454e+01 0.149 0.88172
pop201017655 -1.496e+00 1.410e+01 -0.106 0.91549
pop201017704 -5.994e+00 1.252e+01 -0.479 0.63212
pop201017707 -7.481e+00 1.454e+01 -0.515 0.60682
pop201018082 -3.823e+00 1.230e+01 -0.311 0.75590
pop2010181822 -2.239e+00 1.050e+01 -0.213 0.83117
pop201018493 -3.374e+00 1.370e+01 -0.246 0.80556
pop201018643 1.915e+00 1.305e+01 0.147 0.88334
pop2010197467 -5.919e+00 1.131e+01 -0.523 0.60082
pop2010201828 -2.715e+00 1.002e+01 -0.271 0.78636
pop201020885 -5.737e+00 1.188e+01 -0.483 0.62926
pop201021136 6.708e+00 1.170e+01 0.573 0.56665
pop2010219268 -4.544e+00 1.008e+01 -0.451 0.65230
pop201022058 -6.353e+00 1.251e+01 -0.508 0.61150
pop201022217 3.561e+00 1.171e+01 0.304 0.76100
pop201022506 3.970e+00 1.370e+01 0.290 0.77207
pop201022723 -3.632e+00 1.338e+01 -0.271 0.78607
pop201022794 9.507e+00 1.408e+01 0.675 0.49972
pop201023234 -2.896e+00 1.306e+01 -0.222 0.82449
pop201023375 1.555e+00 1.305e+01 0.119 0.90514
pop201023806 7.233e+00 1.229e+01 0.589 0.55625
pop20102395 -1.274e+01 1.811e+01 -0.703 0.48192
pop201024116 -3.530e+00 1.409e+01 -0.250 0.80226
pop2010242143 -6.531e+00 1.007e+01 -0.648 0.51682
pop201024459 -4.426e+00 1.277e+01 -0.347 0.72881
pop201024641 -4.110e+00 1.044e+01 -0.394 0.69395
pop201025308 -5.236e+00 1.253e+01 -0.418 0.67614
pop201025434 -7.002e+00 1.409e+01 -0.497 0.61929
pop201025953 -5.179e+00 1.276e+01 -0.406 0.68494
pop201027449 -5.411e-01 1.278e+01 -0.042 0.96623
pop201027758 1.992e-01 1.338e+01 0.015 0.98812
pop201027844 7.790e+00 1.454e+01 0.536 0.59221
pop201028842 -3.448e+00 1.208e+01 -0.286 0.77526
pop201029005 -3.227e+00 1.075e+01 -0.300 0.76408
pop2010291653 6.079e+00 1.015e+01 0.599 0.54921
pop201029985 3.632e+01 1.307e+01 2.778 0.00550 **
pop2010300053 1.561e+01 1.001e+01 1.560 0.11891
pop2010308633 -3.610e+00 9.758e+00 -0.370 0.71146
pop201032248 1.498e+00 1.154e+01 0.130 0.89671
pop201032303 -8.757e+00 1.170e+01 -0.749 0.45411
pop201032315 -9.636e-01 1.110e+01 -0.087 0.93085
pop201032383 -1.831e+00 1.086e+01 -0.169 0.86611
pop201032730 2.365e+00 1.208e+01 0.196 0.84472
pop201032774 3.858e+00 1.138e+01 0.339 0.73470
pop201032867 1.131e+01 1.171e+01 0.967 0.33388
pop201034066 2.148e+00 1.169e+01 0.184 0.85427
pop201034762 9.621e+01 1.276e+01 7.540 6.5e-14 ***
pop201034963 2.068e+01 1.155e+01 1.791 0.07339 .
pop201035129 3.522e+00 1.207e+01 0.292 0.77046
pop201036067 2.131e+00 1.168e+01 0.182 0.85528
pop201036311 4.951e+00 1.124e+01 0.441 0.65950
pop201036610 -6.031e+00 1.251e+01 -0.482 0.62987
pop201037044 -7.172e-02 1.153e+01 -0.006 0.99504
pop2010379415 -4.520e+00 1.025e+01 -0.441 0.65922
pop20103886 -8.110e-01 1.370e+01 -0.059 0.95281
pop201041468 -4.789e+00 1.109e+01 -0.432 0.66589
pop201041496 -2.826e+00 1.252e+01 -0.226 0.82139
pop201042267 -5.670e+00 1.123e+01 -0.505 0.61362
pop2010435996 6.632e+00 9.683e+00 0.685 0.49349
pop201043787 2.388e+01 1.085e+01 2.200 0.02790 *
pop201044706 -9.230e+00 1.139e+01 -0.811 0.41762
pop201045749 -3.357e+00 1.097e+01 -0.306 0.75952
pop201047406 -1.196e+00 1.097e+01 -0.109 0.91315
pop20104779 6.554e-01 1.712e+01 0.038 0.96946
pop20105173 -1.968e+00 1.811e+01 -0.109 0.91347
pop201054174 8.029e+00 1.111e+01 0.723 0.46983
pop201054322 -3.497e+00 1.036e+01 -0.338 0.73576
pop201054860 -7.932e+00 1.125e+01 -0.705 0.48076
pop201054938 -3.218e+00 1.063e+01 -0.303 0.76212
pop20105822 -8.076e+00 1.563e+01 -0.517 0.60552
pop20105989 -8.136e+00 1.503e+01 -0.541 0.58830
pop201063147 3.224e+00 1.086e+01 0.297 0.76652
pop201064386 4.214e+00 1.073e+01 0.393 0.69444
pop201064546 3.808e-01 1.063e+01 0.036 0.97143
pop201064846 -3.877e+00 1.153e+01 -0.336 0.73666
pop20106653 -5.670e+00 1.454e+01 -0.390 0.69662
pop201067697 -3.563e+00 1.064e+01 -0.335 0.73779
pop20106873 -5.373e+00 1.561e+01 -0.344 0.73079
pop20106926 -5.360e+00 1.713e+01 -0.313 0.75431
pop20106936 -5.710e+00 1.372e+01 -0.416 0.67736
pop20106990 -5.777e+00 1.503e+01 -0.384 0.70080
pop20107039 -4.931e+00 1.630e+01 -0.302 0.76234
pop20107205 1.901e+00 1.278e+01 0.149 0.88183
pop201073201 -4.272e+00 1.063e+01 -0.402 0.68791
pop201073726 -1.527e+00 1.052e+01 -0.145 0.88466
pop20107376 -2.487e+00 1.503e+01 -0.165 0.86861
pop201074922 6.153e+00 1.097e+01 0.561 0.57489
pop201075835 -4.553e+00 1.062e+01 -0.429 0.66826
pop201082544 -6.298e+00 1.073e+01 -0.587 0.55726
pop20108549 -3.244e+00 1.503e+01 -0.216 0.82914
pop20109004 -5.171e+00 1.630e+01 -0.317 0.75103
pop201091583 4.746e+00 1.054e+01 0.450 0.65239
pop201092527 1.713e+01 1.045e+01 1.640 0.10120
pop20109328 1.310e+01 1.563e+01 0.838 0.40196
pop201095793 -3.725e+00 1.018e+01 -0.366 0.71445
pop201096633 8.718e+00 1.053e+01 0.828 0.40788
pop201096785 1.125e-02 1.073e+01 0.001 0.99916
pop20109855 -8.398e+00 1.936e+01 -0.434 0.66445
pop201099172 -7.993e-02 1.010e+01 -0.008 0.99369
perc_bachelors10.3 NA NA NA NA
perc_bachelors10.4 NA NA NA NA
perc_bachelors10.7 NA NA NA NA
perc_bachelors10.9 NA NA NA NA
perc_bachelors11.1 NA NA NA NA
perc_bachelors11.3 NA NA NA NA
perc_bachelors11.5 NA NA NA NA
perc_bachelors11.6 NA NA NA NA
perc_bachelors11.7 NA NA NA NA
perc_bachelors11.8 NA NA NA NA
perc_bachelors11.9 NA NA NA NA
perc_bachelors12.0 NA NA NA NA
perc_bachelors12.4 NA NA NA NA
perc_bachelors12.5 NA NA NA NA
perc_bachelors12.6 NA NA NA NA
perc_bachelors12.7 NA NA NA NA
perc_bachelors12.8 NA NA NA NA
perc_bachelors13.1 NA NA NA NA
perc_bachelors13.2 NA NA NA NA
perc_bachelors13.4 NA NA NA NA
perc_bachelors13.6 NA NA NA NA
perc_bachelors13.9 NA NA NA NA
perc_bachelors14.0 NA NA NA NA
perc_bachelors14.3 NA NA NA NA
perc_bachelors14.6 NA NA NA NA
perc_bachelors14.9 NA NA NA NA
perc_bachelors15.1 NA NA NA NA
perc_bachelors15.2 NA NA NA NA
perc_bachelors15.4 NA NA NA NA
perc_bachelors15.5 NA NA NA NA
perc_bachelors15.6 NA NA NA NA
perc_bachelors15.9 NA NA NA NA
perc_bachelors16.1 NA NA NA NA
perc_bachelors16.2 NA NA NA NA
perc_bachelors16.3 NA NA NA NA
perc_bachelors16.5 NA NA NA NA
perc_bachelors16.6 NA NA NA NA
perc_bachelors17.2 NA NA NA NA
perc_bachelors17.5 NA NA NA NA
perc_bachelors17.9 NA NA NA NA
perc_bachelors18.0 NA NA NA NA
perc_bachelors18.5 NA NA NA NA
perc_bachelors18.7 NA NA NA NA
perc_bachelors18.8 NA NA NA NA
perc_bachelors19.0 NA NA NA NA
perc_bachelors19.1 NA NA NA NA
perc_bachelors19.2 NA NA NA NA
perc_bachelors19.3 NA NA NA NA
perc_bachelors19.7 NA NA NA NA
perc_bachelors19.8 NA NA NA NA
perc_bachelors20.0 NA NA NA NA
perc_bachelors20.4 NA NA NA NA
perc_bachelors20.5 NA NA NA NA
perc_bachelors20.6 NA NA NA NA
perc_bachelors20.7 NA NA NA NA
perc_bachelors21.4 NA NA NA NA
perc_bachelors21.5 NA NA NA NA
perc_bachelors21.6 NA NA NA NA
perc_bachelors21.8 NA NA NA NA
perc_bachelors21.9 NA NA NA NA
perc_bachelors22.3 NA NA NA NA
perc_bachelors22.6 NA NA NA NA
perc_bachelors22.9 NA NA NA NA
perc_bachelors23.1 NA NA NA NA
perc_bachelors23.5 NA NA NA NA
[ reached getOption("max.print") -- omitted 264 rows ]
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 44.28 on 2562 degrees of freedom
(346 observations deleted due to missingness)
Multiple R-squared: 0.06076, Adjusted R-squared: 0.01163
F-statistic: 1.237 on 134 and 2562 DF, p-value: 0.03692
(I didn't include a sample of the data because there are about 200 variables, so even a single observation is too long to post with dput().)
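
Coefficient names such as pop201010831 are what lm() produces when a predictor is stored as a factor or character column, because each distinct value then becomes its own dummy variable. Below is a minimal sketch of how that could be checked and corrected. It uses a small made-up data frame called toy in place of VA, since the real data is not shown, and it assumes the affected columns contain only numeric-looking text.

```r
# Toy stand-in for VA (hypothetical values; the real data is not in the post)
set.seed(1)
toy <- data.frame(
  growth    = rnorm(6),
  connected = c(0, 1, 0, 1, 1, 0),
  pop2010   = c("10831", "10901", "11465", "11478", "120212", "12099"),
  stringsAsFactors = TRUE  # mimic a numeric column that was read in as a factor
)

# Any predictor listed as "factor" or "character" gets one dummy per level in lm()
sapply(toy, class)

# Go through as.character() first: as.numeric() on a factor returns level codes
toy$pop2010 <- as.numeric(as.character(toy$pop2010))

# pop2010 now gets a single slope coefficient
names(coef(lm(growth ~ connected + pop2010, data = toy)))
```

On the real data, sapply(VA, class) or str(VA) would show which of the six predictors are affected. If the conversion warns about NAs introduced by coercion, some entries are probably not plain numbers (for example a text missing-value code or thousands separators), which may be why the column was read as text in the first place.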