So I was posed this question:
(a) The training set contains 1000 observations on 7 covariates, with the last (the 8th) column containing a continuous response variable. Predict the response variable from the covariates.
(b) The test set contains a further 500 observations on the 7 covariates. Provide predictions of the response using the model you chose in part (a).
I'm not sure if I'm doing this correctly. I've read in the .csv files and run a linear regression. Here's what I've been trying:
    train.lm <- lm(y ~ ., data = train)
    summary(train.lm)
    predict(train.lm, train)
    predict(train.lm, test)
Am I even on the right track?
Any help is greatly appreciated.
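To make the workflow concrete, here is a minimal sketch of the same approach on simulated data, since the real files can't be shared here. The column names `x1`..`x7` and `y`, the simulated coefficients, and the 800/200 split are all assumptions for illustration and not part of the original task. Holding out some training rows is one common way to check a model before predicting on the test set, not necessarily what the question expects.

```r
# Simulated stand-ins for the real CSV files (names and values are assumptions)
set.seed(1)
make_x <- function(n) {
  as.data.frame(matrix(rnorm(n * 7), ncol = 7,
                       dimnames = list(NULL, paste0("x", 1:7))))
}
train <- make_x(1000)
train$y <- 2 + 1.5 * train$x1 - 0.8 * train$x2 + rnorm(1000)
test <- make_x(500)  # covariates only, no response column

# Fit on 800 training rows and check the error on the remaining 200
idx <- sample(nrow(train), 800)
fit <- lm(y ~ ., data = train[idx, ])
holdout_pred <- predict(fit, newdata = train[-idx, ])
rmse <- sqrt(mean((train$y[-idx] - holdout_pred)^2))

# Refit on all 1000 training rows, then predict the 500 test rows
final_fit <- lm(y ~ ., data = train)
test_pred <- predict(final_fit, newdata = test)
```

Here `rmse` gives a rough idea of how far off the predictions are on rows the model has not seen, and `test_pred` holds one prediction per test row.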