结果会保存在S3,有一个result的目录生成
这个例子里,我们用森林覆盖率作为例子,做一个训练 CoverType:森林覆盖的情况
同时,我们也使用F-Score来判断模型的Auccracy
我们的目录结构是这样的
结果是这样的,我们从中选出最尤的几个解
这个例子中,我们使用了house pricing的data,来预测房价
so we're given almost 80 different features including things like the number of
中文叫残差,如果回归模型正确的话, 我们可以将残差看作误差的观测值。 它应符合模型的假设条件,且具有误差的一些性质。利用残差所提供的信息,来考察模型假设的合理性及数据的可靠性称为残差分析。正数,负数都是不理想的结果。nagative rasidual indicates overestimating the target.
If we didn't have a zero-centered bell-shaped curve, then there's some definite error with the model's prediction and we might then try and add some more features to our model, so in the example of house prices, we might try and include new features such as the say the area of the garden or does the house have under floor heating or the energy rating of the house and we'll use these to try and determine what is not being captured by our model.