0
有人可以解釋什麼時候你分開測試和培訓的數據集?SSAS數據挖掘:測試和培訓數據集...請詳細解釋
有人可以解釋什麼時候你分開測試和培訓的數據集?SSAS數據挖掘:測試和培訓數據集...請詳細解釋
簡而言之,您的數據挖掘模型的準確性通過基於您的測試集中已知結果的訓練集進行預測來評估。
More information on the testing and validation of data mining models (MSDN)
爲了能夠測試你建立預測分析模型,您需要將您的數據集,分爲兩組:訓練和測試數據集。這些數據集應該隨機選擇,應該是實際人羣的良好表現。
Similar data should be used for both the training and test datasets.
Normally the training dataset is significantly larger than the test dataset.
Using the test dataset helps you avoid errors such as overfitting.
The trained model is run against test data to see how well the model will perform.