How to select the number of samples to train a Machine Learning algorithm?

Question

Jose Marques 2019-1-31


Direct link to this question

https://ww2.mathworks.cn/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm

Commented: Greg Heath 2019-2-4

I am working with a dataset of 12000 samples covering about 5 years of an industrial process.

It is likely that the plant has changed during this time (equipment, a drop in performance, chemical products).

Is there a tool for identifying the best subset of this data? In my view, a temporal cut in the data could increase the quality of the models created.
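One way to test the temporal-cut idea is to hold out the most recent samples, train on windows that start at different points in time, and compare the holdout error. The following is only a minimal sketch: the function name `holdout_rmse_by_start`, the ordinary least-squares model, and the synthetic data with a single regime change are all assumptions made for illustration, not part of the original dataset or of any toolbox.

```python
import numpy as np

def holdout_rmse_by_start(X, y, starts, n_holdout):
    """For each candidate start index, fit ordinary least squares on
    rows [start, n - n_holdout) and report RMSE on the last n_holdout rows.
    Rows are assumed to be in chronological order."""
    n = len(y)
    split = n - n_holdout
    # Prepend a column of ones so the model has an intercept term.
    A = np.column_stack([np.ones(n), X])
    results = {}
    for start in starts:
        coef, *_ = np.linalg.lstsq(A[start:split], y[start:split], rcond=None)
        resid = A[split:] @ coef - y[split:]
        results[start] = float(np.sqrt(np.mean(resid ** 2)))
    return results

# Synthetic stand-in for the plant data (purely illustrative):
# the input-output relation changes at row 600.
rng = np.random.default_rng(0)
n = 1200
X = rng.normal(size=(n, 3))
w_old = np.array([1.0, -2.0, 0.5])
w_new = np.array([-1.0, 2.0, 0.5])
change = 600
y = np.where(np.arange(n) < change, X @ w_old, X @ w_new)
y = y + 0.1 * rng.normal(size=n)

scores = holdout_rmse_by_start(X, y, starts=[0, 300, 600, 900], n_holdout=200)
for start, rmse in scores.items():
    print(f"train from row {start:4d}: holdout RMSE = {rmse:.3f}")
```

In this toy setup, windows that begin after the change fit the recent holdout much better than windows that mix both regimes. On real data the same comparison could be run with any of the model types under consideration, keeping the holdout period fixed so the scores stay comparable.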

3 Comments

Jose Marques 2019-1-31

Thanks for the comment!

The dataset has 426 inputs (I am using techniques for feature selection too).

I am using four algorithms to create the models: Regression Tree, Bagged Trees, SVM and Neural Networks.

Greg Heath 2019-2-4

As a common-sense rule of thumb, I try to use at least 10 to 30 times as many training points as there are unknown parameters to be estimated.

In addition I use 10 to 20 sets of random initial weights.
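To see what the 10-to-30 ratio implies here, the arithmetic can be sketched as follows. This assumes a network with a single hidden layer, one output, and bias terms, and it assumes all 12000 samples are used for training (in practice some would be held out). The function names and the 20-input case are hypothetical, chosen only to illustrate the effect of feature selection.

```python
def mlp_param_count(n_inputs, n_hidden, n_outputs=1):
    """Weights plus biases of a single-hidden-layer network."""
    return (n_inputs + 1) * n_hidden + (n_hidden + 1) * n_outputs

def max_hidden_nodes(n_train, n_inputs, n_outputs=1, ratio=10):
    """Largest hidden-layer size with n_train >= ratio * parameter count.
    Returns 0 if even one hidden node needs more training points."""
    h = 0
    while n_train >= ratio * mlp_param_count(n_inputs, h + 1, n_outputs):
        h += 1
    return h

n_train = 12000
for n_inputs in (426, 20):      # 20 is a hypothetical post-selection count
    for ratio in (10, 30):
        h = max_hidden_nodes(n_train, n_inputs, ratio=ratio)
        print(f"inputs={n_inputs:3d}, ratio={ratio:2d}: at most {h} hidden nodes")
```

Under these assumptions, all 426 inputs leave room for only 2 hidden nodes at a ratio of 10 and none at a ratio of 30, while 20 selected inputs allow up to 54 and 18 respectively. That is one reason to reduce the number of inputs before fitting a neural network to this dataset.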

I assume, of course, that you have examined plots of the data to initialize your common sense.

Hope this helps.

Greg


Answer 1

BERGHOUT Tarek 2019-2-3


Direct link to this answer

https://ww2.mathworks.cn/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm#answer_359276

You can use deep belief networks; they are the best for feature selection and mapping. Train your network on chunks of data, formed by randomly choosing pairs of (inputs, targets), and at the same time pay attention to your approximation function: you must keep your error function at its local minimum. Deep belief nets depend on a set of stacked autoencoders, which allows all the parameters of the network to be tuned with a small amount of training data.
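Training on randomly chosen chunks of (inputs, targets) pairs can be sketched roughly as below. This is not a deep belief network: it swaps in a plain linear model fitted by mini-batch gradient descent, purely to show the random sampling step. The helper `random_minibatches`, the synthetic data, and the learning rate are assumptions for illustration.

```python
import numpy as np

def random_minibatches(X, y, batch_size, rng):
    """Yield (inputs, targets) chunks that cover the data once in random order."""
    idx = rng.permutation(len(y))
    for i in range(0, len(idx), batch_size):
        batch = idx[i:i + batch_size]
        yield X[batch], y[batch]

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
true_w = np.arange(1.0, 6.0)    # hypothetical noiseless linear target
y = X @ true_w

w = np.zeros(5)
lr = 0.05                       # learning rate, chosen by hand for this toy case
for epoch in range(20):
    for Xb, yb in random_minibatches(X, y, batch_size=50, rng=rng):
        # Gradient of the mean squared error on this chunk.
        grad = 2 * Xb.T @ (Xb @ w - yb) / len(yb)
        w -= lr * grad

print("max abs weight error:", float(np.max(np.abs(w - true_w))))
```

The same sampling loop applies to any model trained by gradient steps; only the gradient computation changes.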

https://www.youtube.com/watch?v=E2Mt_7qked0

