Testing unlabeled data on a trained model

Dear Matlab community,
I need to know if there's a way to test the reliability of predictions made by classifying new data (unlabeled data) using and already trained model.
This is what I did:
1) Create a dataset with labeled data, with 2 predictors and 3 response variables (training set);
2) Fit and validate a Multiclass Support Vector Machine classifier using the training set;
3) Use the obtained model to make predictions on a new dataset with unlabeled data (test set)
I would like to know which are the classification metrics (if there are) to establish the relaibility of this classification, since the new data is unlabeled.
Thanks.

4 个评论

Reliability of the predictions made by a trained model is generally done using a test set which is labeled.
I suggest you split your labeled dataset into train, valid and test datasets. The train dataset is used for training the model, the valid dataset is used for tuning the hyperparameters of a model, and finally the test dataset will give you the performance or reliability of your final trained model.
I've used a 7-fold CV validation method + Hyperparameters Optimization option to train the model. What I did next, was to use that model (which is already trained and validated) to make predictions on new unlabeled data. From this new classification (predicted values) I wanted to know:
  1. if there was some statistical kind of validation to test the reliability of the predictions,
  2. or is the reliability of the predictions based only in the performance of your trained model?
Thanks for your reply! :)
If your labeled training data and the unlabeled test data have a high correlation, the best thing you can do is to use a small partition of the labeled training data as test data to get a quantitative measure on reliability. The high correlation should ensure similar performance with your unlabeled test data.
Apart from this, I don't think there is any reliable way to get performance of your model on real data without ground truth.
Thank you so much for the answer! This is just what I wanted to know.

请先登录,再进行评论。

回答(0 个)

类别

帮助中心File Exchange 中查找有关 Statistics and Machine Learning Toolbox 的更多信息

提问:

2020-10-26

评论:

2020-10-29

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by