Take the data out after using Classification Learner App
2 次查看(过去 30 天)
显示 更早的评论
Hi, I separated my data into training and testing data. I trained using the training data, then I tested using the testing data with the code below. It said "Unrecognized variable name 'Character'." And how can I take the data out after classification? Please help me. Thank guys.
testdata=readtable("ClassificationTestData.xlsx")
predictions = char(trainedModel.predictFcn(testdata))
% accuracy
iscorrect=predictions==cell2mat(string((testdata.Character)));
iscorrect=iscorrect(:,2);
accuracy = sum(iscorrect)*100/20;
0 个评论
采纳的回答
Image Analyst
2021-12-15
Huy, here is a complete demo along with instructions in the comments
%==============================================================================
% TRAINING
trainingData=readtable("ClassificationData.xlsx")
% The first 4 columns are the inputs.
tPredictors = trainingData(:, 1:4);
% The last column is the "answer/ground truth".
tResponse = trainingData{:, end};
% Now run Classification Learner app. Start a new session from workspace, and
% Select tPredictors as the "Data set variable" and
% select tResponse as the response.
% I did "All quick to train" and found the Fine Tree model was the best.
% I highlighted it and clicked the Export button to export trainingModel to the workspace.
% Then save it to a .mat file from the command line like this:
% save('trainedModel.mat', 'trainedModel')
%==============================================================================
% TESTING
% Now run it on some test data once you have the trainedModel.mat file created.
% First read in the model from the .mat file
fileName = fullfile(pwd, 'trainedModel.mat');
if ~isfile(fileName)
message = sprintf('Model file not found:\n%s\n\nAre you sure you exported it from Classification Learner', fileName);
uiwait(errordlg(message));
return;
end
s = load(fileName)
trainedModel = s.trainedModel;
% Read in test data
testData=readtable("ClassificationTestData.xlsx")
testPredictors = testData(:, 1:4); % First 4 columns are the model inputs.
testResponse = testData{:, end}; % Last column is the ground truth (correct values).
% Predict the values.
numTestValues = size(predictions, 1)
% Compare predictions to ground truth.
itsCorrect = zeros(numTestValues, 1);
for k = 1 : numTestValues
if isequal(predictions{k}, testResponse{k})
itsCorrect(k) = 1;
end
end
% accuracy
accuracy = sum(itsCorrect)*100/numTestValues
20 个评论
Image Analyst
2021-12-28
Try this:
% TRAINING
trainingData=readtable("ClassificationData.xlsx")
% The first 4 columns are the inputs.
tPredictors = trainingData(:, 1:4);
% The last column is the "answer/ground truth".
tResponse = trainingData{:, end};
matFileName = fullfile(pwd, 'trainedModel.mat');
if ~isfile(matFileName)
warningMessage = sprintf('Model file not found:\n', matFileName);
uiwait(errordlg(warningMessage));
return;
end;
s = load(matFileName)
trainedModel = s.trainedModel
fn = fieldnames(trainedModel);
c = contains(fn, 'Classification');
regressionModelName = fn{find(c)}
% Read test data in to a table.
testData=readtable("ClassificationTestData.xlsx")
testPredictors = testData(:, 1:4); % First 4 columns are the model inputs.
testResponse = testData{:, end}; % Last column is the ground truth (correct values).
% Predict the values from the data in the tPredictors table.
predictions = trainedModel.predictFcn(testPredictors);
% Predict the values.
numTestValues = size(predictions, 1)
% Compare predictions to ground truth.
itsCorrect = zeros(numTestValues, 1);
for k = 1 : numTestValues
if isequal(predictions{k}, testResponse{k})
itsCorrect(k) = 1;
end
end
% accuracy
accuracy = sum(itsCorrect)*100/numTestValues
Everything is being classified as Cluster5. Is that what you expect?
更多回答(3 个)
Cris LaPierre
2021-12-15
You appear to be trying to use a variable name (Character) that does not exist in your test data file. To me, it appears the available variable names are Power, WingSpeed, WindDirect, WindSpeed, and Cluster.
2 个评论
Cris LaPierre
2021-12-15
I did. You need to use one of the actual variable names in your table, which appear to be Power, WingSpeed, WindDirect, WindSpeed, and Cluster.
Image Analyst
2021-12-15
You train using Power, WingSpeed, WindDirect, WindSpeed as the predictors, and Cluster as the response (ground truth).
Now when you go to use the trainedModel, you need to pass in only the predictors (columns 1-4 which are Power, WingSpeed, WindDirect, WindSpeed). Don't pass in Cluster as a predictor since the trained model won't be expecting that.
Your predictions are trainedModel.predictFcn(testdata). That's how you "can take the data out after classification". I'm not sure you need to cast it to char. And there is no column in your testdata table called "Character". Again, it needs to be Cluster since that's the column where the ground truth response for your test data is stored.
Attach trainedModel.mat if you need more help.
3 个评论
Image Analyst
2021-12-15
Save the model to a .mat file
save('trainedModel.mat', 'trainedModel');
then attach trainedModel.mat with the paper clip icon.
Character is not a field of the table so you need to extract the ground truth labels like this
testGroundTruth = testdata.Cluster;
iscorrect= predictions == testGroundTruth;
That may not work. I won't know unless I get the model. Or I'd have to train the model myself, but I don't know which model you chose.
Also make sure you didn't train your model with Cluster being one of the predictors as well as the response. Otherwise it will ignore your other 4 inputs since Cluster is, obviously, a perfect predictor of the response.
Huy Cao
2021-12-15
8 个评论
Image Analyst
2022-1-12
I think for earlier versions it expects the response variable to be one of the columns in your predictors table. I was just working with a guy yesterday who had r2019b and we figured out that's what it was wanting.
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Classification Learner App 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!