MATLAB Answers

Leon
0

How to turn NaN values in only numerical columns into -999?

Asked by Leon
on 17 Oct 2019 at 1:07
Latest activity Commented on by Leon
on 17 Oct 2019 at 14:59
I have some data with both numerical and string columns. See attached for an example (aa.xlsx).
It has four columns like this:
Header1 Header2 Header3 Header4
1, 4, -9, ABC123
2, NaN, 0, NaN
5, 6, NaN, ABC789
My goal is to convert any NaN values that are in only numerical columns into -999, while leaving the NaN values in string columns intact. The end results should look like something like this:
Header1 Header2 Header3 Header4
1, 4, -9, ABC123
2, -999, 0, NaN
5, 6, -999, ABC789
Here is the code I know will work, if all of my columns are numerical:
%convert any NaN into -999
T1 = readtable ('aa.xlsx', 'PreserveVariableNames',true)
Ind_table = isnan(T1{:,:});
T1{:,:}(Ind_table) = -999;
How should I modify it so that it won't do the conversion for columns that are made up of strings?
Many thanks!

  0 Comments

Sign in to comment.

Tags

Products


Release

R2019b

3 Answers

Answer by Adam Danz
on 17 Oct 2019 at 1:48
Edited by Adam Danz
on 17 Oct 2019 at 1:48
 Accepted Answer

When you create your table, the missing values in the Header4 column will not appear as NaNs since that column contains character arrays. Instead, they will just be an empty char array. A very annoying thing with tables is that they do not support subscript indexing. So the solution below converts the table to a cell array, replace the NaN values in numeric columns, and then puts the cell array back into a table with matching properties as your original table.
T1 = readtable ('aa.xlsx', 'PreserveVariableNames',true);
T1cell = table2cell(T1);
isnum = varfun(@isnumeric,T1,'output','uniform'); % ID columns that are numeric
ismiss = ismissing(T1); % find missing values
T1cell(ismiss & isnum) = {-999};
T1New = cell2table(T1cell);
T1New.Properties = T1.Properties; % your new table with NaN replacement
Result
T1New =
3×4 table
Header1 Header2 Header3 Header4
_______ _______ _______ __________
1 4 -9 {'ABC123'}
2 -999 0 {0×0 char}
5 6 -999 {'ABC789'}

  2 Comments

The code works flawlessly.
Thank you so much! This is amazing.
Glad I could help!

Sign in to comment.


Answer by Sebastian Bomberg on 17 Oct 2019 at 14:46

You can have fillmissing apply only to the numeric variables directly:
fillmissing(T1,"constant",-999,"DataVariables",@isnumeric)

  1 Comment

Very cool and it works!
Thank you very much!

Sign in to comment.


Answer by Walter Roberson
on 17 Oct 2019 at 2:01

fillmissing(T1,'constant',{-999,-999,-999,'NaN'})
Note that this will use the character vector 'NaN' (three characters) in place of the numeric NaN entries in column 4, as it is not possible to have numeric entries in a column devoted to character vectors.

  5 Comments

Mind blowing!
I love this solution even better, because it also allows me to fill empty cells within string columns with my preferred string 'N/A'.
Thank you so much! So sorry for unable to accept another answer.
You can always unaccept / accept; I don't mind :)
Weird that the code works for some of my files but not for all of them. I now get an error.
Here is the code I use:
% replacing any NaN values with -999 for numerical columns, and
% any NaN values with 'N/A' for string columns.
char_vars = varfun(@iscell, T1, 'OutputFormat', 'uniform'); % identify the columns that are not numerical (strings)
constants = num2cell(-999*ones(1,width(T1))); % create a one-row matrix of -999
constants(char_vars) = {'N/A'}; % replace -999 with 'NaN' for string columns
new_T1 = fillmissing(T1, 'constant', constants); % fill the empty cells within each respective columns
T1 = new_T1;
Here is the error:
Error using fillmissing/checkArrayType (line 548)
Invalid fill constant type.
Error in fillmissing/fillTableVar (line 182)
[intConstVj,extMethodVj] = checkArrayType(Avj,intMethod,intConstVj,extMethodVj,x,true);
Error in fillmissing/fillTable (line 160)
B.(vj) = fillTableVar(indVj,A.(vj),intMethod,intConst,extMethod,x,useJthFillConstant,useJthExtrapConstant,mavj);
Error in fillmissing (line 138)
B = fillTable(A,intM,intConstOrWinSize,extM,x,dataVars,ma);
Error in FunctionLoading (line 32)
new_T1 = fillmissing(T1, 'constant', constants); % fill the empty cells within each respective columns

Sign in to comment.