How to validate expected numeric tall array column?

2 次查看(过去 30 天)
I have 60 column tall array where I cannot declare expected numeric columns numeric. I exported those numeric columns from Google Spreadsheet, but MATLAB think there is something string, but I cannot confirm it. Code
ds.SelectedFormats{strcmp(ds.SelectedVariableNames, 'Age')} = '%d';
Output
Error using tall
Mismatch between file and format character vector.
Trouble reading 'Numeric' field from file (row number 4, field number 2) ==> -,male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0,,0,,,,,"lorem, ipsun",0,1,1,0,0,0,0,0,0,0,,,,,,,,,\n
Error in test_tall_array (line 26)
tt = tall(ds);
Caused by:
Reading the variable name 'Age' using format '%d' from file: '/home/masi/test.csv' starting at offset 0.
MATLAB: 2016b OS: Debian 8.5
  1 个评论
Kirby Fears
Kirby Fears 2016-10-24
编辑:Kirby Fears 2016-10-24
SL,
The error suggests row 4, field 2 of your data contains an invalid data type. Can you manually view row 4 of your data to confirm this? Is it possible that some data is misaligned within the "Age" column?

请先登录,再进行评论。

回答(1 个)

Adam Filion
Adam Filion 2018-4-23
It looks like the data provider uses '-' to denote missing data in some places. The datastore's 'TreatAsMissing' property can solve this.
ds = datastore('*','TreatAsMissing','-');

类别

Help CenterFile Exchange 中查找有关 Large Files and Big Data 的更多信息

产品

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by