Tall array read csv file

1 次查看(过去 30 天)
Safwana Razak
Safwana Razak 2019-11-19
评论: dpb 2019-11-20
Hi why i am encounter this error while train my tall array data in machine learning model?
Error using matlab.io.datastore.TabularTextDatastore/readData (line 77)
Mismatch between file and format character vector.
Trouble reading 'Numeric' field from file (row number 6738, field number 4) ==> I/O Timeout,I/O Timeout,I/O
Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O
Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Time...
Learn more about errors encountered during GATHER.
FYI my csv file go (200000x344). i tried to remove row number 6738 but the error still occur.
please advise.
  5 个评论
Safwana Razak
Safwana Razak 2019-11-20
hi,
i already found the problem. my file contain the worng data in row 56892 but it the error its stated 6738, maybe because tall array already chucked my data.
Annotation 2019-11-20 091838.png
BTW, thanks for your reply.
dpb
dpb 2019-11-20
"the worng data in row 56892 but it the error its stated 6738, "
I've not used the tall array stuff, but apparently the error message line count is coming from the segment in use rather than being referenced back to the beginning of the file. That's probably worth a "Quality of Implementation" bug report to TMW to make debugging easier.
The klew to what was wrong is that it did echo the offending record content so a search for that string in the file would locate it...

请先登录,再进行评论。

回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Large Files and Big Data 的更多信息

产品


版本

R2019b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by