Extracting certain data from very large text/numeric data

Question

Bernard 2013-8-28

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/85839-extracting-certain-data-from-very-large-text-numeric-data

关闭： MATLAB Answer Bot 2021-8-20

I am trying to extract data from a hoc file which is a combination of text,whitespace,characters, and numbers. I need to be able to find the row index of wherever there occurs the string "section[%d]" where d is an integer, just being able to find the row when I use importdata to a cell array would be good enough, there are upwards of like 40 occurences of the string so I need to find all of them.

6 个评论
显示 4更早的评论隐藏 4更早的评论

Bernard 2013-8-29

This is not anything to do with calculation. I need to just find where in the text the section id string occurs because that will give me a reference for the first point in that section. The ID number doesn't matter that much since if there is section written 10 times throughout all the points it will be sections(1-10)

Walter Roberson 2013-8-29

My regexp solution is not working for you?

此问题已关闭。

Answer 1

Walter Roberson 2013-8-28

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/85839-extracting-certain-data-from-very-large-text-numeric-data#answer_95351

在 MATLAB Online 中打开

find(~cellfun(@isempty, regexp(YourCell, 'section\[%\d+\]', 'start')))

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

Answer 2

Cedric 2013-8-29

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/85839-extracting-certain-data-from-very-large-text-numeric-data#answer_95397

编辑：Cedric 2013-8-29

在 MATLAB Online 中打开

Based on your comment: one way to tackle that is to split the file according to section headers/footer, so you get blocks that you can process using TEXTSCAN. Example:

 content = fileread('myData.txt') ;
 blocks  = regexp(content, '(}\s*){0,1}section\[\d+\]\s*{|}', 'split') ;
 blocks  = blocks(2:end-1) ;                 % Eliminate first empty and last 
                                             % (after last '}') blocks.
 nBlocks = length(blocks) ;
 data    = cell(nBlocks, 1) ;
 for bId = 1 : nBlocks
    data{bId} = textscan(blocks{bId}, 'pt3dadd(%f,%f,%f,%f,%f)') ;
 end

and if you don't want data to be a cell array of cell arrays (output of _TEXTSCAN_is a cell array of columns), you can replace the above line in the FOR loop with:

    buffer    = textscan(blocks{bId}, 'pt3dadd(%f,%f,%f,%f,%f)') ;
    data{bId} = [buffer{:}] ;

Extracting certain data from very large text/numeric data

6 个评论
显示 4更早的评论隐藏 4更早的评论

回答（2 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

标签

Community Treasure Hunt

Extracting certain data from very large text/numeric data

6 个评论 显示 4更早的评论隐藏 4更早的评论

回答（2 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

标签

Community Treasure Hunt

WeChat

6 个评论
显示 4更早的评论隐藏 4更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论