Info

此问题已关闭。 请重新打开它进行编辑或回答。

Extracting certain data from very large text/numeric data

1 次查看(过去 30 天)
I am trying to extract data from a hoc file which is a combination of text,whitespace,characters, and numbers. I need to be able to find the row index of wherever there occurs the string "section[%d]" where d is an integer, just being able to find the row when I use importdata to a cell array would be good enough, there are upwards of like 40 occurences of the string so I need to find all of them.
  6 个评论
Bernard
Bernard 2013-8-29
This is not anything to do with calculation. I need to just find where in the text the section id string occurs because that will give me a reference for the first point in that section. The ID number doesn't matter that much since if there is section written 10 times throughout all the points it will be sections(1-10)

回答(2 个)

Walter Roberson
Walter Roberson 2013-8-28
find(~cellfun(@isempty, regexp(YourCell, 'section\[%\d+\]', 'start')))

Cedric
Cedric 2013-8-29
编辑:Cedric 2013-8-29
Based on your comment: one way to tackle that is to split the file according to section headers/footer, so you get blocks that you can process using TEXTSCAN. Example:
content = fileread('myData.txt') ;
blocks = regexp(content, '(}\s*){0,1}section\[\d+\]\s*{|}', 'split') ;
blocks = blocks(2:end-1) ; % Eliminate first empty and last
% (after last '}') blocks.
nBlocks = length(blocks) ;
data = cell(nBlocks, 1) ;
for bId = 1 : nBlocks
data{bId} = textscan(blocks{bId}, 'pt3dadd(%f,%f,%f,%f,%f)') ;
end
and if you don't want data to be a cell array of cell arrays (output of _TEXTSCAN_is a cell array of columns), you can replace the above line in the FOR loop with:
buffer = textscan(blocks{bId}, 'pt3dadd(%f,%f,%f,%f,%f)') ;
data{bId} = [buffer{:}] ;

此问题已关闭。

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by