reading data after a string

1 次查看(过去 30 天)
Sergio
Sergio 2013-10-27
编辑: Cedric 2013-10-27
Hi,
I would like to read the data below the words 'Link Results'. What i would really like to know is how to get the line number in the text file that I'm trying to read that has 'Link Results'. After I get that I know how to read the data. so my question would be how to find that number.
Thanks!
  3 个评论
Sergio
Sergio 2013-10-27
Here is a piece of the text I would like to read
. . . 877 0.01 0.01 0.01 72.01 0.01 1047 0.01 0.01 0.01 72.01 0.01
************
Link Results
************
<<< Link 1301 >>>
-------------------------------------------------------------
Flow Velocity Depth Percent
Date Time CFS ft/sec feet Full
-------------------------------------------------------------
SEP-29-2010 00:15:00 0.000 0.000 0.000 0.0
SEP-29-2010 00:30:00 0.000 0.000 0.000 0.0
SEP-29-2010 00:45:00 0.000 0.000 0.000 0.0
.
.
.
I figured I could just get the line index where it reads 'Link Results' (which is 7063 in this case but changes with the other text files I need to run through) and add 8 to get to my data
Thanks!
Cedric
Cedric 2013-10-27
And how does the table end? Is it the end of file or is there some other content afterwards?

请先登录,再进行评论。

采纳的回答

Cedric
Cedric 2013-10-27
编辑:Cedric 2013-10-27
Here is one way, assuming that the block of data is the last content that you have in the file. If not, I can update the answer. As Walter is pointing out, we have to read the file from the beginning (here we read the full content) and match pattern(s). This solution does that using a a regular expression, which is quite concise, but you could implement a more basic loop which scans each line.
content = fileread( 'myFile.txt' ) ;
blocks = regexp( content, 'Link Results.*?Full.*?\-\s(.*)', 'tokens' ) ;
block = blocks{1}{1} ;
data = textscan( block, '%s %s %f %f %f %f' ) ;
After running this, you get..
>> data
data =
{3x1 cell} {3x1 cell} [3x1 double] [3x1 double] [3x1 double] [3x1 double]
>> data{1}
ans =
'SEP-29-2010'
'SEP-29-2010'
'SEP-29-2010'
>> data{2}
ans =
'00:15:00'
'00:30:00'
'00:45:00'
>> data{3}
ans =
0
0
0
etc. Then you can post-process this cell array to get e.g. a cell array of time stamps, and a numeric array of data..
timeStamps = arrayfun( @(r) [data{1}{r}, ' ', data{2}{r}], ...
1:numel(data{1}), 'UniformOutput', false ).' ;
dataNum = [data{3:end}] ;
which leads to:
>> timeStamps
timeStamps =
'SEP-29-2010 00:15:00'
'SEP-29-2010 00:30:00'
'SEP-29-2010 00:45:00'
>> dataNum
dataNum =
0 0 0 0
0 0 0 0
0 0 0 0
  4 个评论
Sergio
Sergio 2013-10-27
This would be the end of the text file **** OCT-01-2010 23:30:00 15.833 1.067 0.819 1.7 OCT-01-2010 23:45:00 15.496 1.061 0.810 1.7 OCT-02-2010 00:00:00 15.175 1.055 0.801 1.7
Analysis begun on: Sun Oct 27 16:18:20 2013
Analysis ended on: Sun Oct 27 16:18:37 2013
Total elapsed time: 00:00:17
Cedric
Cedric 2013-10-27
编辑:Cedric 2013-10-27
Ok, just update the pattern in the regexp call as follows:
blocks = regexp( content, 'Link Results.*?Full.*?\-\s(.*?)\s*Ana', ...
'tokens' ) ;

请先登录,再进行评论。

更多回答(1 个)

Walter Roberson
Walter Roberson 2013-10-27
There are no functions built into MATLAB, or to any of the operating systems that current MATLAB run on, that can tell you which line number of a file that you are positioned to. None of the operating systems supported have any inherent concept of "line number".
Therefore if you want to know which line number something is on, you need to start at the beginning of the file, read line by line, counting each as you go, until you find the pattern you are looking for.

类别

Help CenterFile Exchange 中查找有关 Standard File Formats 的更多信息

标签

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by