Reading unformatted text and converting to formatted

22 次查看(过去 30 天)
Hi,
I have some files that contain raw text that is unformatted. These files have lots of lines and all of these lines are along the lines of
RFCH[0][0]:1, RFCH[0][1]:2, RFCH[0][2]:3,
etc. What I need to do is assign the value to the right of the colon to a corresponding matrix location, so the first 1 would go to [0,0] ([1,1] since Matlab probably wouldn't recognize [0,0]).
I know I can open the file using
fid=fopen(projectdir, 'r')
T=textscan(fid, '%s')
fclose(fid)
I am not really sure how go about this. Any help would be great.
  4 个评论

请先登录,再进行评论。

采纳的回答

Stephen23
Stephen23 2015-11-24
编辑:Stephen23 2015-11-24
You can read the whole file using fileread, identify the digits using regexp, convert the values to linear indices using sub2ind, and then simple linear indexing is all that is required to allocate the values from the file to the final matrix:
str = fileread('temp.txt');
% identify digits:
rgx = '[A-Z]+\[(\d+)\]\[(\d+)\]: *(\d+)';
C = regexp(str,rgx,'tokens');
% convert digits to numeric:
M = cellfun(@str2double,vertcat(C{:}));
M(:,1:2) = 1+M(:,1:2);
% convert to linear indices:
out = nan(max(M(:,1)),max(M(:,2)));
idx = sub2ind(size(out),M(:,1),M(:,2));
% allocate values:
out(idx) = M(:,3)
Because you did not provide any sample datafile I had to create my own, which contains only this string:
RFCH[0][0]:1, RFCH[0][1]:2, RFCH[0][2]:3, RFCH[2][2]:4
The code's output using my sample data file is this:
out =
1 2 3
NaN NaN NaN
NaN NaN 4
  4 个评论
Ibro Tutic
Ibro Tutic 2015-11-24
The file can't be uploaded due to NDA's and what not. And I figured that I could figure out what portion of the code looked at the letters and edit it in myself, but the method you use was something I did not think of and am not familiar with, so it didn't work out the way I intended.
Ibro Tutic
Ibro Tutic 2015-11-24
编辑:Ibro Tutic 2015-11-24
I am getting the same error as above, it seems the matrix C is an empty 0x0 matrix, which I assume is causing the error. The space after the colon was an error on my part, there is not supposed to be a space. I modified your script to take this into account and I am still receiving the same error.

请先登录,再进行评论。

更多回答(1 个)

dpb
dpb 2015-11-24
RFCH[0][0]:1, RFCH[0][1]:2, RFCH[0][2]:3,
fmt='RFCH[%d][%d]:%d';
fid=fopen('yorfile');
c=cell2mat(textscan(fid,fmt,'delimiter',',','collectoutput',1));
fid=fclose(fid);
X(c(:,1)+1,c(:,2)+1)=c(:,3);

类别

Help CenterFile Exchange 中查找有关 Text Data Preparation 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by