Data is not saving to the workspace

10 次查看（过去 30 天）

Aaron Smith 2017-2-10

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/324347-data-is-not-saving-to-the-workspace

评论： Stephen23 2017-2-21

I have a large text file composed of a single row of 52480000 numbers separated by semicolons. I'm attempting to organize the data into 51250 rows of 1024 numbers and then separate this into distinct blocks of 1025 x 1024. The numbers need to stay in the same order they were in in the original file (with every 1025th number being the start of a new row) I have tried using a while and if loop.

R = 51250;
C = 1024;
fid = fopen( 'TEST_A.asc');
k = 0;
while ~feof(fid)
z = textscan( fid, '%d', R*C, 'EndOfLine', ';');
if ~isempty(z{1})
k = k + 1;
s = fprintf( 'TEST_A.asc', ';');
dlmwrite( s, reshape( z{1}, 1025, []), ';')
end
end
fclose(fid);

This code does not create an initial cell of 52480000 numbers, which means that none of the subsequent data sets (s & z) are created in the workspace. The problem is that if I textscan the data into Matlab before formatting it, the file creates a memory error. Does anyone notice anything that I don't about this code or have any pointers?

26 个评论
显示 24更早的评论隐藏 24更早的评论

Stephen23 2017-2-10

编辑：Stephen23 2017-2-10

See earlier question:

https://www.mathworks.com/matlabcentral/answers/323811-creating-multiple-equally-sized-matrices-from-a-single-numerical-cell

"I'm attempting to organize the data into 51250 rows of 1024 numbers and then separate this into distinct blocks of 1025 x 1024"

Why do you need this intermediate step?

My answer showed you how to to simply process exactly those blocks of 1025*1024, avoiding that intermediate matrix entirely. What do gain by creating that huge matrix that you don't even want? My code shows how you can go directly to the smaller matrices (which seems to be your aim) without having to read the whole file data into MATLAB and without needing to use the intermediate step of rearranging all of the data into one pointlessly huge matrix.

Why not just read the blocks you need (1025*1024) instead of wasting time and memory with that huge matrix?

"The numbers need to stay in the same order they were in in the original file (with every 1025th number being the start of a new row) "

Yes, and that is what my answer does. Change R = 51250; back to R = 1025; and this code will work too.

Aaron Smith 2017-2-14

编辑：Aaron Smith 2017-2-14

在 MATLAB Online 中打开

When using fopen outside of the code itself, it works fine and doesn't create an error. The only thing I can think it could be is the fullfile and sbd in the fopen command. I tried taking it out, moving it but that creates errors with the code. Is there a way to put the fullfile(sbd, ...) part in a separate line?

sbd = 'tempdir';
R = 1025;
C = 1024;
opt = { 'EndOfLine', ';', 'CollectOutput', true };
>> fid = fopen(fullfile(sbd,'TEST_A.asc'),'rt');
>> k = 0;
while ~feof(fid)
k = k + 1;
Z = textscan( fid, '%d', R*C, opt{:});
S = fullfile( sbd, sprintf( 'TEST_ASA.asc', k ));
if rem( numel( Z{1}), R)==0
dlmwrite( S, reshape( Z{1}, [], R).', ';')
else
dlmwrite( S, Z{1}, ';')
end
end
Error using feof
Invalid file identifier.  Use fopen to generate a valid file identifier.
>> [fid, errmsg] = fopen( 'TEST_A.asc' )
fid =
       9
errmsg =
       ''
I was thinking, looking at the fullfile page on mathworks, Should i set up a folder to be a destination for the file? 
f = fullfile('myfolder','mysubfolder','myfile.m')

I'm thinking it may be the subdirectory (sbd) that is causing the error

Stephen23 2017-2-15

编辑：Stephen23 2017-2-15

You could register with dropbox, mediafire, google drive, or one of the many other file sharing websites, and send me the link of the file (via my profile page: please also include a link to this thread otherwise the email will get deleted automatically).

Stephen23 2017-2-15

编辑：Stephen23 2017-2-15

@Aaron Smith: I received your message. I will have a look a little later.

请先登录，再进行评论。

请先登录，再回答此问题。

采纳的回答

Stephen23 2017-2-15

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/324347-data-is-not-saving-to-the-workspace#answer_254956

编辑：Stephen23 2017-2-15

在 MATLAB Online 中打开

Thank you for the file. What did I learn from the actual data file: that it is not "composed of a single row", but in fact there are 51200 rows in the file that I received.

Why is this important? Because computers are stupid, and they do exactly what they are told to do. Knowing how to read a file correctly requires knowing what format the file has. In this case it is also quite handy for us, because it is trivial to read and write lines without much processing.

The code below worked correctly for me, reading the 200 MB file, and creating 50 smaller files with the rows following the same order as the original file.

sbd = 'temp';
f2d = fopen(fullfile(sbd,'temp_01.asc'),'wt');
f1d = fopen(fullfile(sbd,'TEST_A.asc'),'rt');
k = 0;
while ~feof(f1d)
    str = fgetl(f1d);
    if sscanf(str,'%d')==1
        k = k+1;
        fclose(f2d);
        fnm = fullfile(sbd,sprintf('temp_%02d.asc',k));
        f2d = fopen(fnm,'wt');
    end
    fprintf(f2d,'%s\n',str);
end
fclose(f1d);
fclose(f2d);

Note that:

the size of the output matrices is 1024x1025 (because there are 1025 numbers per line). This is correct because the first number of each line is simply a line count (check the files and you will see).
the lines are exactly the same as the original file.
MATLAB hold one line at a time: the lines are simply read from the large file and written directly to a new file.
as a result: no matrix, no converting from string to numeric and back to string.
it is slow because the file is large... reading and writing 51200 lines of 1025 numbers each will take some time.

7 个评论
显示 5更早的评论隐藏 5更早的评论

Stephen23 2017-2-16

编辑：Stephen23 2017-2-17

在 MATLAB Online 中打开

"i'm not sure if there is a fix for it."

You need to provide the correct filepath for your files. I put all of my files into one sub-directory of the current path named "temp". That worked for me. Do you see "temp" at the start of my code?

Imagine that you tell MATLAB (or any other programming language that has ever existed) to open this file 'C:\Temp\myfile.txt' But what should happen if there is no such file in that location? Then the programming language cannot read your mind: it cannot guess that you actually meant another location, e.g. 'C:\Temp\testfiles\myfile.txt', or that the file is actually called 'my_mistake.csv'. YOU are the one who has to know where you files are, and YOU have to provide the correct path to fopen (via fullfile if used).

So look at my code: I used a sub-directory named "temp". My files were all in that sub-directory. So I told MATLAB to look in that sub-directory. But when you test for those files like this:

[fid1, errmsg] = fopen( 'TEST_A.asc' )

Where is it looking?: ONLY IN THE CURRENT DIRECTORY. You did not tell fopen to look in any sub-directory, or in any other directories anywhere in your computer, or even anywhere else in the known universe. Just the current directory. Let me ask a question: is the file 'TEST_A.asc' in the current directory? If the answer is no, then why are you telling MATLAB to look for it in the current directory?

fopen failures are most commonly caused by one thing: users not giving the correct path (which includes spelling mistakes of the name).

"i'm not sure if there is a fix for it."

The fix is that you provide fopen with the correct path.

PS: [fid2, errmsg] = fopen( 'test_01.asc', 'w') is a pointless test because it just creates that file wherever you tell it too: see the "w" option? That creates a file. It does not care where.

PPS: Why did you get rid of the t option? You should keep it (unless you plan on doing strange things with EOL characters). Removing random things is not a good way of making code work.

Aaron Smith 2017-2-21

Thanks Stephen, that code works as far as I can see. What may I ask are the two ~ in the code doing?

Stephen23 2017-2-21

https://www.mathworks.com/help/matlab/matlab_prog/ignore-function-outputs.html

https://www.mathworks.com/help/matlab/matlab_prog/ignore-function-inputs.html

http://blogs.mathworks.com/steve/2010/01/11/about-the-unused-argument-syntax-in-r2009b/

请先登录，再进行评论。

类别

MATLAB Data Import and Analysis Data Import and Export Low-Level File I/O

在 Help Center 和 File Exchange 中查找有关 Low-Level File I/O 的更多信息

产品

MATLAB

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

Data is not saving to the workspace

26 个评论
显示 24更早的评论隐藏 24更早的评论

采纳的回答

7 个评论
显示 5更早的评论隐藏 5更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

Community Treasure Hunt

Data is not saving to the workspace

26 个评论 显示 24更早的评论隐藏 24更早的评论

采纳的回答

7 个评论 显示 5更早的评论隐藏 5更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

Community Treasure Hunt

26 个评论
显示 24更早的评论隐藏 24更早的评论

7 个评论
显示 5更早的评论隐藏 5更早的评论