How do I find the number of character in a text file?

10 次查看(过去 30 天)
I have a text file 'inp.txt' containing one line like this:
'banerjee.raktim 123456789125'
I want to know the number of character present in this line, including the white space. And also want to know the position of the white space character.
The expected no of character is 28 and opsition of space is:
16
How to do this?

采纳的回答

Matt Fig
Matt Fig 2011-1-26
fid = fopen('inp.txt'); % see also fclose
A = char(fread(fid,inf)).';
A =
banerjee.raktim 123456789125
>> length(A)
ans =
28
>> strfind(A,' ')
ans =
16
  3 个评论
Jan
Jan 2011-1-26
This "A = char(fread(fid,inf)).';" is can be improved: "A = fread(fid, [1, inf], '*char');" This avoid a data copy, casting and transpose operation, but creates a tiny [1x2] vector. But FGETL is efficient also and available in Matlab 6.5.

请先登录,再进行评论。

更多回答(2 个)

Walter Roberson
Walter Roberson 2011-1-26
Are the quote marks and the period present in the file? For the purposes of this discussion I will presume they are not.
a = textread('inp.txt','%s','Whitespace','');
line_length = length(a{1});
space_pos = find(a{1} == ' ',1,'first');
fprintf('The actual number of characters was %d and the actual space position was %d', line_length, space_pos);
  5 个评论
Walter Roberson
Walter Roberson 2011-1-26
Matlab 6.5? Sorry, that is before my time for functions such as textread(). This answer will not work for you.
Jan
Jan 2011-1-26
TEXTREAD is available in Matlab 6.5, but you have to use lower case for 'whitespace'.

请先登录,再进行评论。


Gary
Gary 2011-1-26
for a text file, doesn't each character represent 1 byte?
Does the 'dir' function exist in Matlab 6.5? In my tests creating a text file in Windows notepad, then getting the file size using Matlab's 'dir', the reported number of bytes in the file is equal to the number of characters I typed into it.
Of course, that doesn't tell you where the spaces are.
  6 个评论
Walter Roberson
Walter Roberson 2011-1-28
The OP wants the number of characters on the first line, and gave a specific expected count that did not include any line terminators.
We saw in an earlier cssm thread that Matlab does not read past ^Z on MS Windows for a file opened as text, so any check based upon file size returned by dir() would have to account for possible line terminators and possible ^Z . I don't think it worth the effort compared to just reading the line and taking its length().
Walter Roberson
Walter Roberson 2011-1-28
Gary, MS Windows is happy to write UTF-16 text files, so we cannot count on one character being the same as 1 byte, even without these issues of line or file terminators.

请先登录,再进行评论。

类别

Help CenterFile Exchange 中查找有关 Environment and Settings 的更多信息

产品

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by