fprintf for character above 127 uses 3 bytes

Question

MarsMat 2016-3-23

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/275045-fprintf-for-character-above-127-uses-3-bytes

Commented: Guillaume 2016-3-23

Hello everyone, could somebody give me a hint as to why applying fprintf to a character above 127 consumes 3 bytes? Is this related to MATLAB or to the system? Thank you in advance.

Here is my test code, and further below is part of the results I got out of this code:

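fid = fopen('testprintbeyond128', 'w');
for i = 1:255
    fprintf(fid, '%s\t', num2str(i));      % decimal code
    fprintf(fid, '%s\t', dec2hex(i));      % hexadecimal code
    cc = fprintf(fid, '%s\t', char(i));    % the character itself; cc is the number of bytes written
    fprintf(fid, '%d\n', cc);              % record that byte count
end
fclose(fid);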

results:


Answer 1

Guillaume 2016-3-23

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/275045-fprintf-for-character-above-127-uses-3-bytes#answer_214701

Once you've opened your text file, check its encoding with

[~, ~, ~, encoding] = fopen(fid)

You will find that it is UTF-8. In UTF-8, character codes between U+0000 and U+007F are encoded in one byte, and character codes between U+0080 and U+07FF are encoded in two bytes. Add one byte for the tab (U+0009) and what you're seeing is normal.
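
To make the byte counts concrete, here is a minimal sketch that encodes a few characters to UTF-8 with unicode2native and counts the resulting bytes. The two sample characters (code 65 and code 233) and the variable names are chosen only for illustration; they are not taken from the original test.

```matlab
% Encode individual characters to UTF-8 and count the resulting bytes.
asciiBytes  = unicode2native(char(65),  'UTF-8');      % 'A', U+0041, below U+0080
latin1Bytes = unicode2native(char(233), 'UTF-8');      % U+00E9, between U+0080 and U+07FF
tabBytes    = unicode2native(sprintf('\t'), 'UTF-8');  % tab, U+0009

nAscii  = numel(asciiBytes);
nLatin1 = numel(latin1Bytes);
nTab    = numel(tabBytes);

fprintf('U+0041: %d byte(s)\n', nAscii);
fprintf('U+00E9: %d byte(s)\n', nLatin1);
fprintf('U+00E9 followed by a tab: %d byte(s)\n', nLatin1 + nTab);
```

The last figure is the quantity that fprintf(fid,'%s\t',char(i)) returns in the question's loop when the file is opened as UTF-8, because the return value counts the character and the trailing tab together.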


Walter Roberson 2016-3-23

See http://www.mathworks.com/matlabcentral/answers/275091-fprintf-and-fread-character-beyond-ascii-in-1-byte

Guillaume 2016-3-23

@MarsMat, two things:

If you use fprintf, then really you should read the data back with fscanf. This would handle all these encoding issues transparently. While we're at it, I would open the file with 'wt' and 'rt' to make sure line endings are handled properly.

To have characters from 128 to 255 stored as one byte each, you need to explicitly specify a suitable character encoding when you fopen the file. Only you can tell which one is the most appropriate. 'ISO-8859-1' is the closest to Unicode, since its byte values 0 to 255 map directly to code points U+0000 to U+00FF, but depending on where you're located another encoding may be more appropriate.
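
The two suggestions above can be sketched together as follows. This is only an illustration under some assumptions: the temporary file name comes from tempname, character code 200 is an arbitrary sample, and 'ISO-8859-1' is used because it is the encoding named above; another single-byte encoding may suit your data better.

```matlab
% Hypothetical temporary file; any writable path would do.
fname = [tempname '.txt'];

% Write in text mode with an explicit single-byte encoding.
% The third argument ('n') keeps the native byte order; the fourth is the encoding.
fid = fopen(fname, 'wt', 'n', 'ISO-8859-1');
nBytes = fprintf(fid, '%s\t', char(200));   % one byte for the character, one for the tab
fclose(fid);

% Read the text back with fscanf, using the same encoding.
fid = fopen(fname, 'rt', 'n', 'ISO-8859-1');
txt = fscanf(fid, '%c');                    % '%c' keeps whitespace such as the tab
fclose(fid);

% Inspect the raw bytes on disk to confirm the one-byte storage.
fid = fopen(fname, 'r');
raw = fread(fid, Inf, 'uint8')';
fclose(fid);
delete(fname);

fprintf('fprintf reported %d bytes; raw bytes on disk: %s\n', nBytes, mat2str(raw));
```

Passing the same encoding to both fopen calls is what lets fscanf return the original character codes without any manual byte handling.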
