Remove accent from struct
14 次查看(过去 30 天)
显示 更早的评论
I have this info_circuitos.Name_circuito. I need to shearch in a name. but the struct have accents. a e i o u OR A E I O U. Could have accent in any of that letter.
I want to create a new info_circuitos.Name_circuito without accent Help pls.
Thanks
2 个评论
采纳的回答
sixwwwwww
2013-12-9
编辑:sixwwwwww
2013-12-9
you can do it like this:
names = {files.name};
str = {'Á', 'É', 'Í', 'Ó', 'Ú', 'á', 'é', 'í', 'ó', 'ú'};
strreplace = {'A', 'E', 'I', 'O', 'U', 'a', 'e', 'i', 'o', 'u'};
for j = 1:length(names)
for i = 1:numel(str)
names{j} = regexprep(names{j}, str{i}, strreplace{i});
end
end
Good luck!
14 个评论
Walter Roberson
2013-12-9
This code does not change the structure, just finds the equivalent strings.
更多回答(1 个)
Walter Roberson
2013-12-9
编辑:Walter Roberson
2013-12-9
Assuming ISO-8859-1 rather than full Unicode:
This code is untested.
function new_structure = remove_accents_from_struct(old_structure)
% Note: this code is only designed for size(structure) = 1 x 1
% and does not descend recursively into substructures.
old_fields = fieldnames(old_structure);
new_fields = cellfun( @remove_accents_from_string, old_fields, 'Uniform', 0);
if isequal(old_fields, new_fields) %optimization when there are no accents
new_structure = old_structure;
return
end
num_fields = length(old_fields);
% create empty structure populated with the translated field names but not their contents
new_structure = cell2struct( cell( num_fields, 1 ), new_fields );
% now copy the field contents over
for K = 1 : num_fields
new_structure.(new_fields{K}) = new_structure.(old_fields{K});
end
end
function newS = remove_accents_from_string(S)
% table extracted from % http://core.trac.wordpress.org/browser/tags/3.7.1/src/wp-includes/formatting.php
accented_chars = [ ...
char(128), char(131), char(138), char(142), char(154), ...
char(158), char(159), char(162), char(165), char(181), ...
char(192), char(193), char(194), char(195), char(196), ...
char(197), char(199), char(200), char(201), char(202), ...
char(203), char(204), char(205), char(206), char(207), ...
char(209), char(210), char(211), char(212), char(213), ...
char(214), char(216), char(217), char(218), char(219), ...
char(220), char(221), char(224), char(225), char(226), ...
char(227), char(228), char(229), char(231), char(232), ...
char(233), char(234), char(235), char(236), char(237), ...
char(238), char(239), char(241), char(242), char(243), ...
char(244), char(245), char(246), char(248), char(249), ...
char(250), char(251), char(252), char(253), char(255) ];
unaccented_chars = 'EfSZszYcYuAAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy';
newS = S;
[tf, idx] = ismember(S, accented_chars);
newS(tf) = unaccented_chars(idx);
end
2 个评论
Walter Roberson
2013-12-9
You said you could have an accent on any letter, so I could not assume that you only want to deal with grave accents on English vowels.
Most of the rest of the length is comments or efficiency.
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Characters and Strings 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!