Info

此问题已关闭。 请重新打开它进行编辑或回答。

Are Regular Expressions the best way to do this job?

2 次查看(过去 30 天)
I have big list of devices, but they are "encrypted" and look like this:
  • 'A10_EG_KitchenRadio_L2_P'
  • 'A11_KG_FloorPP_L1_P'
  • 'C01_EG_PC_L3_P'
  • 'C02_EG_TV_Video_L3_P'
  • 'C03_EG_HIFI_L3_P'
  • 'C04_EG_Switch_L1_P'
  • 'C05_EG_MeasuringSystems_L3_P'
  • 'A03_freezer_cooling_combi_L1_P'
instead they should look like this:
  • Radio
  • PowerPlug
  • PC
  • TV
  • HIFI
  • Switch
  • Measuringsystem
  • Freezer
I'm trying to solve this by using Regular Expressions. Is there a better way to do this?
TIA.
  4 个评论
Daniel Shub
Daniel Shub 2012-4-23
Do you want "freezer" or "Freezer". Can you state the rule for what you want? In particular I am concerned about 'C02_EG_TV_Video_L3_P' and the possibility of 'C02_TV_Video_L3_P'.
Norbert
Norbert 2012-4-23
I want Freezer. Every device should be capitalized. I know there are some possibility for every devices, but I want them all to have the same name, for example:
* C02_EG_TV_Video_L3_P
* C02_TV_Video_L3_P
* C02_TV_Station_L3_P
* C02_Home_TV_L3_P
* C02_Television_L3_P
* ...
to be named:
* TV
(same for other devices)

回答(2 个)

Daniel Shub
Daniel Shub 2012-4-23
I think the answer depends on what you mean by regular expressions. In MATLAB, I think of
doc regexp
doc regexprep
For this task I would argue that regular expression support in MATLAB is not as good or user friendly as it is in Perl, SED or AWK.
If you are looking for help writing the regular expression, you need to be able to state the rules. For example
  1. All lines start with Letter-digit-digit-undescore
  2. There is an optional Letter-Letter-underscore (where Letter-Letter is not TV or PC)
  3. All lines end with underscore-Letter-digit-underscore-Letter
  4. What is in between the start and the end of the line should be classified as one of N things according to the following rules
  5. If if it contains ...

Andrei Bobrov
Andrei Bobrov 2012-4-23
t = regexprep(A,{'[EK]G','_','\w*Radio','\w*PP','(cool\w*|freezer)','\w*vision'},{'',' ','Radio','PowerPlug','Freezer','TV'})
out = cellfun(@(x)x{2},regexp(t,'\w*','match'),'un',0)
  6 个评论
Oleg Komarov
Oleg Komarov 2012-4-23
I would define which sequences map to which device. Then I would loop for each device using regexp.
Norbert
Norbert 2012-4-23
Isn't it easier to just use strfind? E.g. if I find a string including 'cool', just name it 'Freezer'.

此问题已关闭。

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by