Indices of the values for which two conditions are true

I have a table of data where the first column is an abbreviated name (abbrev), the second column is the first word of the full name (firstword), and the third is a number corresponding to that name. I would like to remove duplicates that have the same 'abbrev' and 'firstword' and sum the numbers of those duplicates. Some entries may have the same abbreviated name but a different first word (e.g. 'rr' with 'Roger' and 'rr' with 'Dodger'), and vice versa. That is why both the abbreviated name and the first word have to match for an entry to count as a duplicate.
Or in other words from this data:
abbrev =
{'yw' }
{'rr' }
{'yw' }
{'rr' }
firstword =
{'yellow'}
{'Roger' }
{'yellow'}
{'Dodger' }
number =
5
10
1
3
I want to get this:
abbrev =
{'yw' }
{'rr' }
{'rr' }
firstword =
{'yellow'}
{'Roger' }
{'Dodger' }
number =
6
10
3
Thank you in advance!

Accepted Answer

Peng Li 2020-5-11
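% Combine the three variables into one table (default names Var1, Var2, Var3).
tbl = table(abbrev(:), firstword(:), number(:));
% Group rows by unique (abbrev, firstword) pairs; outTbl holds one row per group.
[gp, outTbl] = findgroups(tbl(:, 1:2));
% Sum the third column within each group.
outTbl.sum = splitapply(@sum, tbl.(3), gp)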
outTbl =
  3×3 table
     Var1        Var2        sum
    ______    __________    ___
    {'rr'}    {'Dodger'}     3
    {'rr'}    {'Roger' }    10
    {'yw'}    {'yellow'}     6
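
Note that the result above comes back sorted by group, while the output requested in the question lists the groups in order of first appearance. If that order matters, one possible alternative is to build a combined key and use unique with the 'stable' option together with accumarray. This is only a sketch: the names key, firstRow, idx and total, and the '|' separator, are illustrative assumptions and not part of the original answer.

```matlab
% Sample data from the question, as column vectors.
abbrev    = {'yw'; 'rr'; 'yw'; 'rr'};
firstword = {'yellow'; 'Roger'; 'yellow'; 'Dodger'};
number    = [5; 10; 1; 3];

% One key per row built from both fields, so rows match only if both fields match.
% Assumption: the '|' separator never occurs in the data itself.
key = strcat(abbrev, '|', firstword);

% 'stable' keeps groups in order of first appearance.
% firstRow: index of the first row of each group; idx: group number of each row.
[~, firstRow, idx] = unique(key, 'stable');

% Sum the numbers within each group.
total = accumarray(idx, number);

% Assemble the de-duplicated table with descriptive variable names.
outTbl = table(abbrev(firstRow), firstword(firstRow), total, ...
    'VariableNames', {'abbrev', 'firstword', 'number'})
```

With the sample data this should give the groups in the order yw/yellow, rr/Roger, rr/Dodger, matching the layout asked for in the question.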
