Info
此问题已关闭。 请重新打开它进行编辑或回答。
Repetition of repeated rows
    2 次查看(过去 30 天)
  
       显示 更早的评论
    
I have a very large matrix (780000X2). In column 1, there is a scenario number, between 1 and 20000. I want to identify those rows which have a scenario number repeated elsewhere at least 20 times. For example, let's say I want to identify rows in Matrix A which have a scenario number repeated at least 2 times.
A= [1 22;2 23;2 24;2 25;3 26]
In this situation, I would be trying to identify (2, 23), (2,24) and (2,25).
0 个评论
回答(2 个)
  Star Strider
      
      
 2016-4-6
        This works:
A = [1 22;2 23;2 24;2 25;3 26];
[Au,ia,ic] = unique(A(:,1));                                    % Find unique Values In Column #1
A1h = accumarray(ic, 1);                                        % Historgram Counts Of Those Values
Desired_Rows = A(ic == ia(A1h > 2), :)                          % Find All Rows With ‘ia’ Index Of Column #1 Numbers Meeting Criteria
Desired_Rows =
       2    23
       2    24
       2    25
It first finds the unique entries in column 1, then counts them in the accumarray call, finds all those meeting criteria in the ‘ic’ output of the unique call, and uses those addresses (a logical vector here) to select and save the output of that to the ‘Desired_Result’ variable.
0 个评论
  Robert
      
 2016-4-6
        There are a few ways you could do this. The most straightforward is with hist. Use hist to count the instances of your data with bins 1:20000.
values = 1:20000;
num_occurances = hist(A(:,1),values);
values(num_occurances >= 20)
0 个评论
此问题已关闭。
另请参阅
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!


