grouping data based on the times a value is repeated

3 次查看(过去 30 天)
I have data in array form with big number of rows and 12 columns. One of the column has an increasing array but not in a stable way (e.g. 1 1 1 1 1 2 2 2 2 3 3 4 4 4 4 4 4 4 4 5 6 6 6 etc)I want to organize data so I have all rows with 1 repetition of the value, 2 repetitions and so forth without disrupting the order of the sequence between each number. So I want to have an array that in the beginning was like this:
[1 45 67]
[1 23 89]
[1 90 110]
[2 41 52]
[2 51 76]
[2 77 88]
[2 13 14]
[3 545 111]
[3 242 53]
[3 80 23]
[4 11 22]
[4 14 26]
[4 16 77]
[4 11 943]
and I want it to be like that:
[1 45 67]
[1 23 89]
[1 90 110]
[3 545 111]
[3 242 53]
[3 80 23]
and
[2 41 52]
[2 51 76]
[2 77 88]
[2 13 14]
[4 11 22]
[4 14 26]
[4 16 77]
[4 11 943].
The data is all numerical and all rows have values in all columns.
Is there a way to do this with MATLAB?
Thank you very much.

采纳的回答

Guillaume
Guillaume 2018-2-28
m = [1 45 67; 1 23 89; 1 90 110; 2 41 52; 2 51 76; 2 77 88; 2 13 14; 3 545 111; 3 242 53; 3 80 23; 4 11 22; 4 14 26; 4 16 77; 4 11 943]
numreps = diff(find(diff([0;m(:, 1);0]))); %or use any histogram function
[~, neworder] = sort(numreps);
splitm = mat2cell(m, numreps, size(m, 2));
reorderedm = cell2mat(splitm(neworder))
Other ways of obtaining numreps:
numreps = accumarray(m(:, 1), 1); %only if m(:, 1) is integer from 1 to x with no gap.
numreps = histcounts(m(:, 1), 'BinMethod', 'integers'); %only if m(:, 1) is integer from 1 to x with no gap.
numreps = histcounts(m(:, 1), [unique(m(:, 1)); Inf]);
  3 个评论
Guillaume
Guillaume 2018-3-5
If I understand correctly what you're asking, replace the 2nd line with
[sortedreps, neworder] = sort(numreps);
to find the length of each run
runlength = diff(find(diff([0; sortedreps; 0])))

请先登录,再进行评论。

更多回答(2 个)

Image Analyst
Image Analyst 2018-2-28
Have you tried taking the histogram of the first column?
  5 个评论
Image Analyst
Image Analyst 2018-3-5
编辑:Image Analyst 2018-3-5
You must have a really old version of MATLAB. You can use hist() or histc() instead.
Guillaume
Guillaume 2018-3-5
In my answer, under "Other ways of obtaining numreps", I showed two different ways of using histcounts. Because of the automatic binning you can't just pass the column of number.

请先登录,再进行评论。


Georgios Tertikas
i meant it works but it doesnt show the results i need

类别

Help CenterFile Exchange 中查找有关 Characters and Strings 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by