Vectorized method to sum missed one values

Question

0 个投票

Hi there,

I am trying to achieve something, but I can't think of a vectorized way of doing this. The problem is as follows.

Say I have a vector of 0's and 1's, e.g. [0, 1, 1, 0, 0, 1, 0, 1]. Then I want to manipulate it in such a way that I get the following vector: [0, 2, 1, 0, 0, 3, 0, 2]. Hence, from left to right, every time a 1 occurs, it adds the number of consecutive preceding zero's, if any.

This can easily be done in a loop, but because of the vast number of computations, I am looking for a vectorized way to achieve this.

Any help is appreciated!

Best, Robert

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Follow Question

Answer 1

Amir Xz 2018-9-19

编辑：Amir Xz 2018-9-19

在 MATLAB Online 中打开

1 个投票

A=[0, 1, 1, 0, 0, 1, 0, 1];
[~,NonZr] = find(A~=0);
A(NonZr) = [NonZr(1),NonZr(2:end)-NonZr(1:end-1)];

Result:

A =
     0     2     1     0     0     3     0     2

4 个评论
显示 2更早的评论隐藏 2更早的评论

Robert Vullings 2018-9-20

Thank you very much, this is indeed a very smart way to do it!

Robert Vullings 2018-11-2

在 MATLAB Online 中打开

Any ideas on how to expand this (or another method) to a 2D array?

For example, say

A=[0, 1, 1, 0, 0, 1, 0, 1;
   1, 0, 1, 0, 1, 1, 0, 1];

would then become

A=[0, 2, 1, 0, 0, 3, 0, 2;
   1, 0, 2, 0, 2, 1, 0, 2];

请先登录，再进行评论。

Answer 2

Guillaume 2018-9-19

编辑：Guillaume 2018-9-19

在 MATLAB Online 中打开

1 个投票

You can replace the earlier part of this answer by the compiled version of rcumsumc for speed

v = [0, 1, 1, 0, 0, 1, 0, 1];
rcum = double(~v);
csum = cumsum(rcum);
rcum(v == 1) = -diff([0, csum(v == 1)]);
rcsum = cumsum(rcum) + 1;
%all the above can be replaced by rcumsum
%rcsum = rcumsum(~v) + 1;
reploc = diff(v) == 1
v([false, reploc]) = rcsum([reploc, false])

Note that I'm not convinced that it will be faster than a well written loop (which can do the job in only one pass over the data).

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Christopher Wallace 2018-9-19

This is about 20x faster than my answer when run on my machine. Nice work!

请先登录，再进行评论。

Answer 3

Christopher Wallace 2018-9-19

在 MATLAB Online 中打开

0 个投票

startingData = [0, 1, 1, 0, 0, 1, 0, 1];
stringArr = sprintf('%d', startingData ); % Convert to string for use with regexp
zerosLoc = regexp(stringArr , '(0*)');    % Find starting index of groups of 0's
onesLoc = regexp(stringArr , '(1*)');     % Find starting index of groups of 1's
startingData(onesLoc) = (onesLoc - zerosLoc) + 1; The difference in the starting location of the ones and the starting location of the zeros which will result in the number of zeros leading up to the 1.

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Guillaume 2018-9-19

在 MATLAB Online 中打开

Conversions from numbers to strings are never fast, but

stringArr = char(startingData + '0');

will be a lot faster than using sprintf.

However, you don't need regexp and strings to find the start of the sequences.

zerosLoc = find(diff([1, startingData]) == -1);  %find [1 0] transitions
onesLoc = find(diff([0, startingData]) == 1);    %find [0 1] transitions

With this it may actually be faster than my solution.

请先登录，再进行评论。

Vectorized method to sum missed one values

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

4 个评论
显示 2更早的评论隐藏 2更早的评论

更多回答（2 个）

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

类别

标签

Community Treasure Hunt

Vectorized method to sum missed one values

0 个评论 显示 -2更早的评论 隐藏 -2更早的评论

采纳的回答

4 个评论 显示 2更早的评论 隐藏 2更早的评论

更多回答（2 个）

1 个评论 显示 -1更早的评论 隐藏 -1更早的评论

1 个评论 显示 -1更早的评论 隐藏 -1更早的评论

类别

标签

另请参阅

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

4 个评论
显示 2更早的评论隐藏 2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论