replace zeros according to distribution of integers in each column

1 次查看(过去 30 天)
if I have some matrix x of randomly distributed values (value 0 to n = 5 or 10 in this case)
x = [binornd( 5, 0.05, 10000, 1 ), binornd( 10, 0.1, 10000, 1 )];
I want to create a second matrix where any zeros in x are replaced at random by a value from 1 to n according to the distribution when zero is excluded in that particular column. I want to do this by random sampling from the values in each column excluding zero.
Any ideas how to do this quickly (in reality the matrices are considerably bigger than this example...)

采纳的回答

Guillaume
Guillaume 2016-10-22
编辑:Guillaume 2016-10-22
If I understand correctly, you want to replace all zeros in a column by random non-zero values from the same column? In which case:
replacementcount = sum(x == 0);
for column = 1:size(x, 2)
validvalues = nonzeros(x(:, column));
x(x(:, column) == 0, column) = validvalues(randperm(numel(validvalues), replacementcount(column)));
end
  3 个评论
Guillaume
Guillaume 2016-10-22
The error with randperm is because you have more 0s in the column than non-zeros, something I had not considered.
Because the number of 0s can vary per column you cannot avoid the loop. Note that it's only looping over the columns whose number I understood is negligible compared to the number of rows.

请先登录,再进行评论。

更多回答(1 个)

Andriy Kavetsky
Andriy Kavetsky 2016-10-22
You can try this n=10; or n=5; and x(x==0)=randi(n)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by