Histogram to a CDF/PDF

75 次查看(过去 30 天)
Sclay748
Sclay748 2020-8-24
评论: Sclay748 2020-8-24
Hello, This is a screenshot of a table I have constructed for work.
Just to play it safe, I blacked out the column names, though it would be hard to assume anything with just 7 rows of the table to go off of. We will call the 5 fields "column1, column2, etc."
So I am able to create the hisogram of any of the columns, besides 3, but that isn't needed because it is all '94'.
I do:
histogram([a(1:135756).column1])
and the histogram works perfectly.
How would I do a CDF or PDF of this data?
I have tried:
histogram([a(1:135756).column1],'Normalization',pdf)
or
histogram([a(1:135756).column1],'Normalization',cdf)
but nothing changes from the original histogram.
Thank you!

采纳的回答

Bruno Luong
Bruno Luong 2020-8-24
编辑:Bruno Luong 2020-8-24
A=[a(1:135756).column1];
figure
subplot(2,1,1);
histogram(A,'Normalization','pdf');
ylabel('pdf');
subplot(2,1,2);
histogram(A,'Normalization','cdf');
ylabel('cdf');

更多回答(1 个)

Alan Stevens
Alan Stevens 2020-8-24
You can get a CDF as follows:
% Modified Kaplan-Meier CDF
% assumes each point is representative of 1/N of the population.
a = sort(a(:,1)); % so all the data for a are sorted in ascending order
N = length(a);
for k = 1:N
CDF(k) = (k - 0.5)/N;
end
plot(a,CDF)
Because you have a large number of points you could simply numerically differentiate the CDF to get a PDF.
  4 个评论
Bruno Luong
Bruno Luong 2020-8-24
Hmm it cries for replacing the for-loop
a1 = sort([a(1:135756).column1]);
N = length(a1);
CDF = (0.5:N-0.5) / N;
plot(a1, CDF);
Sclay748
Sclay748 2020-8-24
编辑:Sclay748 2020-8-24
Bruno, that worked. Is it supposed to be a single curved line?
I thought it would still plot the bars, but arranged by CDF. Or that is atleast how my boss's turned out when he showed me an example using histogram(......,'Normalization',cdf)

请先登录,再进行评论。

产品


版本

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by