How to determine the number of bins and the edge length based on the density of each bin. (Bins most likely are not uniform.)

Question

tsan toso 2013-10-18

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/90645-how-to-determine-the-number-of-bins-and-the-edge-length-based-on-the-density-of-each-bin-bins-most

评论： tsan toso 2013-10-19

Non-uniform bin density Hi,

I have been trying to figure out how to write a function to bin a sample of data together based on its density (Number of Occurrence/ Length of edge). But there are not alot of examples out there.

The output would give a vector of edges where both :

1) the number of bins are given by how many are required to group data that have different density by a threshold (maybe 40%?)

2) and the length of the edges are determined by if the adjacent data groups have similar density. (Similar density are grouped together, but if the neighboring bin is 40% more or less in density, it would require another bin).

So to illustrate my point, below is a simple example:

I have data values that ranges from 1 to 10 and I have 10 observations of it where x=[1,2,3,4,5,5,5,6, 6,7];

x would result in a range with edges that are [1,5,6,7,8], so there are four states just because the bins represent different density clusters.

Just to mention my actual data is continuous, any help is appreciated.

Regards,

Tresno

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

tsan toso 2013-10-18

Just to continue from the above example. I thought of a preliminary algorithm for large data samples:

1. Sort data in ascending order.

2. Group data where at least a group has 10 elements

3. Calculate and compare density to group similar ones together.

I got stuck on the 3rd point. Where I am not sure how to effectively group them. My obstacle comes from if the density increases slowly, but gradually e.g. Density: 1,2,3,4,5,6,7,8,9,10

Where do I call it break and say that one group has a different density from another.

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Image Analyst 2013-10-18

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/90645-how-to-determine-the-number-of-bins-and-the-edge-length-based-on-the-density-of-each-bin-bins-most#answer_100093

Try using kmeans clustering in the stats toolbox. For an example, see http://www.mathworks.com/products/demos/image/color_seg_k/ipexhistology.html

4 个评论
显示 2更早的评论隐藏 2更早的评论

Image Analyst 2013-10-18

Try this clustering app by Brett Shoelson: http://www.mathworks.com/matlabcentral/fileexchange/35014-clusterdata

tsan toso 2013-10-19

Hey thanks for the feedback. Did look at the app, however, it seems to cluster groups of data based on how close they are to each other (not exactly based on the sparsity of the data)...would ask if Brett if he have any thoughts.

Thanks again.

请先登录，再进行评论。

How to determine the number of bins and the edge length based on the density of each bin. (Bins most likely are not uniform.)

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

回答（1 个）

4 个评论
显示 2更早的评论隐藏 2更早的评论

另请参阅

类别

标签

产品

Community Treasure Hunt

How to determine the number of bins and the edge length based on the density of each bin. (Bins most likely are not uniform.)

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

回答（1 个）

4 个评论 显示 2更早的评论隐藏 2更早的评论

另请参阅

类别

标签

产品

Community Treasure Hunt

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

4 个评论
显示 2更早的评论隐藏 2更早的评论