discretize
Group data into bins or categories
Syntax
Description
returns the indices of the bins that contain the elements of
Y
= discretize(X
,edges
)X
. The j
th bin contains element
X(i)
if edges(j) <= X(i) <
edges(j+1)
for 1 <= j < N
, where
N
is the number of bins and length(edges) =
N+1
. The last bin contains both edges such that edges(N)
<= X(i) <= edges(N+1)
.
[___] = discretize(___,
returns
the corresponding element in values
)values
rather than
the bin number, using any of the previous input or output argument
combinations. For example, if X(1)
is in bin 5,
then Y(1)
is values(5)
rather
than 5
. values
must be a vector
with length equal to the number of bins.
[___] = discretize(___,'categorical')
creates
a categorical array where each bin is a category. In most cases, the
default category names are of the form “[A,B)
”
(or “[A,B]
” for the last bin), where A
and B
are
consecutive bin edges. If you specify dur
as a
character vector, then the default category names might have special
formats. See Y
for a listing of the display formats.
[___] = discretize(___,'categorical',
,
for datetime or duration array inputs, uses the specified datetime
or duration display format in the category names of the output.displayFormat
)
[___] = discretize(___,'categorical',
also
names the categories in categoryNames
)Y
using the cell array
of character vectors, categoryNames
. The length
of categoryNames
must be equal to the number of
bins.
[___] = discretize(___,'IncludedEdge',
,
where side
)side
is 'left'
or
'right'
, specifies whether each bin includes its right or
left bin edge. For example, if side
is
'right'
, then each bin includes the right bin edge,
except for the first bin which includes both edges. In this
case, the j
th bin contains an element X(i)
if edges(j) < X(i) <= edges(j+1)
, where 1 <
j <= N
and N
is the number of bins. The
first bin includes the left edge such that it contains edges(1) <=
X(i) <= edges(2)
. The default for side
is
'left'
.
Examples
Input Arguments
Output Arguments
Tips
The behavior of
discretize
is similar to that of thehistcounts
function. Usehistcounts
to find the number of elements in each bin. On the other hand, usediscretize
to find which bin each element belongs to (without counting).
Extended Capabilities
Version History
Introduced in R2015a