General rules that are followed while NaN or missing values are encountered:
- Rule1: The algorithm simply discards the data points where all the features have NaN values and does not use them while training.
- Rule 2: If a data point have a few NaN feature values then the algorithm will find the split on the basis of valid values first.