Handle Unknown Values
Synopsis
This operator collects the known nominal values of the data it is applied to and makes it possible to replace unknown values later. The result is a preprocessing model that can be applied to new data sets: any nominal value in those data sets that is not known to the model is replaced by a missing value.
Description
This operator collects all values of all nominal columns in a data set and stores them in a preprocessing model. The operator does not change the input data at all, but such a preprocessing model is very useful when you want to ensure that new data sets only use nominal values that have been seen before. Many models cannot deal well with nominal values they have not seen before and may break if you do not handle this beforehand.
If the preprocessing model is applied to a new data set, all nominal values that were not part of the input of this operator are replaced by missing values. These can then be handled by a regular missing value handling operator.
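The two steps described above (remembering the known values, then replacing unknown ones on new data) can be illustrated with a minimal Python sketch. This is not the operator's actual implementation or API: the function names collect_known_values and replace_unknown_values, the representation of rows as dictionaries, and the use of None for a missing value are all assumptions made for illustration.

```python
from typing import Dict, List, Optional, Set

# Hypothetical row representation: column name -> nominal value (None = missing).
Row = Dict[str, Optional[str]]


def collect_known_values(rows: List[Row], nominal_columns: List[str]) -> Dict[str, Set[str]]:
    """Build the 'preprocessing model': the set of values seen per nominal column."""
    known: Dict[str, Set[str]] = {col: set() for col in nominal_columns}
    for row in rows:
        for col in nominal_columns:
            value = row.get(col)
            if value is not None:  # missing values are not remembered as known
                known[col].add(value)
    return known


def replace_unknown_values(rows: List[Row], known: Dict[str, Set[str]]) -> List[Row]:
    """Apply the model: values not seen before become missing (None)."""
    result: List[Row] = []
    for row in rows:
        new_row = dict(row)  # copy, so the input rows stay unchanged
        for col, values in known.items():
            if col in new_row and new_row[col] not in values:
                new_row[col] = None
        result.append(new_row)
    return result


# Illustrative data only.
training = [{"color": "red"}, {"color": "blue"}]
model = collect_known_values(training, ["color"])

new_data = [{"color": "red"}, {"color": "green"}]
cleaned = replace_unknown_values(new_data, model)
# "green" was never seen in the training data, so it is now missing (None).
```

In this sketch the training rows are left untouched, mirroring the fact that the operator passes its input through unchanged and only the later application to new data alters values.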
Input
example set input
This port expects an ExampleSet for which all nominal values should be remembered.
Output
example set output
The processed data, which is identical to the input data: no processing takes place, the nominal values are only remembered.
original
The original data set.
preprocessing model
You can apply this model to new data sets with Apply Model so that all nominal values that were not part of the input data are replaced by missing values.