重量Uncertainty
Synopsis
This operator calculates the relevance of attributes of the given ExampleSet by measuring the symmetrical uncertainty with respect to the class.
Description
The Weight by Uncertainty operator calculates the weight of attributes with respect to the label attribute by measuring the symmetrical uncertainty with respect to the class. The higher the weight of an attribute, the more relevant it is considered. Please note that this operator can be only applied on ExampleSets with nominal label. The relevance is calculated by the following formula:
relevance = 2 * (P(Class) - P(Class | Attribute)) / P(Class) + P(Attribute)
Input
example set
This input port expects an ExampleSet. It is output of the Retrieve operator in the attached Example Process.
Output
weights
This port delivers the weights of the attributes with respect to the label attribute. The attributes with higher weight are considered more relevant.
example set
The ExampleSet that was given as input is passed without changing to the output through this port. This is usually used to reuse the same ExampleSet in further operators or to view the ExampleSet in the Results Workspace.
Parameters
Normalize weights
This parameter indicates if the calculated weights should be normalized or not. If set to true, all weights are normalized in a range from 0 to 1.
Sort weights
This parameter indicates if the attributes should be sorted according to their weights in the results. If this parameter is set to true, the order of the sorting is specified using the排序方向parameter.
Sort direction
This parameter is only available when thesort weightsparameter is set to true. This parameter specifies the sorting order of the attributes according to their weights.
Normalize
This parameter indicates if the standard deviation should be divided by the minimum, maximum, or average of the attribute.
Number of bins
This parameter specifies the number of bins to be used.