Optimize Parameters (Evolutionary)
Synopsis
This operator finds the optimal values of the selected parameters of the operators in its subprocess. It uses an evolutionary computation approach.
Description
This operator finds the optimal values for a set of parameters using an evolutionary approach which is often more appropriate than a grid search (as in the Optimize Parameters (Grid) operator) or a greedy search (as in the Optimize Parameters (Quadratic) operator) and leads to better results. This is a nested operator i.e. it has a subprocess. It executes its subprocess for a multiple number of times to find optimal values for the specified parameters.
This operator delivers the optimal parameter values through theparameterport which can also be written into a file with the Write Parameters operator. This parameter set can be read in another process using the Read Parameters operator. The performance vector for optimal values of parameters is delivered through theperformanceport. Any additional results of the subprocess are delivered through theresultports.
Other parameter optimization schemes are also available in RapidMiner. The Optimize Parameters (Evolutionary) operator might be useful if the best ranges and dependencies are not known at all. Another operator which works similar to this parameter optimization operator is the Loop Parameters operator. In contrast to the optimization operators, this operator simply iterates through all parameter combinations. This might be especially useful for plotting purposes.
Differentiation
Optimize Parameters (Grid)
The Optimize Parameters (Grid) operator executes its subprocess for all combinations of the selected values of the parameters and then delivers the optimal parameter values.
Input
input
This operator can have multiple inputs. When one input is connected, anotherinputport becomes available which is ready to accept another input (if any). The order of inputs remains the same. The Object supplied at the firstinputport of this operator is available at the firstinputport of the nested chain (inside the subprocess). Do not forget to connect all inputs in correct order. Make sure that you have connected the right number of ports at the subprocess level.
Output
performance
This port delivers the Performance Vector for the optimal values of the selected parameters. A Performance Vector is a list of performance criteria values.
parameter
This port delivers the optimal values of the selected parameters. This optimal parameter set can be written into a file with the Write Parameters operator. The written parameter set can be read in another process using the Read Parameters operator.
result
Any additional results of the subprocess are delivered through theresultports. This operator can have multiple outputs. When oneresultport is connected, anotherresultport becomes available which is ready to deliver another output (if any). The order of outputs remains the same. The Object delivered at the firstresultport of the subprocess is delivered at the firstresult港口的操作符。别忘了欺诈nect all outputs in correct order. Make sure that you have connected the right number of ports.
Parameters
Edit parameter settings
The parameters are selected through theedit parameter settingsmenu. You can select the parameters and their possible values through this menu. This menu has anOperatorswindow which lists all the operators in the subprocess of this operator. When you click on any operator in theOperatorswindow, all parameters of that operator are listed in theParameterswindow. You can select any parameter through the arrow keys of the menu. The selected parameters are listed in theSelected Parameterswindow. Only those parameters should be selected for which you want to find optimal values. This operator finds optimal values of the parameters in the specified range. The range of every selected parameter should be specified. When you click on any selected parameter (parameter in theSelected Parameterswindow) theGrid/Rangeoption is enabled. This option allows you to specify the range of values of the selected parameters. TheMinandMaxfields are for specifying the lower and upper bounds of the range respectively. Thestepsandscaleoptions are disabled for this operator. Note that only numerical parameters are displayed, since this operator does not support non numerical parameters.
Error handling
This parameter allows you to select the method for handling errors occurring during the execution of the inner process. It has the following options:
- fail_on_error: In case an error occurs, the execution of the process will fail with an error message.
- ignore_error: In case an error occurs, the error will be ignored and the execution of the process will continue with the next iteration.
Max generations
This parameter specifies the number of generations after which the algorithm should be terminated.
Use early stopping
This parameter enables early stopping. If not set to true, always the maximum number of generations are performed.
Generations without improval
This parameter is only available when theuse early stoppingparameter is set to true. This parameter specifies the stop criterion for early stopping i.e. it stops afterngenerations without improvement in the performance.nis specified by this parameter.
Specify population size
This parameter specifies the size of the population. If it is not set to true, one individual per example of the given ExampleSet is used.
Population size
This parameter is only available when thespecify population sizeparameter is set to true. This parameter specifies the population size i.e. the number of individuals per generation.
Keep best
This parameter specifies if the best individual should survive. This is also called elitist selection. Retaining the best individuals in a generation unchanged in the next generation, is called elitism or elitist selection.
Mutation type
This parameter specifies the type of the mutation operator.
Selection type
该参数指定t的选择方案his evolutionary algorithms.
Tournament fraction
This parameter is only available when theselection typeparameter is set to 'tournament'. It specifies the fraction of the current population which should be used as tournament members.
Crossover prob
The probability for an individual to be selected for crossover is specified by this parameter.
Use local random seed
This parameter indicates if alocal random seedshould be used for randomization. Using the same value oflocal random seedwill produce the same randomization.
Local random seed
This parameter specifies thelocal random seed. This parameter is only available if theuse local random seedparameter is set to true.
Show convergence plot
This parameter indicates if a dialog with a convergence plot should be drawn.