Information Selection for Regression

Xin_WangXin_Wang MemberPosts:2Contributor I
edited November 2018 inHelp
我使用information selection extension package for noise in label detection. The targets of my problem are numerical numbers. I am trying to use CNN but it does not select any examples. Does the package support regression as well or is it strictly for classification. If it also supports regression, what parameters are needed to set the process up correctly?






<宏/ >







































Tagged:

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University ProfessorPosts:3,404RM Data Scientist
    Dear Xing,

    说实话,我必须承认我从未使用过这个s extension. It was always on my todo list. Marcin, the auther of the package, is also active on this board. You might send him a private message:http://rapid-i.com/rapidforum/index.php?action=profile;u=2560

    ~Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Xin_WangXin_Wang MemberPosts:2Contributor I
    Thanks a lot for the tip, Martin!
  • marcin_blachnikmarcin_blachnik MemberPosts:61Guru
    In the Information Selection toolbox, eachSelect by ...operator (such as CNN, ENN etc) has Decision Function parameter. The Decision Function defines method used to measure the error. For classification problem this should be configured to Class Loss, but for regression problem it can be ThresholdLinearLoss or Local Threshold Linear Loss (I don't recomment the other two with "Relative" key word).The decision function is executed when the algorithm takes a decision to select or reject given example. Both Decision Functions for regression are similar, and they can be written as IF abs(label_predicted - label_truth) < STD * threshold.
    The threshold parameter must be configured for both regression decision functions. It defines the maximum acceptable difference between label predicted by nearest neighbors and the true label. In the case of Local decision function the threshold is multiplied by standard deviation of labels of the nearest examples (the noise estimation parameter defines number of nearest neighbors considered to estimate standard deviation of labels). The local function is more robust buth more computationally expensive. For non local decision function STD = 1;

    Hope it helped
    Best
    Marcin
Sign InorRegisterto comment.