prediction confidence

navnav MemberPosts:28Maven
edited October 2019 inHelp
hello, someone can explain me how the prediction confidence columns work and how are calculated when I apply a classification model on a test set. Thanks.
Tagged:

Answers

  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University ProfessorPosts:1,751RM Founder
    Hi,

    whoa, that's a pretty broad question with no definite answer for all learning schemes.

    In general, the prediction confidences state how sure the model was for each of the possible values. This is similar to probabilities ("how large is the probability that the class is "positive"?) but not necessarily the same.

    How they are calculated? Well, that differs for all model types. For schemes like Naive Bayes and Logistic Regression, the confidences are indeed the probabilities based on the seen training data. If you use an SVM and apply scalings like Platt scaling, it is at least pretty close. For other schemes, things might be different. For example, the confidences of decision trees are the fraction of the class in the applicable leaf against the total number of cases in this leaf.

    There are only two ways: Simply accept the confidences as a measurement of how sure the model is and believe it. Or do it the hard way: read all the literature about all the model types and learn how they are calculated in detail. The source code might also help here.

    Cheers,
    Ingo
  • adamanadaman MemberPosts:17Maven
    Hey together,

    this is all fine for me, no need to understand it all in detail, but i would like to put a threshold on the confidence after the model applier is finsihed to get only some examples with a lower or higher threshold. But i can´t, as the confidence is a spezial attribute? or am i doing something complete wrong
  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University ProfessorPosts:1,751RM Founder
    Hi,

    well, you have several options for this.

    You could use
    • the operator "Generate Attributes" (you will have to rename the confidence attributes before since the parentheses would cause problems otherwise...)
    • one of the discretization operators
    • the operator "Drop Uncertain Predictions" (although this one does not exactly divide your data into discrete bins...)
    If the fact that the confidence is a special attribute is a problem somewhere, you could either check the setting "include special attributes" or use the operator "Set Role" before the data transformation is applied.

    Here is an example using the operator "Generate Attributes":










































    (2 <连接from_op = "生成数据)" from_port="output" to_op="Add Noise (2)" to_port="example set input"/>


    <连接from_op = "重命名代替“from_port = "example set output" to_op="Generate Attributes" to_port="example set input"/>








    And here is an example using one of the discretization operators:






































    (2 <连接from_op = "生成数据)" from_port="output" to_op="Add Noise (2)" to_port="example set input"/>










    Cheers,
    Ingo
  • adamanadaman MemberPosts:17Maven
    thx :-) for the hints
  • sinead_brackensinead_bracken MemberPosts:5Contributor I

    Hi there,

    just to add on this:

    are the values indicative of how sure we are in the sense of if the confidence value is 0.785, could we say we are 78.5% confident that this prediction falls into this category?

    Or is it more along the lines of 78.5% of entries like this fall into this category too?

  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,635Unicorn
    Another good, but sometimes difficult, question. Generally the confidence can be interpreted in both senses, because the first sense (confidence in the prediction) is actually based on the second sense (distribution of similar records). However, this number itself is highly dependent on the specifics of the training dataset, so it is susceptible to "drift" when applying the score to other datasets. Most of the time, the scores are more robust as rank-ordering tools, so that even if the underlying distributions of classes shift, they preserve the correct ordering even if the absolute probabilities shift.
    Brian T.
    Lindon Ventures
    Data Science Consulting from Certified RapidMiner Experts
    sgenzer
  • rafeenarafeena MemberPosts:14Contributor II
    @sinead_brackenand everyone else. regarding this question does it mean the confidence is like the accuracy measurement of the performance
  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,635Unicorn
    No, I would say that accuracy is something different, at least as usually defined in a machine learning context. Accuracy is typically a measure of overall model performance, such as derived from the confusion matrix for a classification problem, and as shown in the performance operators in RapidMiner.
    This is related but is ultimately not the same as the confidence for an individual prediction (or even set of predictions) and it is itself subject to skew based on the confidence threshold selected for classification purposes (see the earlier part of this same thread for a discussion of setting thresholds).

    Brian T.
    Lindon Ventures
    Data Science Consulting from Certified RapidMiner Experts
Sign InorRegisterto comment.