文本挖掘excel使用集群ing to obtain confusion matrix

brunonbrasilbrunonbrasil MemberPosts:8Contributor II
edited January 2020 inHelp
嗨,
I'm new to rapidminer, I need to get the confusion matrix to validate clusters obtained from a text. Did you know how to do this?
Jasmine_

Best Answer

Answers

  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,635Unicorn
    Clusters are a form of unsupervised machine learning so it is not possible to generate a confusion matrix directly from clustering. You would first need to turn the clusters into a label and then have another process to assign the clusters to compare the two outputs. Or if you already have another existing label with the same number of categories as clusters, then you can use the Map Clusters on Labels operator to do this automatically and then use a normal Performance operator to generate the confusion matrix.
    Brian T.
    Lindon Ventures
    Data Science Consulting from Certified RapidMiner Experts
    Jasmine_ brunonbrasil
  • brunonbrasilbrunonbrasil MemberPosts:8Contributor II
    I built this model to classify the confusion matrix. I managed to get the confusion matrix but I don't know if it is the correct form. Does it make sense to you?




    Jasmine_
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager
    @brunonbrasilso based on that screenshot you are using a very old version of RapidMiner Studio. I would highly recommend updating to the most recent version (9.5.1).

    Scott
    Jasmine_ [Deleted User] brunonbrasil
  • brunonbrasilbrunonbrasil MemberPosts:8Contributor II
    The context I consider as a label, means the clusters that I obtain manually and compare with the clusters that I intend to obtain. The Receiver represents the data in sentences.

    Jasmine_
Sign InorRegisterto comment.