"Compute Mean Value of Each Cluster for an Ignored Attribute"

goh_han_pin Member Posts: 1 Contributor I
edited June 2019 in Help

During clustering, how do I ignore an attribute and then later display the average value of that ignored attribute for each cluster?

My scenario is as follows:

I have a dataset.

Each example is a student, with a set of attributes.

The attributes are the student’s ‘input’ characteristics, plus one attribute holding the student’s achievement test score.

How do I ignore the students’ test scores during clustering, so that the clustering is based only on the students’ input characteristics?

Then, at the end of the clustering process, how do I link the students in each cluster back to their achievement test scores and compute the mean (average) test score for each cluster?

H P

Answers

  • lionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn

    Hi @goh_han_pin,

    Could you please share your dataset, so that I can test a process that does what you need?

    Regards,

    Lionel

  • MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,404 RM Data Scientist

    Hi,

    The attached process should do it.

    Best,

    Martin
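
    [Attached process XML; only the root operator tag and one annotation survived extraction]

    <operator activated="true" class="process" compatibility="8.0.001" expanded="true" name="Process">

    Exclude attribute_1 for clustering
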
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
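
For readers who cannot open the attached process, the general idea (cluster on the input attributes only, then average the excluded attribute per cluster) can be sketched outside RapidMiner. The following is only an illustration under stated assumptions, not the attached process: the data, the attribute names (`hours_studied`, `attendance_rate`, `test_score`) and the small `kmeans` helper are all hypothetical, and the helper is a bare-bones stand-in for whatever clustering operator you actually use.

```python
import random
from statistics import mean


def kmeans(points, k, iterations=20, seed=0):
    """Minimal k-means (hypothetical helper); returns one cluster label per point."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assign each point to its nearest centroid (squared Euclidean distance).
        labels = [
            min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            for p in points
        ]
        # Move each centroid to the mean of its members; leave it if the cluster is empty.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(mean(col) for col in zip(*members))
    return labels


# Made-up example data: (hours_studied, attendance_rate, test_score)
students = [
    (2.0, 0.60, 55.0),
    (2.5, 0.65, 60.0),
    (3.0, 0.55, 50.0),
    (8.0, 0.95, 85.0),
    (9.0, 0.90, 90.0),
    (8.5, 0.92, 80.0),
]

# Step 1: keep the test score out of the data that is clustered.
features = [s[:2] for s in students]
scores = [s[2] for s in students]

# Step 2: cluster on the input characteristics only.
labels = kmeans(features, k=2)

# Step 3: attach the scores back by row position and average them per cluster.
scores_by_cluster = {}
for label, score in zip(labels, scores):
    scores_by_cluster.setdefault(label, []).append(score)
mean_score = {label: mean(vals) for label, vals in scores_by_cluster.items()}

print(mean_score)
```

With this toy data the two clusters come out with mean scores of 55.0 and 85.0; which cluster number gets which value is arbitrary. Because the scores are only set aside and never reordered, rows can be matched back by position. In RapidMiner itself, as far as I know, the usual equivalent is to give the score attribute a special role so the clustering operator skips it, then aggregate the score by the cluster attribute.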