"Finding Top relevant document in kmeans cluster"

amir_askary_shaamir_askary_sha MemberPosts:11Contributor I
edited June 2019 inHelp

Hi,

After running kmeans clustering, how can I find out which document is the most relevant (top document) in one cluster?

Right now the documents in a cluster are sorted ascendingly by their id. I want to have them sorted by a weight score showing how relevant this document is in this cluster, or at least to see the most relevant doc in the cluster.

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University ProfessorPosts:3,438RM Data Scientist

    Hi,

    你如何定义相关性吗?

    Best,

    Martin

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • amir_askary_shaamir_askary_sha MemberPosts:11Contributor I

    I don't know exactly; any kind of relevancy. For example let's say every cluster has some top words in it (the centroids that kmeans finds), and then the document which has the shortest cosine/euclidian distance to those top words of the cluster, is the most relevant doc in the cluster.

Sign InorRegisterto comment.