"Bug in PCA operator output"

Legacy UserLegacy User MemberPosts:0Newbie
edited May 2019 inHelp
Input file:

X1 X2 X3
1. 1. 0.
1. 1. 1.
1. 1. 2.
0. 0. 0.
0. 0. 1.

Experiment:


















The log shows:

Principal Components:
Variance Threshold: 0.95
PC 1: + 0.439 * X1 - 0.554 * X2 + 0.707 * X3
PC 2: + 0.439 * X1 - 0.554 * X2 - 0.707 * X3
电脑3:+ 0.784 * X1 + 0.621 * X2 -0.000 * X3
(created by PCA)

I think the correct components are transpose of this, so the log should show:

PC1 = 0.439*X1 + 0.439*X2 + 0.784*X3
Pc2 = -0.554*X1 - 0.554*X2 ...




Answers

  • Legacy UserLegacy User MemberPosts:0Newbie
    Just to clarify - the incorrect (transposed) values are printed only in the log.

    The PCA matrix on the screen (tab Eigenvectors view) is displayed correctly.
    The next step, model applier, also uses the correct values.

  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University ProfessorPosts:1,751RM Founder
    Hi Vector,

    thanks for sending this in. And especially for the additional note - I got a slight shock before you mentioned that it is only transposed in the log out. Puh...:D

    I have changed the logging output and the fixed version is available via CVS in a few hours.

    Cheers,
    Ingo









Sign InorRegisterto comment.