Remove correlated features from training set and apply the same features to test set
Hello all,
I just wondering how you achieve to remove pairwise correlated features from your training set (using the Remove Correlated Attributes operator) and apply the same features to your test set? If I should compare this operation to something I think about the "Apply feature set" (as exists for the features selection operator) or somewhat OHE and the Preprocessing model output. See screenshot below of the process. I have normally these two training and test preprocessing operations in two different processes.
data:image/s3,"s3://crabby-images/94c82/94c82b3e35be61f5c838b42a7c84ca78dfcd503e" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/be/20x5b8we3r4l.png"
Thanks for your help.
Tagged:
0
Best Answers
-
MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University ProfessorPosts:3,400
RM Data Scientist
Hi@Andy3,
you usually don't need to do it. Keep in mind that Apply Model is ignoring additional attributes.
Best,Martin
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany5 -
MartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University ProfessorPosts:3,400
RM Data Scientist
Hi@Andy3,if you need to do it, you can use Data to Weights for it. Attached is an example.
BR,Martin
<宏/ >
<参数键= value =“process_duration_for_mail30"/>
<参数键=“k”值="10"/>
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany2
Answers