RM 9.1 feedback : Auto-Model limitation

lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, MemberPosts:1,195Unicorn
edited June 2019 inHelp
Hi,

I work with a dataset containing 96 examples and thus I can't use Auto-Model because the new min number of examples is 100 !
Is there any reason to this new limitation ?


Regards,

Lionel
Tagged:

Best Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University ProfessorPosts:3,404RM Data Scientist
    Solution Accepted
    Hi@lionelderkrikor,
    i guess the answer is that the new features would overfit too much?@IngoRM?
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
    lionelderkrikor
  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University ProfessorPosts:1,751RM Founder
    Solution Accepted
    Hi,
    Yes, indeed. Plus we changed the validation approach a bit (see some of the other threads in the community - I will post answers there soon as well) to get to more robust estimations. This unfortunately meant that we need more data for the validation part of the models which required to increase the limit from the 50 rows to 100.
    We have looked into the statistics and it seemed that less than 3% of all AM runs have been on data sets of less than 100 rows and while we are sorry that we had to increase the limit (making the life harder for those 3% of the runs) we still believe that the improvements in validation and the addition of feature engineering justified this decision.
    Again, sorry for the inconvenience & best,
    Ingo
    sgenzer lionelderkrikor MartinLiebig

Answers

Sign InorRegisterto comment.