Decision tree prediction's accuracy
amabdellatif
MemberPosts:2Contributor I
Would you please advise with the following:
1- How to increase the accuracy of the decision tree block?
2- Based on what shall I choose the decision tree parameter's value?
3- In case you the use of the "Optimizer" is recommended, is there any document that explains and define explicitly each parameter?
Thanks in advance and waiting for your response.
Tagged:
0
Answers
Hi@amabdellatif, actually the answer depends on what you are trying to predict.
For example
You should pick the setting that is most relevant to your data, (usually Gain Ratio, but not always)
Can you tell us a little more about what you want to do? Also explore your data to see how many of each class there are, another problem you may run into unexpectedly is if you have imbalanced data (one class is much higher than the other). This means your decision tree focusing on accuracy might be 99% accurate by predicting everything all as a single class.
For the documentation have you tried the help files for the operator? It's pretty useful as an explanation.
Hello@JEdward
thanks a lot for your reply.
With regards to your questions about the data that I am trying to predict, is "the subscription of term deposits" for customers in a Bank (This is not a real data - trial version, not the real data)
Attached is the whole data set if you can help me to figure out how should I start thinking.
thanks in advance for your time and attention
PS: The file attached is a .xlsx file, I just changed its extention to .dox to be able to upload it