How to combine Logistic regression with SOM as a hybrid model?

komeil_shaerikomeil_shaeri MemberPosts:13Contributor II
edited November 2018 inHelp

Hi,

I need to combine Logistic regression with SOM or DBSCAN as a hybrid model. This will be a hybrid "Classification + Clustering" model in which a classifier can be trained first, and its output is used as the input for the cluster to improve the clustering results.

Thanks,

Tagged:

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    Just take your pre-processed (ETL'd) data, feed it into a X-val with your Logistic Regression, the use an apply model on the outside to to score your training set and put it into the clustering algo. Of course I'm simplifying it, but it should be quite easy to do.

    Update: Something like this?






















































  • komeil_shaerikomeil_shaeri MemberPosts:13Contributor II

    Thanks for your response ...

    The problem is when I hybridize the algorithms, the performance measures (accuracy, precision, recall) don't change even if I disable the x-validation operator which contains the logistic regression. I don't know why logistic regression cannot affect the overall performance...

    Please see the attached file.

    Thanks

    111.PNG 33.3K
  • komeil_shaerikomeil_shaeri MemberPosts:13Contributor II

    Hi,

    In this example, first I have applied decision tree (DT) on Titanic data. The resulting accuracy is 80.29%.

    When the DT is hybridized with Fuzzy C-means (FCM), still the performance accuracy is 80.29%. This means that the system does not take into account the FCM. Is there another way to integrate the Classification and Clustering models? Can you help me on this issue?

    DT process:


    <过程version = " 7.2.002 " >



































    <参数值= "苏尔vived" key="attribute_name"/>















    <参数值= " true "键= " apply_pruning " / >



    <参数值=“2”键= " minimal_leaf_size"/>


















































    DT-FCM process:


    <过程version = " 7.2.002 " >



































    <参数值= "苏尔vived" key="attribute_name"/>















    <参数值= " true "键= " apply_pruning " / >



    <参数值=“2”键= " minimal_leaf_size"/>



































    <操作符类= name = " Nomin“nominal_to_numerical”al to Numerical" expanded="true" compatibility="7.2.002" activated="true" y="187" x="447" width="90" height="103">






























































    Many thanks,

    Komeil

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    I'm a bit confused as to why you want to first classify the data and then segment it? These are two different methods of learning (Supervised and Unsupervised). In the supervised method you start with knowing the truth, you know who died and didn't die in the Titanic disaster. Normally, in the Unsupervised way, you typically don't have a class label and look for statisical characteristics that 'segment' like groups together. In what you are trying to do here is build a model on the Titatinic data set with a label and then throw out that label and segement out the regular attributes. You will get different performance measures for sure, one for a classification problem and the other for a segementation problem.

    If you're looking to combine multiple algorithms, have you tried our stacking (ensembing) operator?

  • komeil_shaerikomeil_shaeri MemberPosts:13Contributor II

    Stacked Generalization is good for combining multiple classifiers. I'm wondering if is there any way to combine clustering techniques with each other? I heard about "Consensus Clustering" which is similar to stacking but for clustering methods.

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    Maybe what you can do is select one class from the Logistic Regression result and then pass that to the clustering process. This way you can segment out those attributes for the single class.

Sign InorRegisterto comment.