How can we implement dropna() in RapidMiner?

Anusha (Member, Posts: 19, Maven)
Hi All!

I have a dataset that has NA, N/A, null, NULL, and whitespace-only values (one or more spaces) in different cells. I just want to remove the rows that contain them.
Can anyone guide me?

Source Data:

C1 C2 C3 C4

12 ADNF NCJK NA
34 HDDW CNJ -(single space )
38 CNJKD JIC N/A
78 NJDS NCSW NULL
90 CJNEK C JDSK 12NJDNC
08 DNCJS CSKJ null
13 -(tab space) bdjf ndf097

Desired Data:

C1 C2 C3 C4

90 CJNEK C JDSK 12NJDNC

Thanks in Advance!

Best Answer

  • ceaperez (Member, Posts: 459, Unicorn)
    Solution Accepted
    Hi @Anusha,

    The Select Attributes operator gives you several ways to filter your dataset, for example by removing missing values or by matching regular expressions.

    Best
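
To illustrate the regular-expression idea mentioned above outside of RapidMiner, here is a minimal Python sketch. The pattern, the `is_placeholder` helper, and the way the sample rows are split into four cells (in particular treating "C JDSK" as one cell) are assumptions made for illustration, not something taken from the thread or from RapidMiner itself.

```python
import re

# Assumed pattern: a cell counts as a placeholder if it is exactly
# NA, N/A, null or NULL, or if it is empty / whitespace only.
PLACEHOLDER = re.compile(r"\s*|NA|N/A|null|NULL")


def is_placeholder(cell: str) -> bool:
    """Return True when the whole cell matches the placeholder pattern."""
    return PLACEHOLDER.fullmatch(cell) is not None


# Sample rows from the question, split into cells by assumption.
rows = [
    ["12", "ADNF", "NCJK", "NA"],
    ["34", "HDDW", "CNJ", " "],
    ["38", "CNJKD", "JIC", "N/A"],
    ["78", "NJDS", "NCSW", "NULL"],
    ["90", "CJNEK", "C JDSK", "12NJDNC"],
    ["08", "DNCJS", "CSKJ", "null"],
    ["13", "\t", "bdjf", "ndf097"],
]

# Keep only the rows in which no cell is a placeholder.
kept = [row for row in rows if not any(is_placeholder(c) for c in row)]
print(kept)
```

Using `fullmatch` rather than `search` matters here: a value such as "12NJDNC" contains no placeholder as a whole cell, so it is kept.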

Answers

  • MartinLiebig (Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor; Posts: 3,439; RM Data Scientist)
    Hi,
    First, use the Declare Missing Value operator to turn those placeholder values into actual missing values; then use Filter Examples with 'is not missing' to remove the affected rows.

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
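
The two-step logic described above (first declare placeholders as missing, then filter out rows that contain a missing value) can be sketched in plain Python as follows. This is only an analogy to the RapidMiner process, not its implementation; `MISSING_TOKENS`, `declare_missing`, and `drop_rows_with_missing` are hypothetical names, and the token list is assumed from the question.

```python
# Assumed placeholder tokens, taken from the values listed in the question.
MISSING_TOKENS = {"NA", "N/A", "null", "NULL"}


def declare_missing(cell):
    """Step 1: map placeholder text or whitespace-only cells to None."""
    if cell.strip() == "" or cell in MISSING_TOKENS:
        return None
    return cell


def drop_rows_with_missing(rows):
    """Step 2: keep only rows in which every cell is present (not None)."""
    declared = [[declare_missing(c) for c in row] for row in rows]
    return [row for row in declared if all(c is not None for c in row)]


sample = [
    ["78", "NJDS", "NCSW", "NULL"],
    ["90", "CJNEK", "C JDSK", "12NJDNC"],
    ["13", "\t", "bdjf", "ndf097"],
]
print(drop_rows_with_missing(sample))
```

Keeping the two steps separate mirrors the operator chain: the first step only changes how values are labelled, and the second step is the part that behaves like dropna().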