Replacing Miscoded Values/terms
Hi! My data set should only include these entries:Positive
andNegative. However, there are miscoded entries such as Neg, NEG, and Pos. I tried the "Replace" operator for a couple of times now but some correct entries change too. For instance, I want to change Neg to Negative, the entries with the correct word Negative become Negativeative. How can I correct this?
0
Best Answer
-
BalazsBarany Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified ExpertPosts:949UnicornHi!
^neg.* would be a good expression, especially if you switch on case-insensitive matching.
^ means "beginning of the string". This makes sure that your text starts with "neg". The .* after it is "any sequence of characters". This will match neg, negative, negativeative etc.
Replace is good for replacing text that you can express as a regular expression. If you have a list of values, Map can be easier to use without regular expressions.
Regards,
Balázs1
Answers