"Excel Text Processing-Frequency"

waynestate13waynestate13 MemberPosts:3Contributor I
edited May 2019 inHelp
Hello All,

I am new to RapidMiner and after watching some youtube tutorials and going through the tutorial I have some questions regarding text processing.

I have an excel file with about 300 comments (all comments in one line). I would like to process the data to ultimately have Rapid Miner come up with association rules. However, I am having trouble to process the data.

So far I am using: Read Document (uploading data in txt file) then from there process document where the vector is set up based on term occurrences. Inside the Process Document there is a tokenizer that is tokenizing based on an expressin (&). In the text file, after each comment I have included &.

However, I am not able to even get the data to read.

Thanks in advance for any feedback.

Answers

  • haddockhaddock MemberPosts:849Maven
    Hi Wayne,

    I think the problem is the '&', but without looking at your setup it's difficult to say; in regex use ampersands need to be escaped in order to be seen 'literally' (& -> \&), and not in their regex role. Just a thought.

    Hope that helps!
Sign InorRegisterto comment.