Text mining breaks text into symbols
Hello,
I'm trying to do text mining on a large Excel table with many text entries (many words per cell). Unfortunately, my "Process Documents from Files" operator breaks my text into a mixture of symbols and letters.
I actually do not know why it is doing that, and my word list looks like that, too.
Can you tell why this happens?
Thanks a lot
Imke
Answers
Hi,
Can you make sure that you tried the right encoding? It looks like this was stored as UTF-8 (the Mac/Linux standard) but read with a Windows encoding.
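To illustrate what such an encoding mismatch looks like, here is a minimal Python sketch. It assumes the text was written as UTF-8 and read as Windows-1252 (`cp1252`); the sample word and the helper names `misread` and `repair` are made up for this illustration and are not part of RapidMiner.

```python
def misread(text: str, written_as: str = "utf-8", read_as: str = "cp1252") -> str:
    """Simulate saving text in one encoding and opening it with another."""
    return text.encode(written_as).decode(read_as)


def repair(garbled: str, written_as: str = "utf-8", read_as: str = "cp1252") -> str:
    """Undo the mismatch by reversing the wrong decoding step."""
    return garbled.encode(read_as).decode(written_as)


if __name__ == "__main__":
    original = "Gr\u00f6\u00dfe"      # "Größe", hypothetical cell content
    garbled = misread(original)
    print(garbled)                    # each umlaut turns into two odd symbols
    print(repair(garbled))            # back to the original word
```

If the garbled output in the word list resembles this pattern (one special character becoming two symbols), the encoding parameter of the reading operator is the first thing to check. If the output is mostly unreadable symbols even where the text is plain English, the cause is more likely to be something other than the encoding.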
Br,
Martin
Dortmund, Germany
Hello,
Below you can see my process. Maybe you can tell what is wrong and why it also crashes RapidMiner.
Thank you
Imke
For reference,
the issue was that the files in the folder were Excel files. Process Documents from Files can only handle plain text files. The attached process solved the issue.
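A quick way to check whether the files in a folder are really plain text is to look at their first bytes. The following is a rough Python sketch, not part of the attached process: `.xlsx` files are ZIP archives and start with `PK\x03\x04`, while legacy `.xls` files are OLE2 containers and start with `D0 CF 11 E0 A1 B1 1A E1`. The function names and return labels are made up for this example.

```python
ZIP_MAGIC = b"PK\x03\x04"                         # .xlsx and other ZIP-based formats
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")    # legacy .xls and other OLE2 files


def sniff_format(head: bytes) -> str:
    """Classify a file by its leading bytes: 'zip', 'ole2', or 'other'."""
    if head.startswith(ZIP_MAGIC):
        return "zip"
    if head.startswith(OLE2_MAGIC):
        return "ole2"
    return "other"


def sniff_file(path: str) -> str:
    """Read only the first 8 bytes of the file and classify them."""
    with open(path, "rb") as handle:
        return sniff_format(handle.read(8))
```

A file reported as `zip` or `ole2` is a binary container, so reading it as text yields exactly the kind of symbol soup described above, regardless of the encoding setting. A result of `other` only means that neither signature matched; it does not prove the file is plain text.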
Best,
Martin
Dortmund, Germany