"[SOLVED] Empty Word List"
Hi All,
I am counting the occurrences of words in a txt document. The text document has abstracts of other documents, as well as the document title. The general format of the file is such:
...
This continues for roughly 36,00 documents. The total size of the document is 46MB. I am expecting to get a word list of word occurrences as a result. What I actually get is an empty word list. Here is my attached process:
Please let me know what I am doing wrong. Thanks.
I am counting the occurrences of words in a txt document. The text document has abstracts of other documents, as well as the document title. The general format of the file is such:
...
This continues for roughly 36,00 documents. The total size of the document is 46MB. I am expecting to get a word list of word occurrences as a result. What I actually get is an empty word list. Here is my attached process:
I used this youtube video as a guide:https://www.youtube.com/watch?feature=endscreen&;NR=1&v=EjD2M4r4mBM
<宏/ >
<运营商激活= " true " class = "文本:process_documents" compatibility="5.2.004" expanded="true" height="94" name="Process Documents" width="90" x="447" y="75">
Please let me know what I am doing wrong. Thanks.
Tagged:
0
Answers
it might be helpful if you check the option "create word vector" in the Process Documents operator
Additionally, you are reading only one document, but your pruning settings are configured to ignore words which appear in less than two documents. So for testing I suggest to disable pruning.
Happy mining,
Marius
After changing options, it is generally a good idea to hit "enter" or click somewhere on the process pane to make sure that the changes are actually submitted. Maybe the options were not applied when you hit the run button (yes, this needs improvement :-\ )
Best, Marius