Dear all, I need help: I have a log file (a text file) containing about 500 lines, and I need to count how often each statement appears
Ahmedte1234
Member Posts: 3 Contributor I
The lines in the file look like this:
1 Jan 10:00 the chassis normal status
1 Jan 10:30 log I'd lost
1 Jan 12:30 interface down
1 Jan 1:00 power off system
2 Jan 11:00 the high temperature
2 Jan 2:00 the user log in successfully
And a lot of statements like that; some statements are useful and some are not.
So the output should look like this:
Down appears 10 times
Power off appears 1 time
Interface down appears 3 times
I also need an algorithm that suggests the most frequent words and how many times each appears in the file, and a way to reduce the file to the lines that match a certain pattern.
Best Answer
varunm1 Moderator, Member Posts: 1,207 Unicorn
You can go through the FP-Growth tutorial in RapidMiner. Type "FP" in the operator search, then drag and drop the operator into your process; if you click on the operator, you can see its tutorials in the Help window. See the screenshot below. You can also watch the tutorial on the Academy here: https://academy.www.turtlecreekpls.com/learn/video/text-association-rules
Regards,
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Answers
First, install the "Text Processing" and "Web Mining" extensions from the Marketplace in RapidMiner. To count how often words repeat in your document, you first need to read your text file into RapidMiner. Then you can use the XML code below (click on "show") to extract details about your data; attach your own text file instead of the one in this XML. To use the XML, open a new blank process in RapidMiner and enable the XML panel via View --> Show Panel --> XML in the menu bar. Copy the code from here, paste it into the XML panel of the new process, and click the green tick mark, which will show you the process as seen in the figure below. Once you have it, delete the existing Retrieve operator and attach the file you imported into RapidMiner. I also attached the result of the process based on the sample data you provided. The term occurrences value gives the number of times each word is repeated in your file. There are also multiple community samples that help to understand how TF-IDF works.
<operator activated="true" class="nominal_to_text" compatibility="9.2.001" expanded="true" height="82" name="Nominal to Text" width="90" x="246" y="136">
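If you want to sanity-check the term occurrences outside RapidMiner, the same kind of count can be sketched in a few lines of Python. This is only a rough illustration, not what the RapidMiner process does internally. It assumes every line has the layout "<day> <month> <hh:mm> <message>" as in the sample from the question; the names `SAMPLE_LOG`, `LINE_RE`, `messages`, `count_terms` and `count_patterns` are made up for this example.

```python
import re
from collections import Counter

# Sample lines copied from the question; a real run would read the log file instead.
SAMPLE_LOG = """\
1 Jan 10:00 the chassis normal status
1 Jan 10:30 log I'd lost
1 Jan 12:30 interface down
1 Jan 1:00 power off system
2 Jan 11:00 the high temperature
2 Jan 2:00 the user log in successfully
"""

# Assumed line layout: "<day> <month> <hh:mm> <message>".
LINE_RE = re.compile(r"^\s*\d{1,2}\s+[A-Za-z]{3}\s+\d{1,2}:\d{2}\s+(.*)$")


def messages(lines):
    """Yield the lower-cased message part of each line that matches LINE_RE."""
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            yield match.group(1).strip().lower()


def count_terms(lines):
    """Count how often each word occurs across all messages."""
    counts = Counter()
    for message in messages(lines):
        counts.update(re.findall(r"[a-z']+", message))
    return counts


def count_patterns(lines, patterns):
    """Count the messages that contain each phrase in `patterns`."""
    msgs = list(messages(lines))
    return {p: sum(1 for m in msgs if p.lower() in m) for p in patterns}


if __name__ == "__main__":
    lines = SAMPLE_LOG.splitlines()
    # Most frequent words with their counts.
    print(count_terms(lines).most_common(3))
    # How many messages contain each phrase of interest.
    print(count_patterns(lines, ["power off", "interface down"]))
```

For the real file, `lines` could come from `open("your_log.txt", encoding="utf-8")` (the file name is a placeholder). Passing only the phrases you care about to `count_patterns` is one simple way to reduce the output to the useful statements.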
Hope this helps. Please let me know if you are looking for something different.
Varun