How to Segment Text Effectively Using TextTiling with Python 3.x

koushikdeb2009koushikdeb2009 MemberPosts:2Contributor I
edited December 2018 inHelp

I have written these line of codes

f = open("1.ref","r")
alice=f.read()
tt = nltk.tokenize.TextTilingTokenizer()
tiles = tt.tokenize(alice[0:2000])
print(tiles) #Total text in a single valued list

I need to consider full text.If I omit [0:2000] ,then getting an error

TypeError: slice indices must be integers or None or have anindexmethod

Spoiler

While printing tiles ,I am getting full text.I need to show segmented text.

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    @koushikdeb2009can you post the XML process using the button? It will be helpful to see how you formed the python script in Execute Python operator.

    sgenzer
  • koushikdeb2009koushikdeb2009 MemberPosts:2Contributor I

    There is no XML process.Its program to implement Text Tile process.If there is any code sample to implement texttiling properly with python 3.x ,then please send the link.It will be great help to my project.

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    @koushikdeb2009we usually see python executed in RapidMiner via the Execute Python operator, hence me asking if you can post your XML process. If you're strictly working in python, then I'd suggest visiting a forum related to Text Tiling.

Sign InorRegisterto comment.