How can I split a text into sentences?

The Natural Language Toolkit (nltk.org) has what you need. This group posting indicates this does it:

import nltk.data

tokenizer = nltk.data.load('tokenizers/punkt/english.pickle')
fp = open("test.txt")
data = fp.read()
print '\n-----\n'.join(tokenizer.tokenize(data))

(I haven’t tried it!)

More Related Contents:

How to split text without spaces into list of words
Python: Split string into multiple lines using delimiter
How to split a string into a list?
Split a string by a delimiter in python
BeautifulSoup Grab Visible Webpage Text
How might I remove duplicate lines from a file?
Splitting a string into words and punctuation
Inserting Line at Specified Position of a Text File
Only extracting text from this element, not its children
Reading a text file and splitting it into single words in python
Product code looks like abcd2343, how to split by letters and numbers?
How to use python-docx to replace text in a Word document and save
Remove very last character in file
split a generator/iterable every n items in python (splitEvery)
Convert list of strings to dictionary
Box around text in matplotlib
How do I separate my models out in django?
Add advanced features to a tkinter Text widget
Styling part of label in legend in matplotlib
Converting pandas column of comma-separated strings into dummy variables
Is there a way to split a string by every nth separator in Python?
Is there a function in python to split a word into a list? [duplicate]
How to split by comma and strip white spaces in Python?
Split string at every position where an upper-case word starts
Who originally invented this type of syntax: -*- coding: utf-8 -*- [duplicate]
Why does str.split not take keyword arguments?
Split strings in tuples into columns, in Pandas
Find text position in PDF file
Splitting a math expression string into tokens in Python
Split pandas dataframe in two if it has more than 10 rows

More Related Contents:

Leave a Comment Cancel reply