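# Python 2 source: the ur'' prefix and the u'...' reprs in stdout are Python 2 only.
import re

# Split after ., ? or ! when followed by whitespace and then a capital letter or digit,
# unless the punctuation directly follows one of the titles Mr, Dr, Ms, Jr, Sr.
p = re.compile(ur'(?<!Mr|Dr|Ms|Jr|Sr)[.?!]+\s(?=[A-Z0-9])')
test_str = u"Mr. Smith bought cheapsite.com for 1.5 million dollars i.e. he paid a lot for it. Did he mind? Adam Jones Jr. thinks he didn't. In any case, this isn't true... Well, with a probability of .9 it isn't.23 is the ish. My name is! Why wouldn't you... this is.\nAndrew"
# The matched punctuation and whitespace are consumed, so the pieces lose their terminators.
print(p.split(test_str))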
Success #stdin #stdout 0.01s 9016KB
stdin
Standard input is empty
stdout
[u'Mr. Smith bought cheapsite.com for 1.5 million dollars i.e. he paid a lot for it', u'Did he mind', u"Adam Jones Jr. thinks he didn't", u"In any case, this isn't true", u"Well, with a probability of .9 it isn't.23 is the ish", u'My name is', u"Why wouldn't you... this is", u'Andrew']
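
The same pattern can be tried under Python 3, where the `ur''` prefix is a syntax error and a plain raw string is used instead. The following is a minimal sketch, not part of the original paste: the function name `split_sentences` and the sample string are hypothetical, chosen only for illustration.

```python
import re

# Same pattern as in the paste, written as a Python 3 raw string.
# The negative lookbehind only works because every alternative is exactly
# two characters wide; Python's re module requires fixed-width lookbehinds.
SENTENCE_BOUNDARY = re.compile(r'(?<!Mr|Dr|Ms|Jr|Sr)[.?!]+\s(?=[A-Z0-9])')


def split_sentences(text):
    """Split text at sentence boundaries (hypothetical helper name).

    The terminating punctuation and the following whitespace character are
    part of the match, so they are dropped from every piece except the last.
    """
    return SENTENCE_BOUNDARY.split(text)


if __name__ == "__main__":
    sample = "Dr. Who arrived. He left! Was it 5 pm? Yes."
    print(split_sentences(sample))
    # ['Dr. Who arrived', 'He left', 'Was it 5 pm', 'Yes.']
```

As in the stdout above, abbreviations outside the lookbehind list are only safe when the next word starts in lower case: "i.e. he" is kept together, but a capital letter or digit after "i.e. " would trigger a split.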