On Thu, May 5, 2016, at 04:41, Steven D'Aprano wrote: > > There's no situation where "&&&&&" and " " will exist in the given > > dataset, and recognizing that is important. You don't have to account > > for every bit of nonsense. > > Whenever a programmer says "This case will never happen", ten thousand > computers crash.
What crash can including such an entry in the output list cause? Should the regex also ensure that the data only includes *english words* separated by space-ampersand-space? -- https://mail.python.org/mailman/listinfo/python-list