Hi,

 here is a snippet of code that opens a file (fn contains the path\name) and 
first tried to replace all endash, emdash etc characters with simple dash 
characters, before doing a search.
  But the replaces are not having any effect. Obviously a syntax 
problem....wwhat silly thing am I doing wrong?

  Thanks!

fn = 'z:\Documentation\Software'
def processdoc(fn,outfile):
    fStr = open(fn, 'rb').read()
    re.sub(b'&#x2012','-',fStr)
    re.sub(b'&#x2013','-',fStr)
    re.sub(b'&#x2014','-',fStr)
    re.sub(b'&#x2015','-',fStr)
    re.sub(b'&#x2E3A','-',fStr)
    re.sub(b'&#x2E3B','-',fStr)
    re.sub(b'&#x002D','-',fStr)
    re.sub(b'&#x00AD','-',fStr)

-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to