[julia-users] Removing the 'DIVIDES' (U+2223) from the inside of a character string

Lauren Kersey Tue, 18 Aug 2015 20:59:23 -0700

Hi all. I have a text tool that removes punctuation from the ends of words 
(split). This seemed to work well enough for cleaning my corpus, but I'm 
now working with a dataset of texts published from 1600-1700. Back then, 
printers split a lot of lines in the middle of a word. When the printer 
came to the end of the page, he simply split the word with a hyphen. My 
digital copies mark this symbol as a "\u2223" and I need to remove this 
from the word to perform computational analysis.


Is there a way to remove this character from the inside of a character 
string?

[julia-users] Removing the 'DIVIDES' (U+2223) from the inside of a character string

Reply via email to