Trove is where all my notes with Emdash come from. 

 

When correcting the OCR there is a place at the top of the page where you
can insert a symbol and Emdash is top of the list. I have been inserting the
symbol in place of a simple dash if an Emdash is shown on the image of the
article. 

 

I then do as you do and copy the article into a .txt file (I use Notepad++)
and tidy it up before copying and pasting into Legacy. In Legacy v 7.5 there
was no problem with the Emdash.

 

As it looks like this is an ongoing problem in Legacy post v 7.5, I will
need to go and replace all those '97 instances with a simple dash (if I can
find them all) , and remember to just use a dash in the .txt file from Trove
before pasting into Legacy.

 

Jennifer

Ringwood North, Victoria, Australia

 

From: LegacyUserGroup [mailto:legacyusergroup-boun...@legacyusers.com] On
Behalf Of Ian Thomas
Sent: Friday, 21 April 2017 10:47 AM
To: Legacy User Group <legacyusergroup@legacyusers.com>
Subject: Re: [LegacyUG] Emdash rendered '97

 

This is probably peripheral to the discussion on Emdash / Endash and Legacy
FT's clean-up abilities. 

One source of information that I use a lot is from digitized (Australian)
newspapers, at Trove <http://trove.nla.gov.au/newspaper/> . The OCR text
usually requires correction (human recognition)  and that includes changing
Emdash to hyphen, inserting spaces, concatenating hyphenated words, etc etc.


But a simple Copy then Paste into a Notes field in Legacy isn't an option
for me. Stripping the HTML doesn't improve the situation - but that's fine
by me. 

Routinely, I paste into a plain-text editor set up in a simple Encoding (for
PC of course) - using an Encoding like Unicode BOM is asking for trouble, I
have found. Then, I create accurate text but sometimes leave it in newspaper
column format - or, make it more readable in continuous-text paragraphs. 

This process is much lengthier than just changing a few single-symbol bits
like an Emdash. But I have seen the 'conversion' by Legacy to '97' which is
why I adopted the method described. 

I find it has a benefit - I read more attentively, and understand better.
Some of the articles - and even common-place advertisements - are
fascinating. 

>From Trove, an image or PDF file at a selectable range of resolutions can be
saved to disk, so the scanned source imagery is available and can be
attached to Legacy as a media file, if warranted.     

 

Ian Thomas

Albert Park, Victoria 3206 Australia

-- 

LegacyUserGroup mailing list
LegacyUserGroup@legacyusers.com
To manage your subscription and unsubscribe 
http://legacyusers.com/mailman/listinfo/legacyusergroup_legacyusers.com
Archives at:
http://www.mail-archive.com/legacyusergroup@legacyusers.com/

Reply via email to