Hi

I'm new to skim-app-users and still finding my way around, so sorry if a 
solution to my question has been covered before. Here it is:

Underlying issue is poor OCR leading to a readable pdf whose structure is 
slightly scrambled (e.g. tabular data is not read properly in sequential rows). 
I need to extract text from various places based on the location of known 
column headings (using 'find' in Skim to locate these). Scrambling means I 
can't just select the next paragraph, attribute run, number of characters  etc.

I am attempting to get round this this by using Skim's interpretation of lines 
(the line bounds) for a page & Applescript to get text from within specified 
selection bounds - for instance for a given line on a page (item x of the line 
bounds). Is it possible to get text for a bounds selection, rather than 
directly from a character selection? I've managed to do it by setting the 
clipboard to grabbed text for the bounds (i.e. a mini pdf) and using some UI 
scripting to make a new document from the clipboard - & then selecting the text 
of the new cut-down document! Needless to say v.slow & inefficient. Is there a 
simpler way?

Many thanks

Bruce
------------------------------------------------------------------------------
The ultimate all-in-one performance toolkit: Intel(R) Parallel Studio XE:
Pinpoint memory and threading errors before they happen.
Find and fix more than 250 security defects in the development cycle.
Locate bottlenecks in serial and parallel code that limit performance.
http://p.sf.net/sfu/intel-dev2devfeb
_______________________________________________
Skim-app-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/skim-app-users

Reply via email to