Re: Parsing a PDF file

2016-07-11 Thread [-hh]
ck set itemdelimiter to "." put "txt" into last item of f0 set itemdelimiter to comma put clipboardData["text"] into url ("file:" ) put f0 & cr before fld "jobsDone" end repeat end mouseUp -- View this message in context:

Re: Parsing a PDF file

2016-07-11 Thread [-hh]
" into aScript repeat for each line f in ff put aScript into fScript put PDFfolder & "/" & f into f0 replace "//" with "/" in f0 do fScript as applescript go this stack set itemdelimiter to "." put "txt" into la

Re: Parsing a PDF file

2016-07-11 Thread Jim Hurley
Kay Lan wrote: In this particular case I found it much easier to open the PDF file in Adobe Acrobat and doing a “Save as — Text (Accessible)” Jim > > On Mon, Jul 11, 2016 at 9:36 AM, Roger Eller > > wrote: >> Since this seems to

Re: Parsing a PDF file

2016-07-10 Thread Kay C Lan
On Mon, Jul 11, 2016 at 9:36 AM, Roger Eller wrote: > Since this seems to be Mac only, why not "do as Applescript" the select > all, and Copy? > Because Preview isn't properly scriptable and you can't "Select All" or "Copy". As Richard said, the answer is with

Re: Parsing a PDF file

2016-07-10 Thread Roger Eller
Since this seems to be Mac only, why not "do as Applescript" the select all, and Copy? ~Roger On Jul 10, 2016 7:18 PM, "Jim Hurley" wrote: > hh wrote: > > > > > [Description for MacOS, works on Win/Linux similar.] > > > > The best results for extracting tables from

Re: Parsing a PDF file

2016-07-10 Thread Jim Hurley
hh wrote: > > [Description for MacOS, works on Win/Linux similar.] > > The best results for extracting tables from PDF I had with the free "RAW" > method: > > = Open the file with Preview. > = Select All (menu Edit). Copy. > = Go to a LC stack with a field "INCOMING" > = Use by a button or the

Re: Parsing a PDF file

2016-07-10 Thread Jim Hurley
Mark Wieder wrote: > > On 07/09/2016 08:54 AM, Richard Gaskin wrote: > >> Across the US we're beginning to see a revolution in government data >> sharing. > > > Except, of course, when it comes to actual data. Hi Mark. See my reply to Richard. The actual data I wanted was how the election

Re: Parsing a PDF file

2016-07-10 Thread Jim Hurley
Richard Gaskin wrote: > > Jim Hurley wrote: > >> Thanks Richard. >> >> You are so right about releasing data in complex formats. >> I spoke to the election's off about posting election results in PDF >> format. >> I knew there was not use fighting them when they told me that it was >> now

Re: Parsing a PDF file

2016-07-09 Thread Mark Wieder
On 07/09/2016 08:54 AM, Richard Gaskin wrote: Across the US we're beginning to see a revolution in government data sharing. Except, of course, when it comes to actual data. Many of the laws we as citizens of the US are required to follow are not available for us to read without paying a

Re: Parsing a PDF file

2016-07-09 Thread Richard Gaskin
Jim Hurley wrote: > Thanks Richard. > > You are so right about releasing data in complex formats. > I spoke to the election's off about posting election results in PDF > format. > I knew there was not use fighting them when they told me that it was > now County "policy" to post everything in

Re: Parsing a PDF file

2016-07-09 Thread Jim Hurley
ssage: 9 > Date: Fri, 8 Jul 2016 08:44:50 -0700 > From: Richard Gaskin <ambassa...@fourthworld.com> > To: use-livecode@lists.runrev.com > Subject: Re: Parsing a PDF file > Message-ID: <577fca72.2040...@fourthworld.com> > Content-Type: text/plain; charset=utf-8; format=fl

Re: Parsing a PDF file

2016-07-08 Thread Mark Rauterkus
Hi, OT Tip: Go to the source of the data, the election board. Tell them you want the raw data made available PLUS the PDF. That is not an unreasonable request. Open Government advocates / folks would support that, IMHO. -- Ta. Mark Rauterkus m...@rauterkus.com

Re: Parsing a PDF file

2016-07-08 Thread Richard Gaskin
Dar Scott wrote: >> On Jul 8, 2016, at 9:44 AM, Richard Gaskin wrote: >> It's unfortunate that so many orgs release data useful to analysis >> in complex formats that inhibit such use. ... > To make it worse, documents for human consumption are claimed to be > the same when underneath there are

Re: Parsing a PDF file

2016-07-08 Thread Dar Scott
> On Jul 8, 2016, at 9:44 AM, Richard Gaskin wrote: > > > My County is now publishing the election results to the web as a PDF > > file: > > > >

Re: Parsing a PDF file

2016-07-08 Thread [-hh]
box the line put clipboardData["Text"] into fld "INCOMING" If you use simply "paste" you get (probably unwanted) styles with your text. (If you have a lot of files: Preview is scriptable.) -- View this message in context: http://runtime-revolution.278305.n4.na

Re: Parsing a PDF file

2016-07-08 Thread Richard Gaskin
Paul Dupuis wrote: > In truth a NEW portable document format needs to be invented that > connects and preserves content to its appearance, but I suspect that > people who want to keep both intact and portable are just using HTML5 > and CSS3. CSS is a wonderful solution. Being prone to

Re: Parsing a PDF file

2016-07-08 Thread Mike Bonner
Its ugly but, could you use pdf.js to extract the text in a browser widget showing the pdf? http://git.macropus.org/2011/11/pdftotext/example/ Not sure what else is in pdf.js but it looks interesting. On Fri, Jul 8, 2016 at 10:30 AM, Paul Dupuis wrote: > On 7/8/2016

Re: Parsing a PDF file

2016-07-08 Thread Mike Bonner
Might read this one too: http://stackoverflow.com/questions/1554280/extract-text-from-pdf-in-javascript On Fri, Jul 8, 2016 at 10:48 AM, Mike Bonner wrote: > Its ugly but, could you use pdf.js to extract the text in a browser > widget showing the pdf? >

Re: Parsing a PDF file

2016-07-08 Thread Paul Dupuis
On 7/8/2016 11:55 AM, Colin Holgate wrote: > I was trying an export as spreadsheet from Acrobat Pro, but that didn’t work. > Doing a Save as Text from Acrobat Reader was more successful, but the columns > come out in a different order, and some columns get combined into a single > string. Over

Re: Parsing a PDF file

2016-07-08 Thread Colin Holgate
I was trying an export as spreadsheet from Acrobat Pro, but that didn’t work. Doing a Save as Text from Acrobat Reader was more successful, but the columns come out in a different order, and some columns get combined into a single string. > On Jul 8, 2016, at 11:44 AM, Richard Gaskin

Re: Parsing a PDF file

2016-07-08 Thread Richard Gaskin
Jim Hurley wrote: > My County is now publishing the election results to the web as a PDF > file: > > https://www.mynevadacounty.com/nc/elections/docs/2016%20Elections/June%207%2c%202016%2c%20Presidential%20Primary/Election%20Results/precinctreport.pdf > > Is there a way to parse these PDF

Re: Parsing a PDF file

2016-07-08 Thread Dan Brown
> Doesn’t that just let you show PDFs? Would it help to parse the contents? Using the "pdftotext" component of Xpdf you can use a shell command to extract the text from a pdf and place it into for example a text file which you can then parse something like.. ___ put "C:\pdftotext" & "

Re: Parsing a PDF file

2016-07-08 Thread Colin Holgate
Doesn’t that just let you show PDFs? Would it help to parse the contents? I’m trying another approach, will report back soon. > On Jul 8, 2016, at 10:58 AM, Peter TB Brett wrote: > > On 08/07/2016 15:11, Jim Hurley wrote: >> My County is now publishing the election

Re: Parsing a PDF file

2016-07-08 Thread Paul Dupuis
On 7/8/2016 10:11 AM, Jim Hurley wrote: > My County is now publishing the election results to the web as a PDF file: > > > https://www.mynevadacounty.com/nc/elections/docs/2016%20Elections/June%207%2c%202016%2c%20Presidential%20Primary/Election%20Results/precinctreport.pdf > > Is there a

Re: Parsing a PDF file

2016-07-08 Thread Peter TB Brett
On 08/07/2016 15:11, Jim Hurley wrote: My County is now publishing the election results to the web as a PDF file: https://www.mynevadacounty.com/nc/elections/docs/2016%20Elections/June%207%2c%202016%2c%20Presidential%20Primary/Election%20Results/precinctreport.pdf Is there a way to

Parsing a PDF file

2016-07-08 Thread Jim Hurley
My County is now publishing the election results to the web as a PDF file: https://www.mynevadacounty.com/nc/elections/docs/2016%20Elections/June%207%2c%202016%2c%20Presidential%20Primary/Election%20Results/precinctreport.pdf Is there a way to parse these PDF files? Thanks, Jim