It won't fix the underlying problem, but as a last-ditch effort you
could use my pdfUtils CFC from RIAForge. It has a function that reads
the text out of a PDF, so you could extract the text yourself and
index that instead of the file. That is how my Seeker project
(CF/Lucene integration) reads and indexes PDF files.


On Mon, Oct 27, 2008 at 7:50 AM, JediHomer <[EMAIL PROTECTED]> wrote:
> Hi,
>
> We're currently having an issue on Linux with Verity.
>
> We're using CF8 on Debian (Ubuntu for dev), where a client has PDFs
> that need indexing but contain extended characters in the filename, in
> particular the registered symbol.
>
> On Linux these files are not being indexed by the Verity engine; on
> Windows (which we are migrating away from) they are indexed correctly.
>
> Using CFDirectory and Java's listFiles() I can confirm that CF and
> Java both see the files correctly, and FileExists() also returns true.
> However, when CFIndexing the paths I get the following:
>
> "Error E3-0422 (VDK ): Could not find file
> /home/jedi/Dev/htdocs/jedi/verity/mypdf.pdf" (the real path includes
> the extended character; CF-Talk won't accept the Base64 post)
>
> In the Messages section of the Status dump the symbol is shown
> correctly, but in the bad keys section of the Status dump the
> character does not come out correctly.
>
> Has anyone come across this before, and does anyone know of a fix or
> workaround?
>
> Thanks in advance
>
>
> Jedi
>
> 
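For anyone wanting to reproduce the check described in the quoted
message (listing the directory from Java and confirming each file
exists), here is a minimal sketch. The class and method names are
made up for illustration, and it does not touch Verity at all. It only
shows which filenames contain non-ASCII characters and prints them as
code points, so they can be compared with what shows up in the bad
keys section without the mail list mangling them.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of ColdFusion or Verity.
class NonAsciiNames {

    // True if the name contains any character outside 7-bit ASCII.
    static boolean hasNonAscii(String name) {
        for (int i = 0; i < name.length(); i++) {
            if (name.charAt(i) > 127) {
                return true;
            }
        }
        return false;
    }

    // Replace each non-ASCII char with <U+XXXX> so it survives plain-text mail.
    static String escape(String name) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < name.length(); i++) {
            char c = name.charAt(i);
            if (c > 127) {
                sb.append(String.format("<U+%04X>", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    // Regular files directly under dir whose names contain non-ASCII characters.
    static List<File> findNonAscii(File dir) {
        List<File> hits = new ArrayList<>();
        File[] files = dir.listFiles();
        if (files == null) {
            return hits; // not a directory, or not readable
        }
        for (File f : files) {
            if (f.isFile() && hasNonAscii(f.getName())) {
                hits.add(f);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Directory to scan; defaults to the current directory.
        File dir = new File(args.length > 0 ? args[0] : ".");
        for (File f : findNonAscii(dir)) {
            System.out.println(escape(f.getName()) + " exists=" + f.exists());
        }
    }
}
```

Run it against the directory being indexed. If the files are listed
here with exists=true but Verity still reports them as missing, that
would be consistent with the filename being altered somewhere between
CF and the Verity engine rather than on the Java side.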
