Re: Any good way to index/search pdf docs via CF

2000-11-21 Thread Stephen M. Aylor


I also recall some discussion of the CF bundled Verity engine not supporting
the newer Acrobat 4x "UNICODE"  - you may want to do some legwork and verify
what the bundled verity engine can and cant do, or if version 4x pdf's are a
non issue - your in the deal.

Steve

- Original Message -
From: "Mark W. Breneman" <[EMAIL PROTECTED]>
To: "CF-Talk" <[EMAIL PROTECTED]>
Sent: Tuesday, November 21, 2000 2:55 PM
Subject: RE: Any good way to index/search pdf docs via CF


> My experience may be a little old here...
>
> My fist project, using CF 3.1 was just what you described.  Searching
pdfs.
>
> To index them you simple create a verity collection as you normally word
> but, you put the pfd extension in the place of html or cfm when setting up
> the collection.
>
> There was a problem with verity not being able to return a summary.  To
> solve that I made a database of filenames and of the corresponding
summary.
>
> Has this been fixed?
>
> Hope that helps.
>
> Mark W. Breneman
> -Cold Fusion Developer
> -Network Administrator
>   Vivid Media
>   [EMAIL PROTECTED]
>   www.vividmedia.com
>   608.270.9770
>
> -Original Message-
> From: Paul Sinclair [mailto:[EMAIL PROTECTED]]
> Sent: Tuesday, November 21, 2000 3:41 PM
> To: CF-Talk
> Subject: Any good way to index/search pdf docs via CF
>
>
> I've got about 1500 pdf documents that need to be indexed so as to be
> searchable via a web search form. Can you/how do you index these things so
> as to do a search on them? Can you create a Verity collection out of them
> and then search the Verity collection?
>
> Thanks for any pointers.
>
> Paul Sinclair
>
> ~~
> Structure your ColdFusion code with Fusebox. Get the official book at
> http://www.fusionauthority.com/bkinfo.cfm
>
> Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
> Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists
>
> ~~
> Structure your ColdFusion code with Fusebox. Get the official book at
http://www.fusionauthority.com/bkinfo.cfm
>
> Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
> Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists
>
>


~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists



RE: Any good way to index/search pdf docs via CF

2000-11-21 Thread James McCullough

Try this.



http://www.mysite.com/pdf/ ]"
extensions=".pdf" recurse="Yes/No"  >

Recurse will search subdirectories.

Then put this in your search form page.


Search 
   


Then  search.cfm





   

   Document  
   #Key#
   Score#NumberFormat(Score,"")#%

#Summary#
 







-Original Message-
From: Paul Sinclair [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, November 21, 2000 1:41 PM
To: CF-Talk
Subject: Any good way to index/search pdf docs via CF


I've got about 1500 pdf documents that need to be indexed so as to be
searchable via a web search form. Can you/how do you index these things so
as to do a search on them? Can you create a Verity collection out of them
and then search the Verity collection?

Thanks for any pointers.

Paul Sinclair

~~
Structure your ColdFusion code with Fusebox. Get the official book at
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists

~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists



RE: Any good way to index/search pdf docs via CF

2000-11-21 Thread Jeff Beer

Yes - the Verity implementation that comes with CF will index PDF docs.
I've used this on several commercial projects - works pretty well, actually.
Just remember to optimize the collections if they change fairly often.

Verity can be a bit slow with PDF files, depending on their size, etc.  You
might want to do the indexing at night or something...

Good luck :)

Jeff


> -Original Message-
> From: Paul Sinclair [mailto:[EMAIL PROTECTED]]
> Sent: Tuesday, November 21, 2000 4:41 PM
> To: CF-Talk
> Subject: Any good way to index/search pdf docs via CF
>
>
> I've got about 1500 pdf documents that need to be indexed so as to be
> searchable via a web search form. Can you/how do you index these things so
> as to do a search on them? Can you create a Verity collection out of them
> and then search the Verity collection?
>
> Thanks for any pointers.
>
> Paul Sinclair
>
> ~~
> Structure your ColdFusion code with Fusebox. Get the official
> book at http://www.fusionauthority.com/bkinfo.cfm
>
> Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
> Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists

~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists



RE: Any good way to index/search pdf docs via CF

2000-11-21 Thread Mark W. Breneman

My experience may be a little old here...

My fist project, using CF 3.1 was just what you described.  Searching pdfs.

To index them you simple create a verity collection as you normally word
but, you put the pfd extension in the place of html or cfm when setting up
the collection.

There was a problem with verity not being able to return a summary.  To
solve that I made a database of filenames and of the corresponding summary.

Has this been fixed?

Hope that helps.

Mark W. Breneman
-Cold Fusion Developer
-Network Administrator
  Vivid Media
  [EMAIL PROTECTED]
  www.vividmedia.com
  608.270.9770

-Original Message-
From: Paul Sinclair [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, November 21, 2000 3:41 PM
To: CF-Talk
Subject: Any good way to index/search pdf docs via CF


I've got about 1500 pdf documents that need to be indexed so as to be
searchable via a web search form. Can you/how do you index these things so
as to do a search on them? Can you create a Verity collection out of them
and then search the Verity collection?

Thanks for any pointers.

Paul Sinclair

~~
Structure your ColdFusion code with Fusebox. Get the official book at
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists

~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists



RE: Any good way to index/search pdf docs via CF

2000-11-21 Thread Hayes, David

Check your documentation; it indicates that Verity collections support PDF
files.

Look at CFCOLLECTION, CFINDEX, and CFSEARCH.

-Original Message-
From: Paul Sinclair [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, November 21, 2000 3:41 PM
To: CF-Talk
Subject: Any good way to index/search pdf docs via CF


I've got about 1500 pdf documents that need to be indexed so as to be
searchable via a web search form. Can you/how do you index these things so
as to do a search on them? Can you create a Verity collection out of them
and then search the Verity collection?

Thanks for any pointers.

Paul Sinclair

~~
Structure your ColdFusion code with Fusebox. Get the official book at
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists
~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists