Re: Any good way to index/search pdf docs via CF
I also recall some discussion of the CF-bundled Verity engine not supporting the newer Acrobat 4.x "UNICODE" PDFs. You may want to do some legwork and verify what the bundled Verity engine can and can't do. If version 4.x PDFs are a non-issue, you're all set.

Steve

----- Original Message -----
From: "Mark W. Breneman" <[EMAIL PROTECTED]>
To: "CF-Talk" <[EMAIL PROTECTED]>
Sent: Tuesday, November 21, 2000 2:55 PM
Subject: RE: Any good way to index/search pdf docs via CF

> My experience may be a little old here...
>
> My first project, using CF 3.1, was just what you described: searching PDFs.
>
> To index them, you simply create a Verity collection as you normally would,
> but you put the pdf extension in place of html or cfm when setting up
> the collection.
>
> There was a problem with Verity not being able to return a summary. To
> solve that, I made a database of filenames and the corresponding summaries.
>
> Has this been fixed?
>
> Hope that helps.
>
> Mark W. Breneman
> -Cold Fusion Developer
> -Network Administrator
> Vivid Media
> [EMAIL PROTECTED]
> www.vividmedia.com
> 608.270.9770
>
> -----Original Message-----
> From: Paul Sinclair [mailto:[EMAIL PROTECTED]]
> Sent: Tuesday, November 21, 2000 3:41 PM
> To: CF-Talk
> Subject: Any good way to index/search pdf docs via CF
>
> I've got about 1500 pdf documents that need to be indexed so as to be
> searchable via a web search form. Can you/how do you index these things so
> as to do a search on them? Can you create a Verity collection out of them
> and then search the Verity collection?
>
> Thanks for any pointers.
>
> Paul Sinclair

~~
Structure your ColdFusion code with Fusebox. Get the official book at http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists
RE: Any good way to index/search pdf docs via CF
Try this.

http://www.mysite.com/pdf/ ]" extensions=".pdf" recurse="Yes/No" >

Recurse will search subdirectories.

Then put this in your search form page.

Search

Then search.cfm

Document #Key#
Score #NumberFormat(Score,"")#%
#Summary#
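Most of the tag markup in the post above did not survive in the archive, so what follows is only a rough sketch of the kind of index, form, and results pages it seems to describe, not a reconstruction of the original code. The collection name pdfdocs, the directory c:\inetpub\wwwroot\pdf, the form field name criteria, and the query name results are all made up for illustration; the URL is the placeholder from the post. It also assumes the collection already exists and that Score comes back as a fraction between 0 and 1.

```cfml
<!--- index.cfm: index every .pdf under one directory.
      Assumes a collection named "pdfdocs" already exists (hypothetical name). --->
<cfindex collection="pdfdocs"
         action="update"
         type="path"
         key="c:\inetpub\wwwroot\pdf"
         urlpath="http://www.mysite.com/pdf/"
         extensions=".pdf"
         recurse="Yes">

<!--- searchform.cfm: a plain HTML form that posts to search.cfm --->
<form action="search.cfm" method="post">
  <input type="text" name="criteria">
  <input type="submit" value="Search">
</form>

<!--- search.cfm: run the search and list the hits --->
<cfsearch name="results"
          collection="pdfdocs"
          type="simple"
          criteria="#form.criteria#">

<cfoutput query="results">
  <p>
    Document <a href="#results.URL#">#results.Key#</a><br>
    <!--- assumes Score is a fraction between 0 and 1 --->
    Score #NumberFormat(results.Score * 100, "999")#%<br>
    #results.Summary#
  </p>
</cfoutput>
```

With recurse="Yes" the indexer walks subdirectories of the key path, and urlpath is prepended to each file name so the URL column in the results points at a browsable address.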
RE: Any good way to index/search pdf docs via CF
Yes - the Verity implementation that comes with CF will index PDF docs. I've used this on several commercial projects - works pretty well, actually. Just remember to optimize the collections if they change fairly often.

Verity can be a bit slow with PDF files, depending on their size, etc. You might want to do the indexing at night or something...

Good luck :)

Jeff
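As a minimal sketch of that maintenance advice, a small template like the one below could rebuild and then optimize the collection. The collection name pdfdocs, the directory path, and the template name are hypothetical and not from the thread.

```cfml
<!--- reindex.cfm (hypothetical name): rebuild and optimize the collection.
      "pdfdocs" and the directory are placeholders for illustration. --->

<!--- refresh clears the collection and repopulates it from the directory --->
<cfindex collection="pdfdocs"
         action="refresh"
         type="path"
         key="c:\inetpub\wwwroot\pdf"
         extensions=".pdf"
         recurse="Yes">

<!--- optimize the collection after the index has changed --->
<cfcollection action="optimize" collection="pdfdocs">
```

To run it at night, one option is to have a scheduled task set up in the ColdFusion Administrator request this template during off-hours.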
RE: Any good way to index/search pdf docs via CF
My experience may be a little old here...

My first project, using CF 3.1, was just what you described: searching PDFs.

To index them, you simply create a Verity collection as you normally would, but you put the pdf extension in place of html or cfm when setting up the collection.

There was a problem with Verity not being able to return a summary. To solve that, I made a database of filenames and the corresponding summaries.

Has this been fixed?

Hope that helps.

Mark W. Breneman
-Cold Fusion Developer
-Network Administrator
Vivid Media
[EMAIL PROTECTED]
www.vividmedia.com
608.270.9770
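The summary workaround described above might look roughly like this. It is a sketch under several assumptions: a datasource named pdfsummaries with a table pdf_summary(filename, summary), a collection named pdfdocs, and a form field named criteria are all invented here, and cfqueryparam requires ColdFusion 4.5 or later. It also assumes Key holds the full file path, as it would for an index built with type="path".

```cfml
<!--- search.cfm: look up a hand-written summary for each hit.
      Datasource, table, and collection names are hypothetical. --->
<cfsearch name="results"
          collection="pdfdocs"
          type="simple"
          criteria="#form.criteria#">

<cfloop query="results">
  <!--- assumes Key is the full path; keep only the file name --->
  <cfset fileName = GetFileFromPath(results.Key)>

  <!--- fetch the stored summary for this file, if there is one --->
  <cfquery name="getSummary" datasource="pdfsummaries">
    SELECT summary
    FROM   pdf_summary
    WHERE  filename = <cfqueryparam value="#fileName#" cfsqltype="CF_SQL_VARCHAR">
  </cfquery>

  <cfoutput>
    <p>
      <a href="#results.URL#">#fileName#</a><br>
      <cfif getSummary.RecordCount>#getSummary.summary#<cfelse>(no summary on file)</cfif>
    </p>
  </cfoutput>
</cfloop>
```

One query per hit is fine for a page of results; for long result lists, a single query that loads all summaries up front would cut down on database round trips.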
RE: Any good way to index/search pdf docs via CF
Check your documentation; it indicates that Verity collections support PDF files. Look at CFCOLLECTION, CFINDEX, and CFSEARCH.
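Of those three tags, CFCOLLECTION comes first, since a collection has to exist before anything can be indexed into it or searched. A minimal sketch, assuming a hypothetical collection name pdfdocs and a typical (but here assumed) collections directory:

```cfml
<!--- Create an empty Verity collection.
      The name "pdfdocs" and the path are assumptions for illustration. --->
<cfcollection action="create"
              collection="pdfdocs"
              path="c:\cfusion\verity\collections\">
```

Once the collection exists, CFINDEX populates it from the PDF directory and CFSEARCH queries it. The same creation step can also be done by hand in the ColdFusion Administrator.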