Re: Verity and PDF
Well, let me step in and defend Verity. It will index quite a few format types, so I think it has quite strong indexing capabilities. I do agree that a page # would be handy. It may even be possible in the 'real' Verity, as opposed to the bundled Verity.

On Sun, May 4, 2008 at 9:34 PM, Richard Steele [EMAIL PROTECTED] wrote:
> Ok, wow. That's a pretty rudimentary index ability then. I think I found the ticket though. It can save the index in a database format and then I can run Verity on that. http://www.pdfstore.com/details.asp?ProdID=699
>
>> I don't think it's possible to do that. You could cheat though. What I'm about to d

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304840
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Re: Verity and PDF
Actually I'm liking your idea and will explore it.

> I don't think it's possible to do that. You could cheat though. What I'm about to describe is NOT something I've done. It's just an idea.
>
> Use CF8 and cfpdf (with DDX) to split the PDF into N pages. Use names like original_N, where original is the original file name and N is page no. Index the pages. When the user search returns a result for original_N, link them to original and say Found on page N.
>
> On Sun, May 4, 2008 at 1:19 PM, Richard Steele [EMAIL PROTECTED] wrote:

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304760
RE: Verity and PDF
-----Original Message-----
From: Richard Steele [EMAIL PROTECTED]
Sent: Saturday, May 03, 2008 17:03
To: CF-Talk cf-talk@houseoffusion.com
Subject: Verity and PDF

I want a user to search a PDF document (test.pdf) for a term and then go to the page that has that term on it. In ColdFusion 8, what would the cfindex tag look like to index test.pdf to do that? Is the page number returned? In other words, how can I link to that page?

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304719
Re: Verity and PDF
Hi Ray,

I can get it to index various PDF documents in a folder; however, I can't get it to return the number of the page that contains the text. Is this possible? If not, is there a third-party utility that does this? It would be really useful to be able to direct a user to the exact page that contains the search item; otherwise they would have to read the entire document to find the item they are looking for.

Thanks in advance.

> Verity determines how to parse the file based on the file type. You don't do anything special for PDF, you just index it like you would any other file or directory of files. I'd read the docs on cfindex and Verity.
>
> On Sat, May 3, 2008 at 4:03 PM, Richard Steele [EMAIL PROTECTED] wrote:
>> I want a user to search a PDF document (test.pdf) for a term and then go to the page that has that term on it. In ColdFusion 8, what would the cfindex tag look like to index test.pdf to do that? Is the page number returned? In other words, how can I link to that page?

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304722
Re: Verity and PDF
I don't think it's possible to do that. You could cheat though. What I'm about to describe is NOT something I've done. It's just an idea.

Use CF8 and cfpdf (with DDX) to split the PDF into N pages. Use names like original_N, where original is the original file name and N is page no. Index the pages. When the user search returns a result for original_N, link them to original and say Found on page N.

On Sun, May 4, 2008 at 1:19 PM, Richard Steele [EMAIL PROTECTED] wrote:
> Hi Ray,
>
> I can get it to index various PDF documents in a folder; however, I can't get it to return the number of the page that contains the text. Is this possible? If not, is there a third-party utility that does this? It would be really useful to be able to direct a user to the exact page that contains the search item; otherwise they would have to read the entire document to find the item they are looking for.
>
> Thanks in advance.
>
>> Verity determines how to parse the file based on the file type. You don't do anything special for PDF, you just index it like you would any other file or directory of files. I'd read the docs on cfindex and Verity.
>>
>> On Sat, May 3, 2008 at 4:03 PM, Richard Steele [EMAIL PROTECTED] wrote:
>>> I want a user to search a PDF document (test.pdf) for a term and then go to the page that has that term on it. In ColdFusion 8, what would the cfindex tag look like to index test.pdf to do that? Is the page number returned? In other words, how can I link to that page?

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304723
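The naming trick in the message above can be sketched in a few lines. This is only an illustration of the original_N mapping, written in Python for brevity rather than in CFML; the splitting and indexing would still be done with cfpdf and cfindex as the message suggests. The function names are hypothetical, and the `#page=N` URL fragment assumes the user's PDF viewer honors the PDF "page" open parameter.

```python
import re
from pathlib import PurePosixPath

# Hypothetical naming scheme taken from the message: page N of report.pdf
# is stored and indexed as report_N.pdf.
PAGE_SUFFIX = re.compile(r"^(?P<stem>.+)_(?P<page>\d+)$")


def page_file_name(original: str, page: int) -> str:
    """Return the name of the single-page file for page `page` of `original`."""
    path = PurePosixPath(original)
    return f"{path.stem}_{page}{path.suffix}"


def resolve_hit(key: str) -> tuple[str, int]:
    """Map an indexed per-page file name back to (original file name, page)."""
    path = PurePosixPath(key)
    match = PAGE_SUFFIX.match(path.stem)
    if match is None:
        raise ValueError(f"not a per-page file name: {key}")
    return f"{match['stem']}{path.suffix}", int(match["page"])


def hit_link(key: str, base_url: str) -> str:
    """Build a link to the original PDF that asks the viewer to open the hit page."""
    original, page = resolve_hit(key)
    return f"{base_url.rstrip('/')}/{original}#page={page}"
```

A search hit on `test_12.pdf` would then be shown as "Found on page 12" with a link built by `hit_link("test_12.pdf", base_url)`. The greedy match means a file whose own name ends in an underscore and digits (for example `q_4.pdf`) still resolves correctly once the page suffix is appended.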
Re: Verity and PDF
Ok, wow. That's a pretty rudimentary index ability then. I think I found the ticket though. It can save the index in a database format and then I can run Verity on that. http://www.pdfstore.com/details.asp?ProdID=699

> I don't think it's possible to do that. You could cheat though. What I'm about to describe is NOT something I've done. It's just an idea.
>
> Use CF8 and cfpdf (with DDX) to split the PDF into N pages. Use names like original_N, where original is the original file name and N is page no. Index the pages. When the user search returns a result for original_N, link them to original and say Found on page N.
>
> On Sun, May 4, 2008 at 1:19 PM, Richard Steele [EMAIL PROTECTED] wrote:

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304733
Verity and PDF
I want a user to search a PDF document (test.pdf) for a term and then go to the page that has that term on it. In ColdFusion 8, what would the cfindex tag look like to index test.pdf to do that? Is the page number returned? In other words, how can I link to that page?

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304708
Re: Verity and PDF
Verity determines how to parse the file based on the file type. You don't do anything special for PDF, you just index it like you would any other file or directory of files. I'd read the docs on cfindex and Verity.

On Sat, May 3, 2008 at 4:03 PM, Richard Steele [EMAIL PROTECTED] wrote:
> I want a user to search a PDF document (test.pdf) for a term and then go to the page that has that term on it. In ColdFusion 8, what would the cfindex tag look like to index test.pdf to do that? Is the page number returned? In other words, how can I link to that page?

Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:304710
RE: Verity and PDF
I've developed something to this effect. What you need to do is use the Acrobat service that converts PDF to HTML, store the result somewhere, then manipulate that HTML file and give users the option of viewing the first instance of the search term.

Adam

-----Original Message-----
From: Trey Rouse [mailto:trouse;rice.edu]
Sent: 28 October 2002 23:16
To: CF-Talk
Subject: RE: Verity and PDF

Afraid not. The full Verity server has some help for this, but they handle it by converting the PDF to HTML. Other than that, you have to use the built-in PDF search.

Trey

-----Original Message-----
From: charlie griefer [mailto:charlie;griefer.com]
Sent: Friday, October 25, 2002 1:10 PM
To: CF-Talk
Subject: Verity and PDF

Hey All:

Using CF 5 and indexing a directory that contains a mix of htm and pdf files. Everything seems to be going OK as far as successfully indexing the collection and searching... but (don't laugh, everybody's got one)... if the search term is found in a particular PDF and I click the link, it takes me to the first page of the PDF (which can potentially be 100 pages), meaning I've just got to use the Acrobat search tool.

Is there something that I've overlooked, or a tweak that I can perform, to get PDF files to return the page within the PDF that contains the search criteria?

TIA,
charlie

Archives: http://www.houseoffusion.com/cf_lists/index.cfm?forumid=4
Subscription: http://www.houseoffusion.com/index.cfm?sidebar=listsbody=lists/cf_talk
FAQ: http://www.thenetprofits.co.uk/coldfusion/faq
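Adam does not say how he manipulates the converted HTML, so the following is only one guess at what that step could look like, sketched in Python: drop a named anchor in front of the first occurrence of the search term so a results link can point at `#first-hit`. The function name and anchor id are hypothetical. The sketch matches only within a single run of text between tags, so a term split across markup is missed, and script or style content is not treated specially.

```python
import re

# Splitting on a captured tag pattern yields text runs at even indexes
# and the tags themselves at odd indexes.
TAG = re.compile(r"(<[^>]*>)")


def mark_first_hit(html: str, term: str, anchor_id: str = "first-hit") -> str:
    """Insert an empty anchor before the first visible occurrence of `term`."""
    parts = TAG.split(html)
    pattern = re.compile(re.escape(term), re.IGNORECASE)
    for i in range(0, len(parts), 2):  # even indexes are text, never tags
        match = pattern.search(parts[i])
        if match:
            start = match.start()
            anchor = f'<a id="{anchor_id}"></a>'
            parts[i] = parts[i][:start] + anchor + parts[i][start:]
            break
    return "".join(parts)
```

The search result would then link to the stored HTML file with `#first-hit` appended. Because only text runs are searched, a term that appears solely inside a tag attribute (such as an href) leaves the document unchanged.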
RE: Verity and PDF
Afraid not. The full Verity server has some help for this, but they handle it by converting the PDF to HTML. Other than that, you have to use the built-in PDF search.

Trey

-----Original Message-----
From: charlie griefer [mailto:charlie;griefer.com]
Sent: Friday, October 25, 2002 1:10 PM
To: CF-Talk
Subject: Verity and PDF

Hey All:

Using CF 5 and indexing a directory that contains a mix of htm and pdf files. Everything seems to be going OK as far as successfully indexing the collection and searching... but (don't laugh, everybody's got one)... if the search term is found in a particular PDF and I click the link, it takes me to the first page of the PDF (which can potentially be 100 pages), meaning I've just got to use the Acrobat search tool.

Is there something that I've overlooked, or a tweak that I can perform, to get PDF files to return the page within the PDF that contains the search criteria?

TIA,
charlie
Verity and PDF
Hey All:

Using CF 5 and indexing a directory that contains a mix of htm and pdf files. Everything seems to be going OK as far as successfully indexing the collection and searching... but (don't laugh, everybody's got one)... if the search term is found in a particular PDF and I click the link, it takes me to the first page of the PDF (which can potentially be 100 pages), meaning I've just got to use the Acrobat search tool.

Is there something that I've overlooked, or a tweak that I can perform, to get PDF files to return the page within the PDF that contains the search criteria?

TIA,
charlie
Verity and PDF Files
Is anyone using Verity with a PDF file collection? Are there any special considerations, gotchas, etc.? Any tips or hints appreciated.

Gordon

Archives: http://www.mail-archive.com/cf-talk@houseoffusion.com/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists
Re: Verity and PDF Files [free alternatives...??]
The current FREE CF-bundled version of Verity has issues (read: doesn't work!) with Acrobat file format 1.3, or with those PDF files generated with Unicode support by Adobe Acrobat tools in Acrobat 4.x and up. Bummer.

Rumor has it that the new CF5 or next release of CF/NEO? will be bundled with the newer version of Verity that DOES properly index PDF 1.3 or Acrobat 4.x files. Disclaimer: the key word is "RUMOR".

I've heard on this list that there are several other FREE site indexing tools available. Hopefully those in the know on these will re-post.

HTH,
Steve Aylor

----- Original Message -----
From: "Gordon Burns" [EMAIL PROTECTED]
To: "CF-Talk" [EMAIL PROTECTED]
Sent: Sunday, March 11, 2001 12:14 PM
Subject: Verity and PDF Files

Is anyone using Verity with a PDF file collection? Are there any special considerations, gotchas, etc.? Any tips or hints appreciated.

Gordon
Re: Verity and PDF Files [free alternatives...??]
I use Verity with Acrobat 4.x files with CF 4.5.1 SP2 Enterprise, and it works fine. Don't know about the Unicode issues though.

--Dylan

----- Original Message -----
From: "Stephen M Aylor" [EMAIL PROTECTED]
To: "CF-Talk" [EMAIL PROTECTED]
Sent: Sunday, March 11, 2001 12:36 PM
Subject: Re: Verity and PDF Files [free alternatives...??]

The current FREE CF-bundled version of Verity has issues (read: doesn't work!) with Acrobat file format 1.3, or with those PDF files generated with Unicode support by Adobe Acrobat tools in Acrobat 4.x and up. Bummer.

Rumor has it that the new CF5 or next release of CF/NEO? will be bundled with the newer version of Verity that DOES properly index PDF 1.3 or Acrobat 4.x files. Disclaimer: the key word is "RUMOR".

I've heard on this list that there are several other FREE site indexing tools available. Hopefully those in the know on these will re-post.

HTH,
Steve Aylor

----- Original Message -----
From: "Gordon Burns" [EMAIL PROTECTED]
To: "CF-Talk" [EMAIL PROTECTED]
Sent: Sunday, March 11, 2001 12:14 PM
Subject: Verity and PDF Files

Is anyone using Verity with a PDF file collection? Are there any special considerations, gotchas, etc.? Any tips or hints appreciated.

Gordon