Re: Search-function on website

2006-09-06 Thread Auke van Leeuwen
If you used Verity in CF 5, did you use the K2 engine or the VDK engine? The
K2 engine is much better in many respects, and if I recall correctly, CFMX
6.x uses K2 instead of VDK.

I don't realy recall any specific choice that I made. K2 sounds the most 
familiar, but that may be because it's used in later versions of CF. 

If you want to search PDFs using SQL Server's full-text functionality, you
can do that with a little work:

http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

Interesting link, I'll look into that.

However, I don't know how well that works with large numbers of PDF
documents, since I haven't done it myself. If not for the PDF part, I'd
unhesitatingly recommend SQL Server full-text indexing over Verity.

Well the thing that bugs me the most of the SQL Server full-text search is the 
fact that it doesn't search on parts of words. I can't really think of a good 
example in English, but Dutch works just as well (only you don't understand the 
words):

Take these three words that appear in texts of different pages: zetel, zetels, 
restzetel(s).

Now if you search for 'zetel' you are presented with the pages with 'zetel' and 
'zetels' if you use a prefix (*) notation in your CONTAINS. However restzetel 
or restzetels does not come up at all. 

LIKE '%zetel%' would of course yield all of these results, which is a desired 
result, but of course I would like a ranking kinda like this: zetel  zetels  
restzetel  restzetels.

Well.. that being said, I guess I'll have to look into verity again. See if the 
results are somewhat better this time. Thanks for your response.

~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:252174
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4


Re: Search-function on website

2006-09-06 Thread Robertson-Ravo, Neil (RX)
You may have to code/develop something using the Verity SDK but I am not
sure what is available with the OEM version which ships with ColdFusion.







This e-mail is from Reed Exhibitions (Oriel House, 26 The Quadrant,
Richmond, Surrey, TW9 1DL, United Kingdom), a division of Reed Business,
Registered in England, Number 678540.  It contains information which is
confidential and may also be privileged.  It is for the exclusive use of the
intended recipient(s).  If you are not the intended recipient(s) please note
that any form of distribution, copying or use of this communication or the
information in it is strictly prohibited and may be unlawful.  If you have
received this communication in error please return it to the sender or call
our switchboard on +44 (0) 20 89107910.  The opinions expressed within this
communication are not necessarily those expressed by Reed Exhibitions. 
Visit our website at http://www.reedexpo.com

-Original Message-
From: Auke van Leeuwen
To: CF-Talk
Sent: Wed Sep 06 07:30:39 2006
Subject: Re: Search-function on website

If you used Verity in CF 5, did you use the K2 engine or the VDK engine?
The
K2 engine is much better in many respects, and if I recall correctly, CFMX
6.x uses K2 instead of VDK.

I don't realy recall any specific choice that I made. K2 sounds the most
familiar, but that may be because it's used in later versions of CF. 

If you want to search PDFs using SQL Server's full-text functionality, you
can do that with a little work:

http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

Interesting link, I'll look into that.

However, I don't know how well that works with large numbers of PDF
documents, since I haven't done it myself. If not for the PDF part, I'd
unhesitatingly recommend SQL Server full-text indexing over Verity.

Well the thing that bugs me the most of the SQL Server full-text search is
the fact that it doesn't search on parts of words. I can't really think of a
good example in English, but Dutch works just as well (only you don't
understand the words):

Take these three words that appear in texts of different pages: zetel,
zetels, restzetel(s).

Now if you search for 'zetel' you are presented with the pages with 'zetel'
and 'zetels' if you use a prefix (*) notation in your CONTAINS. However
restzetel or restzetels does not come up at all. 

LIKE '%zetel%' would of course yield all of these results, which is a
desired result, but of course I would like a ranking kinda like this: zetel
 zetels  restzetel  restzetels.

Well.. that being said, I guess I'll have to look into verity again. See if
the results are somewhat better this time. Thanks for your response.



~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:252177
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4


Search-function on website

2006-09-05 Thread Auke van Leeuwen
I was wondering if any of you have any thoughts on the implementation of a 
'search-function' on a website. It would only be a fairly simple input field 
and some ranked search results. The website content comes directly from the 
database btw.

So far I've used and come up with some scenario's:

* Simple 'LIKE' clause and a query. Works fairly well, however ranking is not 
really an option and PDFs etc are not searched at all of course.

* Full-Text search (MS-SQL 2000). Has some cool features, ranking is great, 
however only searches on complete words, complete phrases or words starting 
with a certain prefix. Some of the more advanced options such as FORMSOF, NEAR 
etc, are not used at all since it's only a simple HTML-input field. 

* Verity: In short: I hate it. I've worked with it in CF5 and found it to be 
error-prone, not really maintenance-friendly and giving crappy results. I 
haven't used it on CF6, so it may have improved (please tell me if it has :-) 
). I don't have CF7 (CF6) so the added categorization-functions are not 
available btw.

Now that I think about it, maybe I should give Verity another shot (especially 
since it searches the PDFs etc as well).. But maybe someone else has a better 
idea?

~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:252045
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4


RE: Search-function on website

2006-09-05 Thread Dave Watts
 I was wondering if any of you have any thoughts on the 
 implementation of a 'search-function' on a website. It would 
 only be a fairly simple input field and some ranked search 
 results. The website content comes directly from the database btw.
 
 So far I've used and come up with some scenario's:
 
 * Simple 'LIKE' clause and a query. Works fairly well, 
 however ranking is not really an option and PDFs etc are not 
 searched at all of course.
 
 * Full-Text search (MS-SQL 2000). Has some cool features, 
 ranking is great, however only searches on complete words, 
 complete phrases or words starting with a certain prefix. 
 Some of the more advanced options such as FORMSOF, NEAR etc, 
 are not used at all since it's only a simple HTML-input field. 
 
 * Verity: In short: I hate it. I've worked with it in CF5 and 
 found it to be error-prone, not really maintenance-friendly 
 and giving crappy results. I haven't used it on CF6, so it 
 may have improved (please tell me if it has :-) ). I don't 
 have CF7 (CF6) so the added categorization-functions are not 
 available btw.
 
 Now that I think about it, maybe I should give Verity another 
 shot (especially since it searches the PDFs etc as well).. 
 But maybe someone else has a better idea?

If you used Verity in CF 5, did you use the K2 engine or the VDK engine? The
K2 engine is much better in many respects, and if I recall correctly, CFMX
6.x uses K2 instead of VDK.

If you want to search PDFs using SQL Server's full-text functionality, you
can do that with a little work:

http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

However, I don't know how well that works with large numbers of PDF
documents, since I haven't done it myself. If not for the PDF part, I'd
unhesitatingly recommend SQL Server full-text indexing over Verity.

Dave Watts, CTO, Fig Leaf Software
http://www.figleaf.com/

Fig Leaf Software provides the highest caliber vendor-authorized
instruction at our training centers in Washington DC, Atlanta,
Chicago, Baltimore, Northern Virginia, or on-site at your location.
Visit http://training.figleaf.com/ for more information!


~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:252048
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4