Re: Search Questions

2006-11-11 Thread Denny Valliant
On 11/11/06, Les Mizzell <[EMAIL PROTECTED]> wrote:
...
> What's the most optimal way of handling this search? How do the rest of
> you handle searches for mixed static/dynamic content?

I've used htdig, but since you're on a shared server, that's probably out.

Is something like Google an option?  Or maybe there's an archaic Verity
"command" that would get you what you want?

Maybe there's one of those packages you just click on via the shared
server admin that will add some 3rd party tie-in type deal?

~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:260042
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4


Re: Search Questions

2006-11-12 Thread Robertson-Ravo, Neil (RX)
Hi Les,

There are certainly options open to you, and of course your problem is not
unique so there is help!

The main issue is that your site was never designed to be searched from the
beginning; search functionality was more of an afterthought. To get the
most out of your search (I presume you will be using the outdated and
feature-limited OEM Verity), you should index the dynamic pages by pointing a
spider at your site, but you have to ensure that all your links can be
followed. This will give you a good website search.

If you index the content from the DB directly into a collection, then its
relevancy and potential freshness suffer, and indeed the content may as well
stay in the database behind a SQL Full Text Search, with you building the
front end, rather than go into Verity.

A good search engine is a must here if you want to perform "real" searches
and utilise things like lemmatisation, synonym addition/conversion etc. But
you will pay for these tools and in most cases it will be a lot :-)

-Original Message-
From: Les Mizzell
To: CF-Talk
Sent: Sun Nov 12 02:12:25 2006
Subject: Search Questions

It's been a while since I've set up a search engine for a site. The last 
site I did a search for had mixed static/dynamic content, which caused 
a lot of head-scratching. I finally *faked* it by setting up a reference 
table that pretty much included "page_content" and "page_link" columns 
and just searched that. PITA, it was.
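
For anyone who hasn't seen the reference-table trick, it can be sketched roughly like this. This is an illustration in Python with an in-memory SQLite database, not the code from the original site. Only the "page_content" and "page_link" column names come from the post; the table name, the sample rows, and the helper functions are made up for the example.

```python
import sqlite3

def build_index(rows):
    """Create an in-memory reference table and load (page_content, page_link) rows."""
    conn = sqlite3.connect(":memory:")
    # Table name "search_ref" is hypothetical; the two columns are from the post.
    conn.execute("CREATE TABLE search_ref (page_content TEXT, page_link TEXT)")
    conn.executemany("INSERT INTO search_ref VALUES (?, ?)", rows)
    conn.commit()
    return conn

def search_pages(conn, term):
    """Return the links of pages whose content contains the term.

    SQLite's LIKE is case-insensitive for ASCII by default. Note that % and _
    inside the term still act as wildcards in this simple version.
    """
    pattern = "%" + term + "%"
    cur = conn.execute(
        "SELECT page_link FROM search_ref "
        "WHERE page_content LIKE ? ORDER BY page_link",
        (pattern,),
    )
    return [row[0] for row in cur.fetchall()]

# Hypothetical sample data: one row per static page or dynamic record.
conn = build_index([
    ("About our company and staff", "/about.html"),
    ("Product catalog: widgets and gears", "/products.cfm?cat=1"),
    ("Contact the widgets team", "/contact.html"),
])
print(search_pages(conn, "widgets"))
```

The painful part is keeping the table in sync with both the static files and the database tables, which is why it does not scale well to a large site.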

I'm now working on a much bigger site with the same problem. Half is 
static, the rest is coming from maybe 30 different tables in a MySQL 
database. The site is almost too large for me to try and fake it this time.

This is on a shared server, so I don't have access to the cf admin.

I can create a Verity collection and index through code, but that's not 
going to handle some of the dynamic content.

What's the most optimal way of handling this search? How do the rest of 
you handle searches for mixed static/dynamic content?




Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:260051


Re: Search questions

2005-05-27 Thread Rick Root
Anyone using Verity out there? =)

Rick Root wrote:
> I've been asked to investigate a method of making our documentation 
> searchable.
> 
> We have a database driven FAQ, numerous CFM pages with "static" content, 
>   and word/PDF documents.
> 
> We're currently running CFMX 6.1.
> 
> What are my options?  Verity can't do that in a single collection, right?
> 
> rick



Message: http://www.houseoffusion.com/lists.cfm/link=i:4:207842


RE: Search questions

2005-05-27 Thread James Holmes
You could search in the two collections necessary for file and DB
indexing and combine the results using a UNION QoQ.
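
The idea of combining the two result sets can be sketched as follows. This is a language-neutral illustration in Python rather than an actual query-of-queries; the sample hits are invented, and the key/title/score field names are only assumed to resemble the columns a search result carries.

```python
def union_results(file_hits, db_hits):
    """Combine two result lists the way a SQL UNION would:
    all rows from both inputs, minus exact duplicates."""
    seen = set()
    combined = []
    for hit in file_hits + db_hits:
        row = (hit["key"], hit["title"], hit["score"])
        if row not in seen:
            seen.add(row)
            combined.append(hit)
    # Highest score first, comparable to adding ORDER BY score DESC.
    combined.sort(key=lambda h: h["score"], reverse=True)
    return combined

# Hypothetical hits from a file collection and a database collection.
file_hits = [
    {"key": "/docs/install.pdf", "title": "Install Guide", "score": 0.91},
    {"key": "/docs/faq.cfm", "title": "FAQ page", "score": 0.40},
]
db_hits = [
    {"key": "17", "title": "How do I install?", "score": 0.75},
]

for hit in union_results(file_hits, db_hits):
    print(hit["score"], hit["key"], hit["title"])
```

One caveat: scores from two separately built collections are not necessarily comparable, so sorting the merged list by score is a convenience rather than a true relevance ranking.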

As for the second question, not since I discovered Oracle Text. 

Message: http://www.houseoffusion.com/lists.cfm/link=i:4:207843


RE: Search questions

2005-05-27 Thread Jeff Garza
Rick,

I think the answer is no: Verity cannot index all of that information
in a single collection, unless of course you are using the Verity Spider to
index your site. But you can put it all into individual collections and
search them all with a single cfsearch statement, since you can pass multiple
collection names to a single cfsearch tag.

HTH,

Jeff


Message: http://www.houseoffusion.com/lists.cfm/link=i:4:207845


Re: Search questions

2005-05-27 Thread Adam Churvis
We use it all the time (we've automated it in Plum), and we've built
customized Lucene systems as well, so we're very familiar with full text
indexing.

You can create a single collection that contains both File and Path index
entries, but that collection cannot also contain Custom index entries.  This
is because while both File and Path index entries are based on disk files as
indexing targets, Custom index entries are based on database content as an
indexing target, so the configuration of Custom entries cannot coexist in
the same collection.

You can, however, search multiple collections at the same time by specifying
them as a comma-delimited list in the CFSEARCH tag call.

You're going to run into an issue with static content residing in your .cfm
pages.  The documentation suggests that you can index these, and you can,
but you'll get back code in your search results.  You have two choices here:
use Verity's Vspider utility (complicated) or use a content management
system that stores your content in a database, use a Custom index collection
for that content, and then just retrieve and display the content on your
.cfm pages (easy).  This is what we do with Plum.

Best of luck with it all.  If you run into real problems then feel free to
give me a call at 770-446-8866.

Respectfully,

Adam Phillip Churvis
Member of Team Macromedia
http://www.ProductivityEnhancement.com

Download Plum and other cool development tools,
and get advanced intensive Master-level training:

* C# & ASP.NET for ColdFusion Developers
* ColdFusion MX Master Class
* Advanced Development with CFMX and SQL Server 2000

Message: http://www.houseoffusion.com/lists.cfm/link=i:4:207847


RE: Search questions

2005-05-27 Thread Dave Watts
> You can create a single collection that contains both File and Path 
> index entries, but that collection cannot also contain Custom index 
> entries. This is because while both File and Path index entries 
> are based on disk files as indexing targets, Custom index entries
> are based on database content as an indexing target, so the 
> configuration of Custom entries cannot coexist in the same collection.

Actually, this is incorrect. You can use database and filesystem content
within the same Verity collection if you're using the Verity VDK as opposed
to K2. When you search the collection, you can differentiate between
database and filesystem results by looking at the key column, which will
contain a filesystem path if the result points to a file, or whatever you
put in there (presumably a primary key or some custom unique identifier) if
the result points to a database record.
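
A minimal sketch of that key-based check, written in Python for illustration. It assumes the database rows were indexed with an integer primary key as the key value, so anything that is not all digits is treated as a filesystem path. That convention, the function name, and the sample keys are assumptions for the example and not something Verity enforces.

```python
def result_source(key):
    """Classify a search hit as 'database' or 'file' from its key column.

    Assumption: database records were indexed with an integer primary key,
    while file hits carry a filesystem path in the key.
    """
    return "database" if str(key).strip().isdigit() else "file"

# Hypothetical keys as they might come back from a mixed collection.
for key in ["42", "C:\\inetpub\\docs\\guide.pdf", "/var/www/docs/faq.cfm"]:
    print(key, "->", result_source(key))
```

If your custom keys are not numeric (GUIDs, for example), a safer convention is to prefix them with something a path can never start with and test for that prefix instead.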

Dave Watts, CTO, Fig Leaf Software
http://www.figleaf.com/

Fig Leaf Software provides the highest caliber vendor-authorized 
instruction at our training centers in Washington DC, Atlanta, 
Chicago, Baltimore, Northern Virginia, or on-site at your location. 
Visit http://training.figleaf.com/ for more information!


Message: http://www.houseoffusion.com/lists.cfm/link=i:4:207884


Re: Search questions

2005-05-27 Thread Adam Churvis
Yeah, I've had K2 on the brain ever since we standardized on it.

We always set up Verity so that we can directly map VDK collections 1:1 with
K2 aliases, which enables us to migrate to/from K2 as needed without
changing anything in the code.  We have an XML configuration file element
named "search-name" that contains the collection name, which is either
CollectionName_VDK or CollectionName_K2, and our searches just point to the
appropriate collection.  That way if a customer needs to migrate to a host
that doesn't offer K2, or if K2 is giving him a problem for some reason, we
simply change the search-name from _K2 to _VDK and he's set.
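
The naming convention above can be sketched like this, in Python for illustration. Only the "search-name" element and the _K2/_VDK suffixes come from the description; the surrounding XML layout, the "Docs" collection name, and the helper functions are hypothetical, and this is not Plum's actual configuration format.

```python
import xml.etree.ElementTree as ET

# Hypothetical config file content; only <search-name> is from the post.
CONFIG_XML = """
<config>
  <search-name>Docs_K2</search-name>
</config>
"""

def collection_name(config_xml):
    """Read the collection name that searches should point to."""
    root = ET.fromstring(config_xml)
    return root.findtext("search-name").strip()

def switch_engine(name, engine):
    """Swap the _K2/_VDK suffix, e.g. when moving to a host without K2."""
    for suffix in ("_K2", "_VDK"):
        if name.endswith(suffix):
            return name[: -len(suffix)] + "_" + engine
    raise ValueError("collection name has no _K2 or _VDK suffix: " + name)

current = collection_name(CONFIG_XML)
print(current)
print(switch_engine(current, "VDK"))
```

Because the search code only ever reads the name from the config, switching engines is a one-line config change rather than a code change.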

The only way we can do this is by always keeping Custom index entries out of
the collections that contain File and Path index entries.  Hell, I've got
Plum on the brain! ;)

Thanks for pointing this out, Dave.  Sorry for any confusion, folks.

Respectfully,

Adam Phillip Churvis
Member of Team Macromedia
http://www.ProductivityEnhancement.com


Message: http://www.houseoffusion.com/lists.cfm/link=i:4:207938