On Mon, 2005-10-17 at 11:06 -0700, Doug Cutting wrote:
Rod Taylor wrote:
Over night the trackers appeared to have difficulties finding a block
from the datanode and eventually exited. The datanode reports serving
the block mentioned below successfully on several earlier occasions.
Were
Chris Mattmann wrote:
So, one thing it seems is that fields to be indexed, and used in a field
query must be fully lowercase to work? Additionally, it seems that they
can't have symbols in them, such as _, is that correct? Would you guys
consider this to be a bug?
Yes, this sounds like a bug.
Hi Doug,
On 10/17/05 11:38 AM, Doug Cutting [EMAIL PROTECTED] wrote:
Chris Mattmann wrote:
So, one thing it seems is that fields to be indexed, and used in a field
query must be fully lowercase to work? Additionally, it seems that they
can't have symbols in them, such as _, is that
Chris Mattmann wrote:
So, my question to you then
is, what type of QueryFilter should I develop in order to get my query for
contactemail:email address to work as a standalone query? For instance,
right now I'm sub-classing the RawFieldQueryFilter, which doesn't seem to be
the right way to do it
On Mon, 2005-10-17 at 11:46 -0700, Doug Cutting wrote:
Rod Taylor wrote:
The machine the namenode is running on does have very high load at
times. Do you recommend a separate box for the namenode and jobtracker
which runs strictly those items?
That would be optimal, but it shouldn't be
Hi Doug,
Thanks, that worked.
Cheers,
Chris
On 10/17/05 11:56 AM, Doug Cutting [EMAIL PROTECTED] wrote:
Chris Mattmann wrote:
So, my question to you then
is, what type of QueryFilter should I develop in order to get my query for
contactemail:email address to work as a standalone
[ http://issues.apache.org/jira/browse/NUTCH-114?page=all ]
Stefan Groschupf updated NUTCH-114:
---
Attachment: CrawlDbStatMapper.java
As discussed now with UTF8 keys and the text based output format.
getting number of urls and links from crawldb
[
http://issues.apache.org/jira/browse/NUTCH-103?page=comments#action_12332316 ]
Bong Chih How commented on NUTCH-103:
-
I adventitously discover this wonderful application some time ago through JIRA.
It enticed me to download the achieve and install in
u need to untar clusty.tar inside your ROOT directory, jakarta/webapps/ROOT,
and then u need to open the files search.jsp/cluster.jsp and resave them so
that the webserver will pick the new ones, that's about it.
U should spend more like 5 mins on making this one run, probably u r not too
Thanks Robert. I will try it again.
Do I have to configure any settings to point to any nutch plugin?
Have good day.
Bong
On 10/18/05, Robert Benea [EMAIL PROTECTED] wrote:
u need to untar clusty.tar inside your ROOT directory,
jakarta/webapps/ROOT,
and then u need to open the files
yes,
did u enable clustering ?
property
nameplugin.includes/name
valuegeoPosition|protocol-httpclient|urlfilter-regex|protocol-file|parse-(text|html|js|word|rss)|index-basic|query-(phrase|site|url)|
clustering-carrot2/value
descriptionRegular expression naming plugin directory names to
Cheers,
11 matches
Mail list logo