[jira] [Commented] (MAPREDUCE-2715) submitAndMonitorJob() doesn't play nice with MultipleOutputFile

2011-07-20 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068577#comment-13068577 ] Geoffrey Young commented on MAPREDUCE-2715: --- hi :) sorry for my fat fingers

[jira] [Created] (MAPREDUCE-2715) submitAndMonitorJob() doesn't play nice with MultipleOutputFile

2011-07-20 Thread Geoffrey Young (JIRA)
Type: Bug Reporter: Geoffrey Young part of submitAndMonitorJob() balks if the output directory currently exists but is non-empty: Error launching job , Output path already exists : this logic actually conflicts with the ideas behind MultipleOutputFile, where the output file path

[jira] Commented: (SOLR-874) Dismax parser exceptions on trailing OPERATOR

2010-09-30 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12916606#action_12916606 ] Geoffrey Young commented on SOLR-874: - I stumbled on this bug while researching something

Re: Apache2 + Mod Perl 2.0 + Template Toolkit

2010-09-21 Thread Geoffrey Young
hi :) On Tue, Sep 21, 2010 at 5:24 AM, Chris Ray ch...@rayjchris.co.uk wrote: Hello, I'm currently attempting to update an Apache Cluster from Apache 1.3 to Apache2. The website served by this cluster uses Template Toolkit extensively. It is my understanding that the original Apache::Template

Re: camel-casing and dismax troubles

2009-05-13 Thread Geoffrey Young
On Wed, May 13, 2009 at 6:23 AM, Yonik Seeley yo...@lucidimagination.com wrote: On Tue, May 12, 2009 at 7:19 PM, Geoffrey Young ge...@modperlcookbook.org wrote: hi all :) I'm having trouble with camel-cased query strings and the dismax handler. a user query LeAnn Rimes isn't matching

camel-casing and dismax troubles

2009-05-12 Thread Geoffrey Young
hi all :) I'm having trouble with camel-cased query strings and the dismax handler. a user query LeAnn Rimes isn't matching the indexed term Leann Rimes even though both are lower-cased in the end. furthermore, the analysis tool shows a match. the debug query looks like

Re: Security Problems ???

2009-04-01 Thread Geoffrey Young
Torsten Foertsch wrote: On Mon 23 Mar 2009, Philippe M. Chiasson wrote: almost a month ago there was this posting on the users list http://www.gossamer-threads.com/lists/modperl/modperl/99170#99170 stating there was a security related bug in modperl. Since then there were no svn

dismax and WordDelimiterFilterFactory+PreserveOriginal

2009-03-16 Thread Geoffrey Young
hi all :) I have two filters combined with dismax on the query side: WordDelimiterFilterFactory { preserveOriginal=1, generateNumberParts=1, catenateWords=0, generateWordParts=1, catenateAll=0, catenateNumbers=0} followed by lowecase filter factory. the analyzer shows the phrase gUYS and

Re: Security Related BUG within ModPerl 2.0.4

2009-03-01 Thread Geoffrey Young
William A. Rowe, Jr. wrote: I would like to report a security related bug within ModPerl 2.0.4, though prefer not to disclose the details to a public channel. The appropriate secure channel for all ASF related security vulnerability reporting is the closed list, secur...@apache.org, which

Re: Shutdown behavior

2009-02-19 Thread Geoffrey Young
And then we have $r-child_terminate which simply calls exit() at C-level in a request pool cleanup. So, no perl-level cleanup at all is done. fwiw, I've never been happy with the implementation of child_terminate() in 2.0 - in mp1 it was a nice, graceful way of cleaning up a child process,

Re: Apache::Bootstrap to ASF infrastructure

2008-11-25 Thread Geoffrey Young
We don't need to incubate this to import it, all we need is committer concensus. +1 +1 rock on --Geoff - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

filtering on blank OR specific range

2008-11-19 Thread Geoffrey Young
hi all :) I'm having difficultly filtering my documents when a field is either blank or set to a specific value. I would have thought this would work fq=-Type:[* TO *] OR Type:blue which I would expect to find all document where either Type is undefined or Type is blue. my actual result set

Re: filtering on blank OR specific range

2008-11-19 Thread Geoffrey Young
Lance Norskog wrote: Try: Type:blue OR -Type:[* TO *] You can't have a negative clause at the beginning. Yes, Lucene should barf about this. I did try that, before and again now, and still no luck. anything else? --Geoff

Re: [PATCH] prototypes/constant redefinition warnings from ModPerl::Util

2008-11-12 Thread Geoffrey Young
Casey West wrote: On Sat, Nov 8, 2008 at 8:55 PM, Geoffrey Young [EMAIL PROTECTED] wrote: Geoffrey Young wrote: -undef $fullname; +delete ${$fullname}{CODE}; I swear there is some p5p history here that treats that slot differently when it's undef versus exists

Re: [PATCH] prototypes/constant redefinition warnings from ModPerl::Util

2008-11-08 Thread Geoffrey Young
-undef $fullname; +delete ${$fullname}{CODE}; I swear there is some p5p history here that treats that slot differently when it's undef versus exists. IIRC an early 5.8.0 iteration broke mod_perl because of exactly this (or, if you prefer, mod_perl exposed a feature of

Re: [PATCH] prototypes/constant redefinition warnings from ModPerl::Util

2008-11-08 Thread Geoffrey Young
Geoffrey Young wrote: -undef $fullname; +delete ${$fullname}{CODE}; I swear there is some p5p history here that treats that slot differently when it's undef versus exists. IIRC an early 5.8.0 iteration broke mod_perl because of exactly this (or, if you prefer

Re: $r-status no correct

2008-10-29 Thread Geoffrey Young
Michael Peters wrote: Ryan Gies wrote: Below is an Apache log snippet which traces the handler phases for two requests: A) /scratch/ does NOT have a directory index B) /scratch/foo/ DOES have a directory index (index.html) So here is mine (the text for die() is A

Re: AC US 2008

2008-10-22 Thread Geoffrey Young
Philip M. Gollucci wrote: Hi All, wondering who is going to present at the Apache US 2008 conference in New Orleans. I'll be there 11/2 - 11/9 I'm presenting on wednesday afternoon: http://us.apachecon.com/c/acus2008/sessions/4 --Geoff

[jira] Created: (SOLR-809) synonyms are matched case-insensitive but inserted as case-sensitive

2008-10-15 Thread Geoffrey Young (JIRA)
Reporter: Geoffrey Young Priority: Minor I had a synonyms file with this line 2, to, two, too, II (that's a capital ii last in the list) and an index with filter class=solr.LowerCaseFilterFactory/ filter class=solr.SynonymFilterFactory synonyms=synonyms.txt

Re: solr 1.3 snapshooter doesn't work, commit never ending

2008-10-15 Thread Geoffrey Young
sunnyfr wrote: I tried last evening before leaving and this morning time elapsed was very important like you can notice above and no snapshot, no error in the logs. I'm actually having a similar trouble. I've enabled postCommit and postOptimize hooks with an absolute path to snapshooter.

Re: using DataImportHandler instead of POST?

2008-10-03 Thread Geoffrey Young
Chris Hostetter wrote: : I chugg away at 1.5 million records in a single file, but solr never : commits. specifically, it ignores my autocommit settings. (I can : commit separately at the end, of course :) the way the autocommit settings work is soemthing i always get confused by --

Re: using DataImportHandler instead of POST?

2008-10-01 Thread Geoffrey Young
Geoffrey Young wrote: Chris Hostetter wrote: : I have a well-formed xml file, suitable for POSTting to solr. that : works just fine. it's very large, though, and using curl in production : is so very lame. is there a very simple config that will let solr just : slurp up the file via

Re: using DataImportHandler instead of POST?

2008-09-29 Thread Geoffrey Young
Chris Hostetter wrote: : I have a well-formed xml file, suitable for POSTting to solr. that : works just fine. it's very large, though, and using curl in production : is so very lame. is there a very simple config that will let solr just : slurp up the file via the DataImportHandler?

using DataImportHandler instead of POST?

2008-09-28 Thread Geoffrey Young
hi all :) I'm sorry I need to ask this, but after reading and re-reading the wiki I don't see a clear path... I have a well-formed xml file, suitable for POSTting to solr. that works just fine. it's very large, though, and using curl in production is so very lame. is there a very simple

Re: mapping URI to filename.

2008-08-19 Thread Geoffrey Young
Anthony R Fletcher wrote: Thanks but that only works for the current request. I was looking for a way of translating some other URI to a pathname on the same server. $r-lookup_uri('/some/uri')-filename() http://perl.apache.org/docs/2.0/api/Apache2/SubRequest.html#C_lookup_uri_ --Geoff

Re: mapping URI to filename.

2008-08-18 Thread Geoffrey Young
Anthony R Fletcher wrote: Using mod_perl2, is there a way of mapping a URI to a filename or directory. I can get the root directory for the server and I can append the URI to it but this doesn't taking to account aliases, UserDir directives etc.. Any suggestions? $r-filename()

[Fwd: ApacheCon US 2008 committer registration discount now available!]

2008-07-28 Thread Geoffrey Young
just in case some of our committers are not part of the [EMAIL PROTECTED] alias, this is for you. the full apachecon schedule can be viewed here: http://us.apachecon.com/c/acus2008/schedule/grid --Geoff Original Message Subject: ApacheCon US 2008 committer registration

Re: spellchecker problems (bugs)

2008-07-25 Thread Geoffrey Young
This issue has been fixed in the trunk. Can you please use the latest trunk code and try? current trunk looks good. thanks! --Geoff

Re: Multiple search components in one handler - ie spellchecker

2008-07-25 Thread Geoffrey Young
Andrew Nagy wrote: Hello - I am attempting to add the spellCheck component in my search requesthandler so when a users does a search, they get the results and spelling corrections all in one query just like the way the facets work. I am having some trouble accomplishing this - can anyone

Re: Multiple search components in one handler - ie spellchecker

2008-07-25 Thread Geoffrey Young
Andrew Nagy wrote: Thanks for getting back to me Geoff. Although, that is pretty much what I have. Maybe if I show my solrconfig someone might be able to point out what I have incorrect? The problem is that nothing related to the spelling options are show in the results, just the normal

[jira] Created: (SOLR-648) SpellcheckComponent throws NullPointerException on restart without build

2008-07-23 Thread Geoffrey Young (JIRA)
: Bug Affects Versions: 1.3 Reporter: Geoffrey Young from http://mail-archives.apache.org/mod_mbox/lucene-solr-user/200807.mbox/[EMAIL PROTECTED]: I believe there is a bug in IndexBased- and FileBasedSpellChecker.java where the analyzer variable is only set on the build command

[jira] Updated: (SOLR-648) SpellcheckComponent throws NullPointerException on restart without build

2008-07-23 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Geoffrey Young updated SOLR-648: Description: from http://mail-archives.apache.org/mod_mbox/lucene-solr-user/200807.mbox/[EMAIL

Re: spell-checker and faceting

2008-07-23 Thread Geoffrey Young
dudes dudes wrote: Hi, I'm trying to couple spell-checking mechanism with faceting in one url statement.. I can get the spell check right, but the facet doesn't work when it's combined with spell-checker...

Re: spellchecker problems (bugs)

2008-07-23 Thread Geoffrey Young
Jonathan Lee wrote: I don't see the patch attached to my original email either -- does solr-user not allow attachments? This is ugly, but here's the patch inline: issue created in jira: https://issues.apache.org/jira/browse/SOLR-648 --Geoff

Re: spellchecker problems (bugs)

2008-07-22 Thread Geoffrey Young
Shalin Shekhar Mangar wrote: The problems you described in the spellchecker are noted in https://issues.apache.org/jira/browse/SOLR-622 -- I shall create an issue to synchronize spellcheck.build so that the index is not corrupted. I'd like to discuss this a little... I'm not sure that I

Re: AT Bug

2008-07-16 Thread Geoffrey Young
Now, AT experts are needed to mull over: 1) why don't the error message show up in the error_log 2) is the enclosed patch a sufficient cure or does it rather hide a real problem (why is $ENV{PATH} undef in the first place) The strace output for that process starts with these lines: setsid()

Re: AT Bug

2008-07-16 Thread Geoffrey Young
Torsten Foertsch wrote: On Wed 16 Jul 2008, Geoffrey Young wrote: I'd be interested in recording all the values passed in and see if they match during start and restart. here you are. The attached file is written with this print statement: print $f $$=.(syscall 39).: .($path

Re: [RFC] Basic LoadFile support for Apache::Test

2008-07-15 Thread Geoffrey Young
Torsten Foertsch wrote: Hi, I don't know much about Apache::Test guts. Hence I ask for review before committing this patch. It treats LoadFile directives similar to LoadModule. One can skip them by adding the name as specified in the httpd.conf to the skip list (not tested). LoadFile

[Fwd: [rt.cpan.org #37592]]

2008-07-10 Thread Geoffrey Young
Original Message Subject: [rt.cpan.org #37592] Date: Thu, 10 Jul 2008 12:15:12 -0400 From: Doug via RT [EMAIL PROTECTED] Reply-To: [EMAIL PROTECTED] To: undisclosed-recipients:; References: [EMAIL PROTECTED] Thu Jul 10 12:15:09 2008: Request 37592 was acted upon. Transaction:

Re: problems with SpellCheckComponent

2008-07-08 Thread Geoffrey Young
When I made: http://localhost:8080/solr/spellCheckCompRH?q=*:*spellcheck.q=ruckspellcheck=true I have this exception: Estado HTTP 500 - null java.lang.NullPointerException at org.apache.solr.handler.component.SpellCheckComponent.getTokens(SpellCheckComponent.java:217) I see this all the

Re: problems with SpellCheckComponent

2008-07-08 Thread Geoffrey Young
Shalin Shekhar Mangar wrote: Hi Geoff, I can't find anything in the code which would give this exception when both q and spellcheck.q is specified. Though, this exception is certainly possible when you restart solr. Anyways, I'll look into it more deeply. great, thanks. There are a few

Re: Apache2::Directives trouble with Apache::Test

2008-07-06 Thread Geoffrey Young
Mark Hedges wrote: Ugh, I am so flummoxed by this. How do I get the Apache2::Controller::Directives module to work from t/conf/extra.conf.in? Try commenting out the PerlSetVar for the same named thing and enabling the line to load the directives and set A2CRenderTemplateDir. Wacky! I will

Re: CPAN tests fail Makefile.PL using Apache::TestMM

2008-07-04 Thread Geoffrey Young
Mark Hedges wrote: On Thu, 3 Jul 2008, Fred Moyer wrote: I've been getting cpan-testers results like this. What should I do? Thanks... --m-- See sub test in: http://search.cpan.org/src/PHRED/Apache-Dispatch-0.10/Makefile.PL Wouldn't that Makefile.PL die if mod_perl is not installed,

[jira] Commented: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter

2008-07-03 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12610229#action_12610229 ] Geoffrey Young commented on SOLR-14: looks good from a functional pov. the ordering

[jira] Commented: (SOLR-606) spellcheck.colate doesn't handle multiple tokens properly

2008-07-01 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12609607#action_12609607 ] Geoffrey Young commented on SOLR-606: - sure :) the choice of keywords is intentional. I

[jira] Updated: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter

2008-06-30 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Geoffrey Young updated SOLR-14: --- Attachment: SOLR-14.patch this new patch addresses three additional cases o words prefixed

Re: Apache::Test problem

2008-06-30 Thread Geoffrey Young
Torsten Foertsch wrote: Hi, I have mod_proxy_html in my httpd.conf. This requires an additional LoadFile directive to load libxml prior to LoadModule. Unfortunately the httpd.conf generated by the test framework includes the LoadModule mod_proxy_html but omits the LoadFile libxml.so. I

Re: Apache::Template for modperl 2

2008-06-28 Thread Geoffrey Young
Fred Moyer wrote: Senthil V wrote: Hi All, I am trying to install Apache::Template for Apache 2.2.8. http://2.2.8. Whether the porting of Apache::Template is done for modperl 2. If its done, plz let me know from where i can get the source. I just took a look at Apache::Template, and it is

[jira] Commented: (SOLR-606) spellcheck.colate doesn't handle multiple tokens properly

2008-06-27 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12608770#action_12608770 ] Geoffrey Young commented on SOLR-606: - results with your patch applied: {noformat

Re: SpellCheckerRequestHandler qt parameter

2008-06-27 Thread Geoffrey Young
I had null pointer exceptions left and right while composing this email... then I added spellcheck.build=true to one and they went away. do you need to rebuild the spelling index every time you alter (certain parts) of solrconfig.xml? it was very consistent as reported below, but after

[jira] Commented: (SOLR-572) Spell Checker as a Search Component

2008-06-26 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12608487#action_12608487 ] Geoffrey Young commented on SOLR-572: - I'm seeing random weirdness in the collation

[jira] Commented: (SOLR-606) spellcheck.colate doesn't handle multiple tokens properly

2008-06-26 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12608614#action_12608614 ] Geoffrey Young commented on SOLR-606: - I'm not in charge of any of the environments, so

Re: SpellCheckerRequestHandler qt parameter

2008-06-26 Thread Geoffrey Young
Norberto Meijome wrote: Hi there, Short and sweet : Is SCRH intended to honour qt= ? longer... I'm testing the newest SCRH ( SOLR-572), using last night's nightly build. I have defined a 'dismax' request handler which searches across a number of fields. When I use the SCRH in a query,

Re: SpellCheckerRequestHandler qt parameter

2008-06-26 Thread Geoffrey Young
Grant Ingersoll wrote: On Jun 26, 2008, at 5:25 PM, Geoffrey Young wrote: well *almost* - it works most excellently with q=$term but when I add spellchecker.q=$term things implode: HTTP Status 500 - null java.lang.NullPointerException at org .apache .solr .handler

Re: mod_perl caching problem

2008-06-24 Thread Geoffrey Young
william wrote: I like to add on something, I had put the pragma use strict; use warnings; on all modules that I had, but I didn't get the warnings of Variable $foo will not stay shared at... . What other possibility that might cause my program to cache the result even when the input has

Re: [MP2]: setting group for a request (require group ...)

2008-06-23 Thread Geoffrey Young
titetluc titetluc wrote: Geoffrey, André, Thank you for your answer. Conclusion: I will have to: . write my own PerlAuthzHandler yes . define a new directive to define my group no - you can overload the Requires directive. the example I pointed you to shows you how:

Re: [MP2]: setting group for a request (require group ...)

2008-06-19 Thread Geoffrey Young
I would like to use my module in another configuration where group is checked Location /test_group PerlAuthHandler My::Auth Require group group1 /Location I can not find any mod_perl API method (Apache2::RequestRec::group ?) to set the group. that's right. you have control over the

Re: missing document count?

2008-06-18 Thread Geoffrey Young
Chris Hostetter wrote: : not hard, but useful information to have handy without additional : manipulations on my part. : our pages are the results of multiple queries. so, given a max number of : records per page (or total), the rows asked of query2 is max - query1, of in the common case,

[jira] Updated: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter

2008-06-17 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Geoffrey Young updated SOLR-14: --- Attachment: SOLR-14.patch ok, I've given this a shot. I'm an an open-source guy, even an ASF guy

[jira] Commented: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter

2008-06-14 Thread Geoffrey Young (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12605063#action_12605063 ] Geoffrey Young commented on SOLR-14: this is fairly important for our ongoing

Re: return value for response handler

2008-06-12 Thread Geoffrey Young
Torsten Foertsch wrote: On Tue 10 Jun 2008, John ORourke wrote: I had a bug with an interesting side effect which I want to understand. I wanted to redirect the user's browser, and in my response handler I was setting $r-status(302) but returning Apache2::Const::OK instead of the correct

Re: searching only within allowed documents

2008-06-11 Thread Geoffrey Young
Solr allows you to specify filters in separate parameters that are applied to the main query, but cached separately. q=the user queryfq=folder:f13fq=folder:f24 I've been wanting more explanation around this for a while, so maybe now is a good time to ask :) the cached separately verbiage

[Fwd: [rt.cpan.org #36305] Overridden get_basic_credentials broken w/ NTLM Authentication]

2008-05-30 Thread Geoffrey Young
Original Message Subject: [rt.cpan.org #36305] Overridden get_basic_credentials broken w/ NTLM Authentication Date: Fri, 30 May 2008 13:43:17 -0400 From: RFRANKEL via RT [EMAIL PROTECTED] Reply-To: [EMAIL PROTECTED] To: undisclosed-recipients:; References: [EMAIL PROTECTED]

Re: Bug: Character sets and $r-custom_response

2008-05-23 Thread Geoffrey Young
Clinton Gormley wrote: Hi all There seems to be a bug in the mod_perl2/apache2 handling of character sets for $r-custom_response(). I'm not sure which is at fault. My pages are all in UTF8, but I can't find a way to set this character set for custom generated error pages. I've tried: -

adding expand=true to WordDelimiterFilter

2008-05-19 Thread Geoffrey Young
hi :) I'm having an interesting problem with my data. in general, I want the results of the WordDelimiterFilter for better matching, but there are times when it's just too aggressive. for example boys2men = boys 2 men (good) p!nk = pnk (maybe) !!! = (nothing - bad) there's

Re: adding expand=true to WordDelimiterFilter

2008-05-19 Thread Geoffrey Young
Chris Hostetter wrote: by expand=true it sounds like you mean you are looking for a way to preserve the orriginal term without any characteres removed. yes, that's it. This sounds like SOLR-14 ... you might want to take a look at it, and see if the patch is still useable, and if not see

Re: token concat filter?

2008-05-08 Thread Geoffrey Young
Otis Gospodnetic wrote: Geoff, Whether synonyms are applied at index time or query time is controlled via schema.xml - it depends on where you put the synonym factory, whether in the index-time or query-time section of a fieldType. Synonyms are read once on start, I believe. It might be

Re: token concat filter?

2008-05-08 Thread Geoffrey Young
Otis Gospodnetic wrote: There is actually a Wiki page explaining this pretty well... have you seen it? I guess not. I've been reading the wiki, but the trouble with wiki's always seems to be (for me) finding stuff. can you point it out? Index-time expansion means larger indices and

Re: Sort results on a field not ordered

2008-05-02 Thread Geoffrey Young
Erik Hatcher wrote: What field type is chapterTitle? I'm betting it is an analyzed field with multiple values (tokens/terms) per document. To successfully sort, you'll need to have a single value per document - using copyField can help with this to have both a searchable field and a

token concat filter?

2008-05-01 Thread Geoffrey Young
hi :) I'm looking for a filter that will compress all tokens into a single token. the WordDelimiterFilterFactory does it for tokens it finds itself, but not ones passed to it. basically, I'm trying to match Radiohead in the index with radio head in the query. if it were spelled

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Yonik Seeley wrote: If there are only a few such cases, it might be better to use synonyms to correct them. unfortunately, there are too many to handle this way. Off the top of my head there's no concatenating token filter, but it wouldn't be hard to make one. hmm, ok. I'm not a java

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Walter Underwood wrote: I've been doing it with synonyms and I have several hundred of them. I'm dealing mostly with proper names, so I expect more like 80k of them for our data :) Concatenating bi-word groups is pretty useful for English. We have a habit of gluing words together.

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Walter Underwood wrote: I doubt it would be that many. I recommend tracking the searches and the clicks, and working on queries with low clickthrough. the trouble is I'm in a dynamic biz - last weeks popular clicks are very different from this weeks, so by the time I analyze last weeks

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Otis Gospodnetic wrote: Geoff, Whether synonyms are applied at index time or query time is controlled via schema.xml - it depends on where you put the synonym factory, whether in the index-time or query-time section of a fieldType. Synonyms are read once on start, I believe. It might be

[Fwd: HELP-can you help me find MODPERL programmers? in BAY AREA PLEEZE]

2008-04-29 Thread Geoffrey Young
apparently, due diligence doesn't cover proper spelling or YELLING. but for the interested... --Geoff Original Message Subject: HELP-can you help me find MODPERL programmers? in BAY AREA PLEEZE Date: Tue, 29 Apr 2008 15:29:59 -0500 From: Morse, Patrice [EMAIL PROTECTED] I

Re: Got parseException when search keyword AND on a text field

2008-04-24 Thread Geoffrey Young
Otis Gospodnetic wrote: Not in one place and documented. The place to look are query parsers, but things like AND OR NOT TO are the ones to look out for. this seems like something solr ought to handle gracefully on the backend for me - if I need to write logic to make sure a malicious

another spellchecker question

2008-04-23 Thread Geoffrey Young
hi :) I've noticed that (with solr 1.2) the returned order (as well as the actual matched set) is affected by the number of matches you ask for: q=hannasuggestionCount=1 suggestions:[Yanna] q=hannasuggestionCount=2 suggestions:[Manna, Yanna] q=hannasuggestionCount=5

Re: another spellchecker question

2008-04-23 Thread Geoffrey Young
Shalin Shekhar Mangar wrote: Hi Geoffrey, Yes, this is a caveat in the lucene contrib spellchecker which Solr uses. From the lucene spell checker javadocs: * pAs the Lucene similarity that is used to fetch the most relevant n-grammed terms * is not the same as the edit distance strategy

Re: Porting Bricolage to mp2: TransHandler Interference

2008-04-23 Thread Geoffrey Young
Under mod_perl2, however, the same request looks like this: 75749 Apache2::RequestRec=SCALAR(0x29f3300) TransHandler start for /workflow/profile/desk/101/101/ 75749 Apache2::RequestRec=SCALAR(0x29f3300) TransHandler finish for /workflow/profile/desk/101/101/ 75749

Re: config for very frequent solr updates

2008-04-18 Thread Geoffrey Young
found the distributed search docs from there and will keep that in mind as I move forward. --Geoff Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Geoffrey Young [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Thursday, April 17, 2008

Re: What happened with Apache2::Reload in mod_perl-2.0.4?

2008-04-18 Thread Geoffrey Young
Fred Moyer wrote: Niels van Dijke wrote: Hi mod_perl maintainers, Thank you for releasing mod_perl 2.0.4. I was wondering what happened to Apache2::Reload? Was it missed in packaging for the upload to CPAN? If so then there may be other files missing too. See the Changelog:

Re: What happened with Apache2::Reload in mod_perl-2.0.4?

2008-04-18 Thread Geoffrey Young
Fred Moyer wrote: Niels van Dijke wrote: Hi mod_perl maintainers, Thank you for releasing mod_perl 2.0.4. I was wondering what happened to Apache2::Reload? Was it missed in packaging for the upload to CPAN? If so then there may be other files missing too. See the Changelog:

config for very frequent solr updates

2008-04-17 Thread Geoffrey Young
hi all :) I didn't see any documentation on this, so I was wondering what the experience here was with updating solr with a small but constant trickle of daemon-style updates. unfortunately, it's a business requirement that backend db updates make it to search as the changes roll in (5

Re: [RELEASE CANDIDATE] mod_perl-2.0.4 RC1

2008-04-15 Thread Geoffrey Young
Philippe M. Chiasson wrote: The mod_perl 2.0.4 release candidate 1 Works with Perl 5.10 is ready. It can be downloaded here: http://www.apache.org/~gozer/mp2/mod_perl-2.0.4-rc1.tar.gz tests fine on my system. +1 --Geoff

Re: [RELEASE CANDIDATE] mod_perl-2.0.4 RC1

2008-04-15 Thread Geoffrey Young
Philippe M. Chiasson wrote: The mod_perl 2.0.4 release candidate 1 Works with Perl 5.10 is ready. It can be downloaded here: http://www.apache.org/~gozer/mp2/mod_perl-2.0.4-rc1.tar.gz tests fine on my system. +1 --Geoff

Re: Handing over the POST input to a virtual included part?

2008-04-14 Thread Geoffrey Young
Torsten Foertsch wrote: Hi, is it possible to hand over the POST input of a SSI document to a virtual included part of it? I have an SSI document that contains !--#include virtual=/ptest/ptest.pl?$QUERY_STRING -- This way I can pass parameters that are passed in the URI to the

Re: ModPerl::MM or ExtUtils::MakeMaker or what?

2008-04-03 Thread Geoffrey Young
Torsten Foertsch wrote: Hi, I am a bit confused about what MakeMaker is best to use with mp2 modules. There is the good old ExtUtils::MakeMaker that is mentioned in http://perl.apache.org/docs/general/testing/testing.html. How I got there I don't remember but I have used in all my modules

perl at apachecon

2008-04-02 Thread Geoffrey Young
hi all :) perl's (and mod_perl's) presence at apachecon has been dwindling over the years. I won't get into a rant about why this might be, so please don't you do it either :) anyway, if you have any interest in attending apachecon us this year, please login to the apachecon site and take

perl at apachecon

2008-04-02 Thread Geoffrey Young
hi all :) perl's (and mod_perl's) presence at apachecon has been dwindling over the years. I won't get into a rant about why this might be, so please don't you do it either :) anyway, if you have any interest in attending apachecon us this year, please login to the apachecon site and take

[Fwd: [rt.cpan.org #34470] make test trying to load os2 process on cygwin]

2008-03-27 Thread Geoffrey Young
Original Message Subject: [rt.cpan.org #34470] make test trying to load os2 process on cygwin Date: Thu, 27 Mar 2008 09:25:43 -0400 From: SatheeshKumar Mohan via RT [EMAIL PROTECTED] Reply-To: [EMAIL PROTECTED] To: undisclosed-recipients:; References: [EMAIL PROTECTED]

Re: [rt.cpan.org #34470] make test trying to load os2 process on cygwin

2008-03-27 Thread Geoffrey Young
Can't locate loadable object for module OS2::Process in @INC (@INC contains: C:\cygwin\opt\spikesource\var\tmp\portage\apache-2.2.3\work\Apache-Test-1.28\blib\lib C:\cygwin\opt\spikesource\var\tmp\portage\apache-2.2.3\work\Apache-Test-1.28\blib\arch

Re: utf8 urls

2008-03-19 Thread Geoffrey Young
John ORourke wrote: Eli Shemer wrote: For some reason the following test doesn’t print anything out to the screen Do I need to change something in the apache configuration, or mod_perl’s ? /articles_read.pl?id=חוזרת ## get http parameters $r = shift; $apr =

Re: schema help

2008-03-12 Thread Geoffrey Young
Rachel McConnell wrote: Our Solr use consists of several rather different data types, some of which have one-to-many relationships with other types. We don't need to do any searching of quite the kind you describe, but I have an idea about it, depending on what you need to do with the book

Re: schema help

2008-03-12 Thread Geoffrey Young
the trouble I'm having is one of dimension. an author has many, many attributes (name, birthdate, biography in $language, etc). as does each book (title in $language, summary in $language, genre, etc). as does each library (name, address, directions in $language, etc). so an author with N

schema help

2008-03-11 Thread Geoffrey Young
hi :) I'm trying to work out a schema for our widgets. more than just coming up with something I'd like something idiomatic in solr terms. any help is much appreciated. here's a similar problem space to what I'm working with... lets say we're talking books. books are written by authors

Re: schema help

2008-03-11 Thread Geoffrey Young
. --Geoff Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Geoffrey Young [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Tuesday, March 11, 2008 12:17:32 PM Subject: schema help hi :) I'm trying to work out a schema for our widgets

[Fwd: Call for Papers Opens for ApacheCon US 2008 (pls forward)]

2008-03-07 Thread Geoffrey Young
Original Message Subject: Call for Papers Opens for ApacheCon US 2008 (pls forward) Date: Sat, 01 Mar 2008 21:49:34 -0500 From: Shane Curcuru [EMAIL PROTECTED] To: [EMAIL PROTECTED] Help drive more CFP submissions - send this to your users@ lists to help drive awareness.

[Fwd: [rt.cpan.org #33765] Fix for 5.10]

2008-03-03 Thread Geoffrey Young
Original Message Subject: [rt.cpan.org #33765] Fix for 5.10 Date: Sun, 02 Mar 2008 20:10:56 -0500 From: Reini Urban via RT [EMAIL PROTECTED] Reply-To: [EMAIL PROTECTED] To: undisclosed-recipients:; References: [EMAIL PROTECTED] Sun Mar 02 20:10:54 2008: Request 33765 was

Re: return DECLINED or OK?

2008-02-28 Thread Geoffrey Young
J. Peng wrote: At what cases should we return a DECLINED or a OK from a handler? I saw the handler of PerlTransHandler returns a Apache2::Const::DECLINED but dont know why. see the introduction to part III here: http://www.modperlcookbook.org/chapters.html in general, trans handlers

Re: Time for a 2.0 RC ?

2008-02-24 Thread Geoffrey Young
Philippe M. Chiasson wrote: Since the last few patches for 5.10, I haven't heard of anything bad hapenning. Since Perl 5.10 has already been out for a while, I'd like to volounteer to RC the next mod_perl 2.0, assuming it *does* work correctly with Perl 5.10 by now. +1 thanks for

  1   2   3   4   5   6   7   8   9   10   >