[jira] Created: (LUCENE-903) FilteredQuery explanation inaccuracy with boost

2007-06-03 Thread Doron Cohen (JIRA)
FilteredQuery explanation inaccuracy with boost
---

 Key: LUCENE-903
 URL: https://issues.apache.org/jira/browse/LUCENE-903
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Affects Versions: 2.2
Reporter: Doron Cohen
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2


The value of explanation is different than the product of its part if boost  1.
This is exposed after tightening the explanation check (part of LUCENE-446).



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-903) FilteredQuery explanation inaccuracy with boost

2007-06-03 Thread Doron Cohen (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doron Cohen updated LUCENE-903:
---

Attachment: lucene-903.patch

Attached lucene-903.patch change the explanation test in CheckHits to verify 
that the explanation value matches the explanation details. This new test makes 
assumptions on the description of an explanation that has multi sub-details, 
which might be controversial: the description is assumed to comply with one of:
  - end with product of:
  - end with sum of:
  - end with sum of:
  - contain max plus x times others (where x is float)
While this seems to cover all the current scoring combinations, future scorers 
might not fit in.
We can always expand this list of course.

With this check, TestSimpleExplanations fails for FilteredQuery - testFQ6(). 
Fix included.

Also failing is TestBoostingTermQuery, just because its explanation had no 
product of:.
Fix included.

All tests pass.

 FilteredQuery explanation inaccuracy with boost
 ---

 Key: LUCENE-903
 URL: https://issues.apache.org/jira/browse/LUCENE-903
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Affects Versions: 2.2
Reporter: Doron Cohen
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-903.patch


 The value of explanation is different than the product of its part if boost  
 1.
 This is exposed after tightening the explanation check (part of LUCENE-446).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-903) FilteredQuery explanation inaccuracy with boost

2007-06-03 Thread Doron Cohen (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doron Cohen updated LUCENE-903:
---

Lucene Fields: [Patch Available]  (was: [New])

 FilteredQuery explanation inaccuracy with boost
 ---

 Key: LUCENE-903
 URL: https://issues.apache.org/jira/browse/LUCENE-903
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Affects Versions: 2.2
Reporter: Doron Cohen
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-903.patch


 The value of explanation is different than the product of its part if boost  
 1.
 This is exposed after tightening the explanation check (part of LUCENE-446).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[EMAIL PROTECTED]: Project lucene-java (in module lucene-java) failed

2007-06-03 Thread Jason van Zyl
To whom it may engage...

This is an automated request, but not an unsolicited one. For 
more information please visit http://gump.apache.org/nagged.html, 
and/or contact the folk at [EMAIL PROTECTED]

Project lucene-java has an issue affecting its community integration.
This issue affects 4 projects,
 and has been outstanding for 6 runs.
The current state of this project is 'Failed', with reason 'Build Failed'.
For reference only, the following projects are affected by this:
- eyebrowse :  Web-based mail archive browsing
- jakarta-lucene :  Java Based Search Engine
- jakarta-slide :  Content Management System based on WebDAV technology
- lucene-java :  Java Based Search Engine


Full details are available at:
http://vmgump.apache.org/gump/public/lucene-java/lucene-java/index.html

That said, some information snippets are provided here.

The following annotations (debug/informational/warning/error messages) were 
provided:
 -DEBUG- Sole output [lucene-core-03062007.jar] identifier set to project name
 -DEBUG- Dependency on javacc exists, no need to add for property javacc.home.
 -INFO- Failed with reason build failed
 -INFO- Failed to extract fallback artifacts from Gump Repository



The following work was performed:
http://vmgump.apache.org/gump/public/lucene-java/lucene-java/gump_work/build_lucene-java_lucene-java.html
Work Name: build_lucene-java_lucene-java (Type: Build)
Work ended in a state of : Failed
Elapsed: 20 secs
Command Line: /opt/jdk1.5/bin/java -Djava.awt.headless=true 
-Xbootclasspath/p:/usr/local/gump/public/workspace/xml-commons/java/external/build/xml-apis.jar:/usr/local/gump/public/workspace/xml-xerces2/build/xercesImpl.jar
 org.apache.tools.ant.Main -Dgump.merge=/x1/gump/public/gump/work/merge.xml 
-Dbuild.sysclasspath=only -Dversion=03062007 
-Djavacc.home=/usr/local/gump/packages/javacc-3.1 package 
[Working Directory: /usr/local/gump/public/workspace/lucene-java]
CLASSPATH: 
/opt/jdk1.5/lib/tools.jar:/usr/local/gump/public/workspace/lucene-java/build/classes/java:/usr/local/gump/public/workspace/lucene-java/build/classes/demo:/usr/local/gump/public/workspace/lucene-java/build/classes/test:/usr/local/gump/public/workspace/ant/dist/lib/ant-jmf.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-swing.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-apache-resolver.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-trax.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-junit.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-launcher.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-nodeps.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant.jar:/usr/local/gump/packages/junit3.8.1/junit.jar:/usr/local/gump/public/workspace/xml-commons/java/build/resolver.jar:/usr/local/gump/packages/je-1.7.1/lib/je.jar:/usr/local/gump/public/workspace/jakarta-commons/digester/dist/commons-digester.jar:/usr/local/gump/public/workspace/jakarta-regexp/build/jakarta-regexp-03062007.jar:/usr/local/gump/packages/javacc-3.1/bin/lib/javacc.jar:/usr/local/gump/public/workspace/jline/target/jline-0.9.92-SNAPSHOT.jar:/usr/local/gump/packages/jtidy-04aug2000r7-dev/build/Tidy.jar:/usr/local/gump/public/workspace/junit/dist/junit-03062007.jar:/usr/local/gump/public/workspace/xml-commons/java/external/build/xml-apis-ext.jar
-
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac] DatabaseEntry key = new DatabaseEntry(new byte[0]);
[javac] ^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:171:
 cannot find symbol
[javac] symbol  : class DatabaseEntry
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac] DatabaseEntry data = new DatabaseEntry((byte[]) 
null);
[javac] ^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:171:
 cannot find symbol
[javac] symbol  : class DatabaseEntry
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac] DatabaseEntry data = new DatabaseEntry((byte[]) 
null);
[javac]  ^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:178:
 cannot find symbol
[javac] symbol  : variable DbConstants
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac]DbConstants.DB_SET_RANGE | flags) != 
DbConstants.DB_NOTFOUND)
[javac]^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:178:
 cannot find symbol
[javac] symbol  : variable DbConstants
[javac] location: class org.apache.lucene.store.db.DbDirectory

[EMAIL PROTECTED]: Project lucene-java (in module lucene-java) failed

2007-06-03 Thread Jason van Zyl
To whom it may engage...

This is an automated request, but not an unsolicited one. For 
more information please visit http://gump.apache.org/nagged.html, 
and/or contact the folk at [EMAIL PROTECTED]

Project lucene-java has an issue affecting its community integration.
This issue affects 4 projects,
 and has been outstanding for 6 runs.
The current state of this project is 'Failed', with reason 'Build Failed'.
For reference only, the following projects are affected by this:
- eyebrowse :  Web-based mail archive browsing
- jakarta-lucene :  Java Based Search Engine
- jakarta-slide :  Content Management System based on WebDAV technology
- lucene-java :  Java Based Search Engine


Full details are available at:
http://vmgump.apache.org/gump/public/lucene-java/lucene-java/index.html

That said, some information snippets are provided here.

The following annotations (debug/informational/warning/error messages) were 
provided:
 -DEBUG- Sole output [lucene-core-03062007.jar] identifier set to project name
 -DEBUG- Dependency on javacc exists, no need to add for property javacc.home.
 -INFO- Failed with reason build failed
 -INFO- Failed to extract fallback artifacts from Gump Repository



The following work was performed:
http://vmgump.apache.org/gump/public/lucene-java/lucene-java/gump_work/build_lucene-java_lucene-java.html
Work Name: build_lucene-java_lucene-java (Type: Build)
Work ended in a state of : Failed
Elapsed: 20 secs
Command Line: /opt/jdk1.5/bin/java -Djava.awt.headless=true 
-Xbootclasspath/p:/usr/local/gump/public/workspace/xml-commons/java/external/build/xml-apis.jar:/usr/local/gump/public/workspace/xml-xerces2/build/xercesImpl.jar
 org.apache.tools.ant.Main -Dgump.merge=/x1/gump/public/gump/work/merge.xml 
-Dbuild.sysclasspath=only -Dversion=03062007 
-Djavacc.home=/usr/local/gump/packages/javacc-3.1 package 
[Working Directory: /usr/local/gump/public/workspace/lucene-java]
CLASSPATH: 
/opt/jdk1.5/lib/tools.jar:/usr/local/gump/public/workspace/lucene-java/build/classes/java:/usr/local/gump/public/workspace/lucene-java/build/classes/demo:/usr/local/gump/public/workspace/lucene-java/build/classes/test:/usr/local/gump/public/workspace/ant/dist/lib/ant-jmf.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-swing.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-apache-resolver.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-trax.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-junit.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-launcher.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-nodeps.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant.jar:/usr/local/gump/packages/junit3.8.1/junit.jar:/usr/local/gump/public/workspace/xml-commons/java/build/resolver.jar:/usr/local/gump/packages/je-1.7.1/lib/je.jar:/usr/local/gump/public/workspace/jakarta-commons/digester/dist/commons-digester.jar:/usr/local/gump/public/workspace/jakarta-regexp/build/jakarta-regexp-03062007.jar:/usr/local/gump/packages/javacc-3.1/bin/lib/javacc.jar:/usr/local/gump/public/workspace/jline/target/jline-0.9.92-SNAPSHOT.jar:/usr/local/gump/packages/jtidy-04aug2000r7-dev/build/Tidy.jar:/usr/local/gump/public/workspace/junit/dist/junit-03062007.jar:/usr/local/gump/public/workspace/xml-commons/java/external/build/xml-apis-ext.jar
-
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac] DatabaseEntry key = new DatabaseEntry(new byte[0]);
[javac] ^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:171:
 cannot find symbol
[javac] symbol  : class DatabaseEntry
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac] DatabaseEntry data = new DatabaseEntry((byte[]) 
null);
[javac] ^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:171:
 cannot find symbol
[javac] symbol  : class DatabaseEntry
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac] DatabaseEntry data = new DatabaseEntry((byte[]) 
null);
[javac]  ^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:178:
 cannot find symbol
[javac] symbol  : variable DbConstants
[javac] location: class org.apache.lucene.store.db.DbDirectory
[javac]DbConstants.DB_SET_RANGE | flags) != 
DbConstants.DB_NOTFOUND)
[javac]^
[javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:178:
 cannot find symbol
[javac] symbol  : variable DbConstants
[javac] location: class org.apache.lucene.store.db.DbDirectory

Re: [EMAIL PROTECTED]: Project lucene-java (in module lucene-java) failed

2007-06-03 Thread Chris Hostetter

Greetings Gumpers,

The Lucene-Java gump build seems to been failing consistently for the past
few days.  The problem seems to relate to some cleanup we did recently to
our build files to help ensure that the various contrib sub projects
could be cleanly built/tested on their own, or as part of hte master
build.

currently gump is reporting some javac errors when compiling the
contrib/db/bdb subproject relating to packages and symbols which can't
be found -- these are defined in the db-4.3.29.jar
 which is automaticly downloaded in the get-db-jar task executed just
prior to compilation.

This seems to work fine for develoeprs locally, as well as for our own
nightly build system running on Hudson...

...If someone could take a look at the GUMP setup for Lucene-Java and
either:
   a) fix whatever problem it is currently having
or b) let us know if there is something wrong with out build process that
  we're not noticing becuase we are too close to the problem.

...we would greatly appreciate it.


-Hoss

: Date: Sun, 03 Jun 2007 03:20:44 PDT
: Subject: [EMAIL PROTECTED]: Project lucene-java (in module lucene-java) failed
:
: To whom it may engage...
:
: This is an automated request, but not an unsolicited one. For
: more information please visit http://gump.apache.org/nagged.html,
: and/or contact the folk at [EMAIL PROTECTED]
:
: Project lucene-java has an issue affecting its community integration.
: This issue affects 4 projects,
:  and has been outstanding for 6 runs.
: The current state of this project is 'Failed', with reason 'Build Failed'.
: For reference only, the following projects are affected by this:
: - eyebrowse :  Web-based mail archive browsing
: - jakarta-lucene :  Java Based Search Engine
: - jakarta-slide :  Content Management System based on WebDAV technology
: - lucene-java :  Java Based Search Engine
:
:
: Full details are available at:
: http://vmgump.apache.org/gump/public/lucene-java/lucene-java/index.html
:
: That said, some information snippets are provided here.
:
: The following annotations (debug/informational/warning/error messages) were 
provided:
:  -DEBUG- Sole output [lucene-core-03062007.jar] identifier set to project name
:  -DEBUG- Dependency on javacc exists, no need to add for property javacc.home.
:  -INFO- Failed with reason build failed
:  -INFO- Failed to extract fallback artifacts from Gump Repository
:
:
:
: The following work was performed:
: 
http://vmgump.apache.org/gump/public/lucene-java/lucene-java/gump_work/build_lucene-java_lucene-java.html
: Work Name: build_lucene-java_lucene-java (Type: Build)
: Work ended in a state of : Failed
: Elapsed: 20 secs
: Command Line: /opt/jdk1.5/bin/java -Djava.awt.headless=true 
-Xbootclasspath/p:/usr/local/gump/public/workspace/xml-commons/java/external/build/xml-apis.jar:/usr/local/gump/public/workspace/xml-xerces2/build/xercesImpl.jar
 org.apache.tools.ant.Main -Dgump.merge=/x1/gump/public/gump/work/merge.xml 
-Dbuild.sysclasspath=only -Dversion=03062007 
-Djavacc.home=/usr/local/gump/packages/javacc-3.1 package
: [Working Directory: /usr/local/gump/public/workspace/lucene-java]
: CLASSPATH: 
/opt/jdk1.5/lib/tools.jar:/usr/local/gump/public/workspace/lucene-java/build/classes/java:/usr/local/gump/public/workspace/lucene-java/build/classes/demo:/usr/local/gump/public/workspace/lucene-java/build/classes/test:/usr/local/gump/public/workspace/ant/dist/lib/ant-jmf.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-swing.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-apache-resolver.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-trax.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-junit.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-launcher.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant-nodeps.jar:/usr/local/gump/public/workspace/ant/dist/lib/ant.jar:/usr/local/gump/packages/junit3.8.1/junit.jar:/usr/local/gump/public/workspace/xml-commons/java/build/resolver.jar:/usr/local/gump/packages/je-1.7.1/lib/je.jar:/usr/local/gump/public/workspace/jakarta-commons/digester/dist/commons-digester.jar:/usr/local/gump/public/workspa
 ce
:  
/jakarta-regexp/build/jakarta-regexp-03062007.jar:/usr/local/gump/packages/javacc-3.1/bin/lib/javacc.jar:/usr/local/gump/public/workspace/jline/target/jline-0.9.92-SNAPSHOT.jar:/usr/local/gump/packages/jtidy-04aug2000r7-dev/build/Tidy.jar:/usr/local/gump/public/workspace/junit/dist/junit-03062007.jar:/usr/local/gump/public/workspace/xml-commons/java/external/build/xml-apis-ext.jar
: -
: [javac] location: class org.apache.lucene.store.db.DbDirectory
: [javac] DatabaseEntry key = new DatabaseEntry(new 
byte[0]);
: [javac] ^
: [javac] 
/x1/gump/public/workspace/lucene-java/contrib/db/bdb/src/java/org/apache/lucene/store/db/DbDirectory.java:171:
 cannot find symbol
: [javac] symbol  : class DatabaseEntry
: [javac] 

[jira] Updated: (LUCENE-894) Custom build.xml for binary distributions

2007-06-03 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated LUCENE-894:


Attachment: lucene-894.patch

Michael: this approach seems great.  I couldn't help but make a few tweaks...

1) used a filterset to do the token replacements as part of hte copy (isntead 
of after the fact)
2) since the demo build file is now just a template (and can't be run until 
it's pacakged) it now seems more like source then part of the build system, 
so i moved it into src/demo (which has the added benefit of keeping the root 
dir a little cleaner)

 Custom build.xml for binary distributions
 -

 Key: LUCENE-894
 URL: https://issues.apache.org/jira/browse/LUCENE-894
 Project: Lucene - Java
  Issue Type: Bug
  Components: Build
Affects Versions: 2.1
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-894.patch, lucene-894.patch, lucene-894.patch, 
 lucene-894.patch


 The binary files of a distribution come with the demo sources
 and a build.xml file. However, the build.xml doesn't work for
 the binary distribution, so it can't be used to build the 
 demos.
 This problem was notices the first time when release 2.1 was
 made. Before we ship 2.2 we should fix this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-894) Custom build.xml for binary distributions

2007-06-03 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated LUCENE-894:


Attachment: lucene-894.patch

whoops ... forgot to svn add src/demo/demo-build.template after i moved it .. 
it has some minor changes because of how i setup the filterset 

 Custom build.xml for binary distributions
 -

 Key: LUCENE-894
 URL: https://issues.apache.org/jira/browse/LUCENE-894
 Project: Lucene - Java
  Issue Type: Bug
  Components: Build
Affects Versions: 2.1
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-894.patch, lucene-894.patch, lucene-894.patch, 
 lucene-894.patch, lucene-894.patch


 The binary files of a distribution come with the demo sources
 and a build.xml file. However, the build.xml doesn't work for
 the binary distribution, so it can't be used to build the 
 demos.
 This problem was notices the first time when release 2.1 was
 made. Before we ship 2.2 we should fix this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-903) FilteredQuery explanation inaccuracy with boost

2007-06-03 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501067
 ] 

Hoss Man commented on LUCENE-903:
-

I have some reservations about this patch.

I think it's fine to set as a goal that all core query classes should have 
Explanations where the description accurately describes a mathematical 
calculation that can be performed on the details to arrive at the value of the 
Explanation -- which is easy to do since we have the luxury of changing the 
CHeckHits class to suit our needs anytime we add a new class that has a more 
interesting mathematical calculation then we can current account for.

But we should also try to maintain CheckHIts as a reusable class that clients 
can use to run basic sanity tests on their own custom Query classes, and 
holding them to the same standard (when they can't easily modify the string 
pattern rules in CheckHits) doesn't seem fair.

lemme try to refactor the current patch a bit so that the deep Explanation 
testing is optional (and used by the core tests)

 FilteredQuery explanation inaccuracy with boost
 ---

 Key: LUCENE-903
 URL: https://issues.apache.org/jira/browse/LUCENE-903
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Affects Versions: 2.2
Reporter: Doron Cohen
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-903.patch


 The value of explanation is different than the product of its part if boost  
 1.
 This is exposed after tightening the explanation check (part of LUCENE-446).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-903) FilteredQuery explanation inaccuracy with boost

2007-06-03 Thread Doron Cohen (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501074
 ] 

Doron Cohen commented on LUCENE-903:


 holding them to the same standard (when they can't easily modify 
 the string pattern rules in CheckHits) doesn't seem fair. 

 lemme try to refactor the current patch a bit so that the deep 
 Explanation testing is optional (and used by the core tests) 

I agree (and pointed that) the deep explanation test is imperfect in this 
regard. But it seems to me that it is the benefit of new query writers to have 
their new queries explanations tested as thorough as possible. So I am not sure 
that disabling this test by default (for non core queries) is the right thing 
to do.

How do you feel about adding an escape way, allowing one to specify that a 
certain (part of an) explanation is not to be checked in this manner?

I see two immediate ways to add this: 
- by the explanation text: if the description starts with other: the 
explanation is not subject to this deep check
- by api: adding a property to Explanation class - disabledDeepCheck - false by 
default - allows new unique queries to deliberately disable this deep test.


 FilteredQuery explanation inaccuracy with boost
 ---

 Key: LUCENE-903
 URL: https://issues.apache.org/jira/browse/LUCENE-903
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Affects Versions: 2.2
Reporter: Doron Cohen
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-903.patch


 The value of explanation is different than the product of its part if boost  
 1.
 This is exposed after tightening the explanation check (part of LUCENE-446).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-903) FilteredQuery explanation inaccuracy with boost

2007-06-03 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated LUCENE-903:


Attachment: lucene-903.patch

in general i'm not fond of the inspecting the string to determine whether the 
math is correct ... in an ideal world every query would have it's own unit test 
class and it would have a testExplanation method that would know exactly what 
structure (ie: how many sub-details) to expect from an artificially constructed 
query instance.  the current approach of having big explanation test classes 
that validate an explanation by comparing them to the scores is a fairly big 
hack (and i say that as the guy that added it).

But i'm okay with the string comparisons as a Convenience to make it easier 
for us to test the explanations of the core queries (since so many of them 
follow very clear patterns) ... but i think we should avoid having any sort of 
expectation on the string for custom queries (either that they match product 
of or that they match other:)

my main concern is purely with giving people a way to do the same kind of 
Explanation testing they could do before (that the root value equals the score) 
even if the details of their explanation don't fit any of the patterns of the 
existing core queries.  (i'd hate for people to not even do shallow testing of 
their Explanations just because we don't give them a method for doing so)

this refactoring...
 
- makes verifyExplanations a public method in CheckHits that anyone can call
- adds a deep boolean option to verifyExplanations
- adds variants of CheckHits.checkExplanations and the ExplanationAsserter to 
make the deep testing optional
- makes QueryUtils.checkExplanations use the deep option
- makes TestExplanations explicitly use deep testing

...at the moment, the old APIs default to deep==false but i'm on board with 
changing those to true if you want to (in theory it's a non-backawards 
compatible API change, but since it's in src/test we can probably get away with 
it).


 FilteredQuery explanation inaccuracy with boost
 ---

 Key: LUCENE-903
 URL: https://issues.apache.org/jira/browse/LUCENE-903
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Affects Versions: 2.2
Reporter: Doron Cohen
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-903.patch, lucene-903.patch


 The value of explanation is different than the product of its part if boost  
 1.
 This is exposed after tightening the explanation check (part of LUCENE-446).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-902) Check on PositionIncrement with StopFilter.

2007-06-03 Thread Toru Matsuzawa (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501078
 ] 

Toru Matsuzawa commented on LUCENE-902:
---

My patch was consideration shortage as shown in the point. 
However, I think that processing positionIncrements with StopFilter is 
necessary. 

 Check on PositionIncrement  with StopFilter.
 

 Key: LUCENE-902
 URL: https://issues.apache.org/jira/browse/LUCENE-902
 Project: Lucene - Java
  Issue Type: Bug
  Components: Analysis
Affects Versions: 2.2
Reporter: Toru Matsuzawa
 Attachments: stopfilter.patch


 PositionIncrement set with Tokenizer is not considered with StopFilter. 
 When PositionIncrement of Token is 1, it is deleted by StopFilter. However, 
 when PositionIncrement of Token following afterwards is 0, it is not deleted. 
 I think that it is necessary to be deleted. Because it is thought same Token 
 when PositionIncrement is 0.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-622) Provide More of Lucene For Maven

2007-06-03 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501079
 ] 

Hoss Man commented on LUCENE-622:
-

i just skimmed Sami's patch (did not test it) ... i can't help but think that 
the enumerated list of all contribs in the generate-maven-artifacts task 
seems like it will easily become just as hard to maintain as the related list 
in the javadocs macro.  it seems like a more maintainable approach would be 
to...
  1) put the maven files for each contrib in the respective contrib dirs.
  2) put this new m2-deploy macro in common-build.xml
  3) add a dist-maven task to contrib/contrib-build.xml where it can be 
inherited by each contrib
  and calls m2-deploy using hte project name as the artifactId
  4) change generate-maven-artifacts to use contrib-crawl to take care of the 
pom's for the contribs.
 
some other questions: 
  a) it doesn't seem like all contribs are accounted for ... were some excluded 
intentionally?
  b) if we're going to start officially releasing POMs for lucene and the 
contribs, we should probably have a way of validating they are correct ... is 
there an easy mechanism for doing that?  (my worry is that overtime a contrib 
changes, it's POM doesn't get updated, and we have a release in which the 
maven artifacts are incorrect (which in my opinion is worse then not having any 
maven artifacts at all)

final minor note: using copy immediately followed by a replace on the file 
that was just copied can be done more cleanly using a filterset on the copy




 Provide More of Lucene For Maven
 

 Key: LUCENE-622
 URL: https://issues.apache.org/jira/browse/LUCENE-622
 Project: Lucene - Java
  Issue Type: Task
Affects Versions: 2.0.0
Reporter: Stephen Duncan Jr
Assignee: Michael Busch
 Fix For: 2.2

 Attachments: lucene-622.txt, lucene-core.pom, 
 lucene-highlighter-2.0.0.pom, lucene-maven.patch, lucene-maven.tar.bz2


 Please provide javadoc  source jars for lucene-core.  Also, please provide 
 the rest of lucene (the jars inside of contrib in the download bundle) if 
 possible.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-902) Check on PositionIncrement with StopFilter.

2007-06-03 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501080
 ] 

Hoss Man commented on LUCENE-902:
-

without a unit test demonstrating an actual problem, i'm having a hard time 
udnerstanding what exactly the bug is in this issue.

from what i can tell based on the comments and my reading of the patch, Toru is 
concerned about cumulative positionIncrements of tokens being lost when one of 
those tokens is a stop word.  (ie: if indexing multiple names of movies in a 
Document about an actor, and using a positionIncriment of 10 between each 
Field value (ie: movie name), indexing the values Dirty Harry and The Good 
the bad and the Ugly could result in no gap between the tokens harry and 
good since the is a stop word.

is my understanding of the problem correct?

if so, then i'm not sure how this patch really addresses the problem ... 
besides the fact that it treats 1 as a special case (the problem can come up 
with any positionIncrement) it doesn't seem to make any allowance for the 
situation where multiple stop words appear in sequence.

i'm also not clear on why non stop words immediately following stop words (ie: 
the else if(flag) case) are not returned unless their positionIncriment is 1.





 Check on PositionIncrement  with StopFilter.
 

 Key: LUCENE-902
 URL: https://issues.apache.org/jira/browse/LUCENE-902
 Project: Lucene - Java
  Issue Type: Bug
  Components: Analysis
Affects Versions: 2.2
Reporter: Toru Matsuzawa
 Attachments: stopfilter.patch


 PositionIncrement set with Tokenizer is not considered with StopFilter. 
 When PositionIncrement of Token is 1, it is deleted by StopFilter. However, 
 when PositionIncrement of Token following afterwards is 0, it is not deleted. 
 I think that it is necessary to be deleted. Because it is thought same Token 
 when PositionIncrement is 0.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-894) Custom build.xml for binary distributions

2007-06-03 Thread Michael Busch (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501081
 ] 

Michael Busch commented on LUCENE-894:
--

 I couldn't help but make a few tweaks...

Thanks for reviewing! 

 1) used a filterset to do the token replacements as part of hte copy 
 (isntead of after the fact)

Neat!

 2) since the demo build file is now just a template (and can't be run 
 until it's pacakged) it now seems more like source then part of the 
 build system, so i moved it into src/demo (which has the added benefit 
 of keeping the root dir a little cleaner)

Good! Yes I agree that src/demo is a better place for the template file.

 Custom build.xml for binary distributions
 -

 Key: LUCENE-894
 URL: https://issues.apache.org/jira/browse/LUCENE-894
 Project: Lucene - Java
  Issue Type: Bug
  Components: Build
Affects Versions: 2.1
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-894.patch, lucene-894.patch, lucene-894.patch, 
 lucene-894.patch, lucene-894.patch


 The binary files of a distribution come with the demo sources
 and a build.xml file. However, the build.xml doesn't work for
 the binary distribution, so it can't be used to build the 
 demos.
 This problem was notices the first time when release 2.1 was
 made. Before we ship 2.2 we should fix this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-446) search.function - (1) score based on field value, (2) simple score customizability

2007-06-03 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501082
 ] 

Hoss Man commented on LUCENE-446:
-

Doron: I haven't really been able to keep up with the way this issue has 
evolved, or dig into your new patches, but to answer your question about the 
Ord functions: yes they are very useful, and it active use in Solr.  I believe 
the warning about MultiSearcher mainly has to do with the fact that the 
MultiSearcher/FieldCache APIs give us know way to know the lowest of 
highest value in a field cache across an entire logical index, so the Ord 
functions can't really be queried against a MultiSearcher.

 search.function - (1) score based on field value, (2) simple score 
 customizability
 --

 Key: LUCENE-446
 URL: https://issues.apache.org/jira/browse/LUCENE-446
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Search
Reporter: Yonik Seeley
Assignee: Doron Cohen
Priority: Minor
 Fix For: 2.2

 Attachments: function.patch.txt, function.patch.txt, function.zip, 
 function.zip


 FunctionQuery can return a score based on a field's value or on it's ordinal 
 value.
 FunctionFactory subclasses define the details of the function.  There is 
 currently a LinearFloatFunction (a line specified by slope and intercept).
 Field values are typically obtained from FieldValueSourceFactory.  
 Implementations include FloatFieldSource, IntFieldSource, and OrdFieldSource.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-902) Check on PositionIncrement with StopFilter.

2007-06-03 Thread Toru Matsuzawa (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501085
 ] 

Toru Matsuzawa commented on LUCENE-902:
---

for stop word only is.
sample words A is B.

For instance,When Tokenizer on StopFilter returns the following as a result.
 termText  positionIncrement
  A1
  is   1
  are  0
  be   0
  B1

The result of StopFilter.
 termText  positionIncrement
  A1
  are  0
  be   0
  B1

A and are and be become the same positions.

When thinking that it will process the result of a Japanese morphological 
analysis with StopFilter, it becomes a problem.

 Check on PositionIncrement  with StopFilter.
 

 Key: LUCENE-902
 URL: https://issues.apache.org/jira/browse/LUCENE-902
 Project: Lucene - Java
  Issue Type: Bug
  Components: Analysis
Affects Versions: 2.2
Reporter: Toru Matsuzawa
 Attachments: stopfilter.patch


 PositionIncrement set with Tokenizer is not considered with StopFilter. 
 When PositionIncrement of Token is 1, it is deleted by StopFilter. However, 
 when PositionIncrement of Token following afterwards is 0, it is not deleted. 
 I think that it is necessary to be deleted. Because it is thought same Token 
 when PositionIncrement is 0.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

2007-06-03 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated LUCENE-831:


Attachment: fieldcache-overhaul.diff

i haven't had any time to do further work on this issue ... partly because i 
haven't had a lot of time, but mainly because i'm hoping to get some feedback 
on the overall approach before any more serious effort investment.

updated patch to work against the trunk (r544035)

 Complete overhaul of FieldCache API/Implementation
 --

 Key: LUCENE-831
 URL: https://issues.apache.org/jira/browse/LUCENE-831
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Search
Reporter: Hoss Man
 Attachments: fieldcache-overhaul.diff, fieldcache-overhaul.diff


 Motivation:
 1) Complete overhaul the API/implementation of FieldCache type things...
 a) eliminate global static map keyed on IndexReader (thus
 eliminating synch block between completley independent IndexReaders)
 b) allow more customization of cache management (ie: use 
 expiration/replacement strategies, disk backed caches, etc)
 c) allow people to define custom cache data logic (ie: custom
 parsers, complex datatypes, etc... anything tied to a reader)
 d) allow people to inspect what's in a cache (list of CacheKeys) for
 an IndexReader so a new IndexReader can be likewise warmed. 
 e) Lend support for smarter cache management if/when
 IndexReader.reopen is added (merging of cached data from subReaders).
 2) Provide backwards compatibility to support existing FieldCache API with
 the new implementation, so there is no redundent caching as client code
 migrades to new API.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Lucene 2.2 soon?

2007-06-03 Thread Chris Hostetter

: The part I am not sure of in this regard are my changes
: to FieldCache and FieldcacheImpl, while LUCENE-831 is
: ongoing too. (though btw 831 doesn't apply cleanly on
: current trunk). (It is my plan to get into LUCENE-831
: but I haven't got to it yet.)

1) i updated 831 to work on the trunk
2) the spirt of 831 is to change the underlying impl but still be
backwards compatible with the FieldCache API ... i skimmed the FieldCache
API changes in the latest patch on 446, and they seem to just be adding
new get methods (getShort, and getBytes) on the existing underlying impl
.. that should be easily supported by the stuff in 831 (and easy to
convert/migrate the same way i did the getInt/getLong/getString methods)



-Hoss


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Resolved: (LUCENE-894) Custom build.xml for binary distributions

2007-06-03 Thread Michael Busch (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Busch resolved LUCENE-894.
--

Resolution: Fixed

Works great, Hoss! Thanks again. I just committed this.

 Custom build.xml for binary distributions
 -

 Key: LUCENE-894
 URL: https://issues.apache.org/jira/browse/LUCENE-894
 Project: Lucene - Java
  Issue Type: Bug
  Components: Build
Affects Versions: 2.1
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Minor
 Fix For: 2.2

 Attachments: lucene-894.patch, lucene-894.patch, lucene-894.patch, 
 lucene-894.patch, lucene-894.patch


 The binary files of a distribution come with the demo sources
 and a build.xml file. However, the build.xml doesn't work for
 the binary distribution, so it can't be used to build the 
 demos.
 This problem was notices the first time when release 2.1 was
 made. Before we ship 2.2 we should fix this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Created: (LUCENE-904) Calculate MD5 checksums in target dist-all

2007-06-03 Thread Michael Busch (JIRA)
Calculate MD5 checksums in target dist-all


 Key: LUCENE-904
 URL: https://issues.apache.org/jira/browse/LUCENE-904
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Build
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Trivial
 Fix For: 2.2


Trivial patch that extends the ant target dist-all to calculate
the MD5 checksums for the dist files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Updated: (LUCENE-904) Calculate MD5 checksums in target dist-all

2007-06-03 Thread Michael Busch (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Busch updated LUCENE-904:
-

Attachment: lucene-904.patch

 Calculate MD5 checksums in target dist-all
 

 Key: LUCENE-904
 URL: https://issues.apache.org/jira/browse/LUCENE-904
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Build
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Trivial
 Fix For: 2.2

 Attachments: lucene-904.patch


 Trivial patch that extends the ant target dist-all to calculate
 the MD5 checksums for the dist files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



[jira] Commented: (LUCENE-904) Calculate MD5 checksums in target dist-all

2007-06-03 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501097
 ] 

Hoss Man commented on LUCENE-904:
-

FWIW ... i put something like this in the solr build.xml a while back, but 
discovered the format ant's checksum task uses by default is really trivial 
and doesn't work when people use md5sum -c to try and check the sums.  
(checksum has a format attribute now, but that wasn't added until ~Dec2006, 
so I think it requires ant 1.7)

the way we solved the problem was with the solr-checksum macro added to our 
build.xml in this commit...
http://svn.apache.org/viewvc/lucene/solr/trunk/build.xml?r1=483882r2=484780

background...
http://www.nabble.com/%22correct%22-format-for-the-md5-files--tf2779495.html#a7754724

 Calculate MD5 checksums in target dist-all
 

 Key: LUCENE-904
 URL: https://issues.apache.org/jira/browse/LUCENE-904
 Project: Lucene - Java
  Issue Type: Improvement
  Components: Build
Reporter: Michael Busch
Assignee: Michael Busch
Priority: Trivial
 Fix For: 2.2

 Attachments: lucene-904.patch


 Trivial patch that extends the ant target dist-all to calculate
 the MD5 checksums for the dist files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]