[jira] Updated: (SOLR-1803) ExtractingRequestHandler does not propagate multiple values to a multi-valued field

2010-03-02 Thread Lance Norskog (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lance Norskog updated SOLR-1803:


Attachment: display-extracting-bug.patch

This patch illustrates the problem. Line 90 should succeed but does not.

> ExtractingRequestHandler does not propagate multiple values to a multi-valued 
> field
> ---
>
> Key: SOLR-1803
> URL: https://issues.apache.org/jira/browse/SOLR-1803
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lance Norskog
>Priority: Minor
> Attachments: display-extracting-bug.patch
>
>
> When multiple values for one field are extracted from a document, only the 
> last value is stored in the document. If one or more values are given as 
> parameters, those values are all stored.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-1803) ExtractingRequestHandler does not propagate multiple values to a multi-valued field

2010-03-02 Thread Lance Norskog (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840472#action_12840472
 ] 

Lance Norskog commented on SOLR-1803:
-

The attached patch does not fix the bug. It causes the ExtractingRequestHandler 
test program to illustrated the bug.  Line 90 in 
TestExtractingRequestHandler.java should succeed and does not.

[SOLR-1633|http://issues.apache.org/jira/browse/SOLR-1633] comments on a 
related behavior. This test patch also checks for that behavior. 

Since both this and 1633 are in the same area, work should be combined.

> ExtractingRequestHandler does not propagate multiple values to a multi-valued 
> field
> ---
>
> Key: SOLR-1803
> URL: https://issues.apache.org/jira/browse/SOLR-1803
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lance Norskog
>Priority: Minor
>
> When multiple values for one field are extracted from a document, only the 
> last value is stored in the document. If one or more values are given as 
> parameters, those values are all stored.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-1803) ExtractingRequestHandler does not propagate multiple values to a multi-valued field

2010-03-02 Thread Lance Norskog (JIRA)
ExtractingRequestHandler does not propagate multiple values to a multi-valued 
field
---

 Key: SOLR-1803
 URL: https://issues.apache.org/jira/browse/SOLR-1803
 Project: Solr
  Issue Type: Bug
  Components: contrib - Solr Cell (Tika extraction)
Reporter: Lance Norskog
Priority: Minor


When multiple values for one field are extracted from a document, only the last 
value is stored in the document. If one or more values are given as parameters, 
those values are all stored.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-236) Field collapsing

2010-03-02 Thread Yao Ge (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840470#action_12840470
 ] 

Yao Ge commented on SOLR-236:
-

I just applied the latest patch to trunk and I don't quite understand how the 
"numFound" in the response list is computed. With rows=10&collapse.threshold=1, 
I got numFound=11, with rows=10&collapse.threshold=2, I got numFound=22.
I both cases the actual doc in the list is 10. Why is the numFound reported 
this way?

> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: 1.5
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, field-collapse-3.patch, 
> field-collapse-4-with-solrj.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> quasidistributed.additional.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, solr-236.patch, 
> SOLR-236_collapsing.patch, SOLR-236_collapsing.patch
>
>
> This patch include a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation add 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling correction are welcome ;-)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-1724) Real Basic Core Management with Zookeeper

2010-03-02 Thread Jason Rutherglen (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Rutherglen updated SOLR-1724:
---

Attachment: SOLR-1724.patch

Fixed the unit tests that were failing due to the switch over to using 
CoreContainer's initZooKeeper method.  ZkNodeCoresManager is instantiated in 
CoreContainer.  

There's a beginning of a UI in zkcores.jsp

I think we still need a core move test.  I'm thinking of adding backing up a 
core as an action that may be performed in a new cores version file.  

> Real Basic Core Management with Zookeeper
> -
>
> Key: SOLR-1724
> URL: https://issues.apache.org/jira/browse/SOLR-1724
> Project: Solr
>  Issue Type: New Feature
>  Components: multicore
>Affects Versions: 1.4
>Reporter: Jason Rutherglen
> Fix For: 1.5
>
> Attachments: commons-lang-2.4.jar, gson-1.4.jar, 
> hadoop-0.20.2-dev-core.jar, hadoop-0.20.2-dev-test.jar, SOLR-1724.patch, 
> SOLR-1724.patch, SOLR-1724.patch, SOLR-1724.patch, SOLR-1724.patch, 
> SOLR-1724.patch, SOLR-1724.patch, SOLR-1724.patch, SOLR-1724.patch, 
> SOLR-1724.patch, SOLR-1724.patch
>
>
> Though we're implementing cloud, I need something real soon I can
> play with and deploy. So this'll be a patch that only deploys
> new cores, and that's about it. The arch is real simple:
> On Zookeeper there'll be a directory that contains files that
> represent the state of the cores of a given set of servers which
> will look like the following:
> /production/cores-1.txt
> /production/cores-2.txt
> /production/core-host-1-actual.txt (ephemeral node per host)
> Where each core-N.txt file contains:
> hostname,corename,instanceDir,coredownloadpath
> coredownloadpath is a URL such as file://, http://, hftp://, hdfs://, ftp://, 
> etc
> and
> core-host-actual.txt contains:
> hostname,corename,instanceDir,size
> Everytime a new core-N.txt file is added, the listening host
> finds it's entry in the list and begins the process of trying to
> match the entries. Upon completion, it updates it's
> /core-host-1-actual.txt file to it's completed state or logs an error.
> When all host actual files are written (without errors), then a
> new core-1-actual.txt file is written which can be picked up by
> another process that can create a new core proxy.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-1750) SystemInfoRequestHandler - replacement for stats.jsp and registry.jsp

2010-03-02 Thread Erik Hatcher (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840208#action_12840208
 ] 

Erik Hatcher commented on SOLR-1750:


Thanks Hoss for committing!

naming: I'm fine with how it is, but fine if the name changes too and +1 to 
adding default registration



> SystemInfoRequestHandler - replacement for stats.jsp and registry.jsp
> -
>
> Key: SOLR-1750
> URL: https://issues.apache.org/jira/browse/SOLR-1750
> Project: Solr
>  Issue Type: Improvement
>  Components: web gui
>Reporter: Erik Hatcher
>Assignee: Erik Hatcher
>Priority: Trivial
> Fix For: 1.5
>
> Attachments: SystemStatsRequestHandler.java, 
> SystemStatsRequestHandler.java, SystemStatsRequestHandler.java
>
>
> stats.jsp is cool and all, but suffers from escaping issues, and also is not 
> accessible from SolrJ or other standard Solr APIs.
> Here's a request handler that emits everything stats.jsp does.
> For now, it needs to be registered in solrconfig.xml like this:
> {code}
>  class="solr.SystemStatsRequestHandler" />
> {code}
> But will register this in AdminHandlers automatically before committing.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-1743) error reporting is rendering "404 missing core name in path" for all type of errors

2010-03-02 Thread Mark Miller (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840177#action_12840177
 ] 

Mark Miller commented on SOLR-1743:
---

Hoss - I've got to digest and respond to this more fully later, but:

Yes, I def agree the best solution is something entirely different. I agree 
that the current, historical based stuff is just not right for multicore. What 
I've tried to do is just hack things back closer to what they were, but a real 
solution would be to start from scratch and handle things better -

Sounds like you have an idea for that, and I'll dig into your explanation when 
I get a chance and perhaps off some feedback.

> error reporting is rendering "404 missing core name in path" for all type of 
> errors
> ---
>
> Key: SOLR-1743
> URL: https://issues.apache.org/jira/browse/SOLR-1743
> Project: Solr
>  Issue Type: Bug
>  Components: Build
> Environment: all
>Reporter: Marcin
>Assignee: Mark Miller
> Fix For: 1.5
>
> Attachments: SOLR-1743.patch, SOLR-1743.patch, SOLR-1743.patch, 
> SOLR-1743.patch, SOLR-1743.patch
>
>
> despite the error in schema syntax or any other type of error you will always 
> get:
> "404 missing core name in path" communicate.
> cheers,
> /Marcin

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.