[jira] Created: (SOLR-763) Support for LUCENE-1380 :: ShingleFilter.coterminalPositionIncrement

2008-09-10 Thread Michael Semb Wever (JIRA)
Support for LUCENE-1380 :: ShingleFilter.coterminalPositionIncrement


 Key: SOLR-763
 URL: https://issues.apache.org/jira/browse/SOLR-763
 Project: Solr
  Issue Type: Improvement
Reporter: Michael Semb Wever
 Fix For: 1.4


Patch so that ShingleFilterFactory can deserialise the 
coterminalPositionIncrement property.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-763) Support for LUCENE-1380 :: ShingleFilter.coterminalPositionIncrement

2008-09-10 Thread Michael Semb Wever (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Semb Wever updated SOLR-763:


Attachment: SOLR-763.patch

Patch to support new property coterminalPositionIncrement that appears in 
LUCENE-1380

 Support for LUCENE-1380 :: ShingleFilter.coterminalPositionIncrement
 

 Key: SOLR-763
 URL: https://issues.apache.org/jira/browse/SOLR-763
 Project: Solr
  Issue Type: Improvement
Reporter: Michael Semb Wever
 Fix For: 1.4

 Attachments: SOLR-763.patch


 Patch so that ShingleFilterFactory can deserialise the 
 coterminalPositionIncrement property.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



log number of hits returned

2008-09-10 Thread Yonik Seeley
I just noticed that the hits parameter that gets logged seems to be
returning the number of documents actually returned, not the number of
documents that matched the query + filters.  Should it?

num docs returned can be derived from num docs matched and
offset,rows, but not vice-versa.

-Yonik


Re: log number of hits returned

2008-09-10 Thread Erik Hatcher
Ouch yes, it definitely should log the total number hits, not the  
number of documents returned.


Erik

On Sep 10, 2008, at 12:19 PM, Yonik Seeley wrote:


I just noticed that the hits parameter that gets logged seems to be
returning the number of documents actually returned, not the number of
documents that matched the query + filters.  Should it?

num docs returned can be derived from num docs matched and
offset,rows, but not vice-versa.

-Yonik




[jira] Created: (SOLR-764) facet.limit=-1 with facet.sort=false does not work in distributed search

2008-09-10 Thread Wojtek Piaseczny (JIRA)
facet.limit=-1 with facet.sort=false does not work in distributed search


 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4


If you specify facet.limit=-1 and facet.sort=false, no facet results are 
returned. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-764) facet.limit=-1 with facet.sort=false does not work in distributed search

2008-09-10 Thread Amit Nithian (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629874#action_12629874
 ] 

Amit Nithian commented on SOLR-764:
---

Are you purposely trying to disable facet results? Why not simply pass in 
facet=false parameter?

 facet.limit=-1 with facet.sort=false does not work in distributed search
 

 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4


 If you specify facet.limit=-1 and facet.sort=false, no facet results are 
 returned. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: 1.3.0 Candidate

2008-09-10 Thread Grant Ingersoll
So, how we doing on the Woodstox stuff?  Have people had a chance to  
try it out on trunk?  If so, can we port to the 1.3 branch?


On Sep 9, 2008, at 9:19 AM, Grant Ingersoll wrote:



On Sep 7, 2008, at 8:59 PM, Chris Hostetter wrote:


(Curiousity: Grant, is the hudson config for doing releases not  
working
out?  it seemed like a really sweet way to automate everything and  
i'm
just wondering if you found something that made it not  
worthwhile ... it

seems to be gone now)



FWIW, I added a prepare-release target to the build, that I think  
does most of the steps automatically.  Now, don't get me wrong, one  
should still check out the docs, etc.  It requires svn, forrest and  
gpg to be on the path.  If anyone is so inclined, they could put  
available checks in.  Would probably be useful to have other  
validation steps, too.








[jira] Commented: (SOLR-764) facet.limit=-1 with facet.sort=false does not work in distributed search

2008-09-10 Thread Lars Kotthoff (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629880#action_12629880
 ] 

Lars Kotthoff commented on SOLR-764:


Have you tried a version which incorporates SOLR-755? This should have been 
fixed in that issue.

 facet.limit=-1 with facet.sort=false does not work in distributed search
 

 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4


 If you specify facet.limit=-1 and facet.sort=false, no facet results are 
 returned. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-764) facet.limit=-1 with facet.sort=false does not work in distributed search

2008-09-10 Thread Wojtek Piaseczny (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629892#action_12629892
 ] 

Wojtek Piaseczny commented on SOLR-764:
---

I want to show dynamic ranges for numeric facets. My (and probably most??) 
implementation for building dynamic ranges requires the numbers to be sorted by 
facet name rather than by occurrences.

 facet.limit=-1 with facet.sort=false does not work in distributed search
 

 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4


 If you specify facet.limit=-1 and facet.sort=false, no facet results are 
 returned. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-764) Support facet.sort=false (index order) with distributed search

2008-09-10 Thread Yonik Seeley (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yonik Seeley updated SOLR-764:
--

Description: Distributed search does not currently support sorting facets 
by index order (facet.sort=false).  (was: If you specify facet.limit=-1 and 
facet.sort=false, no facet results are returned. )
Summary: Support facet.sort=false (index order) with distributed search 
 (was: facet.limit=-1 with facet.sort=false does not work in distributed search)

 Support facet.sort=false (index order) with distributed search
 --

 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4


 Distributed search does not currently support sorting facets by index order 
 (facet.sort=false).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-765) ant example fails if example/work directory doesn't exist

2008-09-10 Thread Lars Kotthoff (JIRA)
ant example fails if example/work directory doesn't exist
-

 Key: SOLR-765
 URL: https://issues.apache.org/jira/browse/SOLR-765
 Project: Solr
  Issue Type: Bug
Affects Versions: 1.3
Reporter: Lars Kotthoff
Priority: Minor


Running ant example when there's no example/work directory causes the build 
to fail because the task tries to delete the (non-existent) directory.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-765) ant example fails if example/work directory doesn't exist

2008-09-10 Thread Lars Kotthoff (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Kotthoff updated SOLR-765:
---

Attachment: SOLR-755.patch

Patch adding 'failonerror=false' to the delete in the example ant task.

 ant example fails if example/work directory doesn't exist
 -

 Key: SOLR-765
 URL: https://issues.apache.org/jira/browse/SOLR-765
 Project: Solr
  Issue Type: Bug
Affects Versions: 1.3
Reporter: Lars Kotthoff
Priority: Minor
 Attachments: SOLR-755.patch


 Running ant example when there's no example/work directory causes the build 
 to fail because the task tries to delete the (non-existent) directory.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-764) Support facet.sort=false (index order) with distributed search

2008-09-10 Thread Lars Kotthoff (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Kotthoff updated SOLR-764:
---

Attachment: SOLR-764.patch

Patch implementing support for facet.sort=false with distributed search.

Simplified the existing code by removing things that weren't used and added an 
additional method to sort facets by term number. Note that this implementation 
is flawed -- there are no distributed term numbers, therefore it's possible 
that several facet values have the same term number because they come from 
different shards. In that case the lexicographic order of the value will be 
used to break the tie. It is possible that facet values are returned in a 
different order for distributed and local setups.

The patch also adds a unit test, commented out because of the issues described 
above.

 Support facet.sort=false (index order) with distributed search
 --

 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4

 Attachments: SOLR-764.patch


 Distributed search does not currently support sorting facets by index order 
 (facet.sort=false).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-765) ant example fails if example/work directory doesn't exist

2008-09-10 Thread Lars Kotthoff (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Kotthoff updated SOLR-765:
---

Attachment: (was: SOLR-755.patch)

 ant example fails if example/work directory doesn't exist
 -

 Key: SOLR-765
 URL: https://issues.apache.org/jira/browse/SOLR-765
 Project: Solr
  Issue Type: Bug
Affects Versions: 1.3
Reporter: Lars Kotthoff
Priority: Minor
 Attachments: SOLR-765.patch


 Running ant example when there's no example/work directory causes the build 
 to fail because the task tries to delete the (non-existent) directory.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-765) ant example fails if example/work directory doesn't exist

2008-09-10 Thread Lars Kotthoff (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Kotthoff updated SOLR-765:
---

Attachment: SOLR-765.patch

I promise to clean up my solr-patch-mess and not attach the wrong patch again.

 ant example fails if example/work directory doesn't exist
 -

 Key: SOLR-765
 URL: https://issues.apache.org/jira/browse/SOLR-765
 Project: Solr
  Issue Type: Bug
Affects Versions: 1.3
Reporter: Lars Kotthoff
Priority: Minor
 Attachments: SOLR-765.patch


 Running ant example when there's no example/work directory causes the build 
 to fail because the task tries to delete the (non-existent) directory.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-764) Support facet.sort=false (index order) with distributed search

2008-09-10 Thread Wojtek Piaseczny (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629949#action_12629949
 ] 

Wojtek Piaseczny commented on SOLR-764:
---

I notice this patch consolidates the FacetInfo class' 'topFacets'  
'listFacets' into a single collection. Do you know why these were ever 
separate? I had guessed it was because when 'listFacets' was being populated 
(only when facet.sort=false), it was being populated with data that was already 
ordered correctly per shard, and that combining the data (while maintaining 
sort order) from each shard could be optimized beyond calling Arrays.sort(). 

Shouldn't the getTermSorted method always use name instead of term number? 

 Support facet.sort=false (index order) with distributed search
 --

 Key: SOLR-764
 URL: https://issues.apache.org/jira/browse/SOLR-764
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4

 Attachments: SOLR-764.patch


 Distributed search does not currently support sorting facets by index order 
 (facet.sort=false).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-765) ant example fails if example/work directory doesn't exist

2008-09-10 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629958#action_12629958
 ] 

Hoss Man commented on SOLR-765:
---

why don't you have an example/work directory?

it's actually committed to svn as an empty directory, and explicitly included 
as an empty directory in the releases.

(i'm completely surprised that the example works at all if you've manually 
deleted the work directory)

 ant example fails if example/work directory doesn't exist
 -

 Key: SOLR-765
 URL: https://issues.apache.org/jira/browse/SOLR-765
 Project: Solr
  Issue Type: Bug
Affects Versions: 1.3
Reporter: Lars Kotthoff
Priority: Minor
 Attachments: SOLR-765.patch


 Running ant example when there's no example/work directory causes the build 
 to fail because the task tries to delete the (non-existent) directory.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-216) Improvements to solr.py

2008-09-10 Thread Dariusz Suchojad (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629976#action_12629976
 ] 

Dariusz Suchojad commented on SOLR-216:
---

Hi Mike,

I've joined the solrpy (http://code.google.com/p/solrpy/) project where I'd
like to incorporate the changes I had made and to work on adding more
features to the Python client. I hope to get back to the discussion when,
like you said, it becomes more stable and popular.

 Improvements to solr.py
 ---

 Key: SOLR-216
 URL: https://issues.apache.org/jira/browse/SOLR-216
 Project: Solr
  Issue Type: Improvement
  Components: clients - python
Affects Versions: 1.2
Reporter: Jason Cater
Assignee: Mike Klaas
Priority: Trivial
 Attachments: solr-solrpy-r5.patch, solr.py, solr.py, solr.py, 
 solr.py, test_all.py


 I've taken the original solr.py code and extended it to include higher-level 
 functions.
   * Requires python 2.3+
   * Supports SSL (https://) schema
   * Conforms (mostly) to PEP 8 -- the Python Style Guide
   * Provides a high-level results object with implicit data type conversion
   * Supports batching of update commands

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-216) Improvements to solr.py

2008-09-10 Thread Mike Klaas (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12629981#action_12629981
 ] 

Mike Klaas commented on SOLR-216:
-

That's great!  Be sure to update http://wiki.apache.org/solr/SolPython as the 
project progresses.



 Improvements to solr.py
 ---

 Key: SOLR-216
 URL: https://issues.apache.org/jira/browse/SOLR-216
 Project: Solr
  Issue Type: Improvement
  Components: clients - python
Affects Versions: 1.2
Reporter: Jason Cater
Assignee: Mike Klaas
Priority: Trivial
 Attachments: solr-solrpy-r5.patch, solr.py, solr.py, solr.py, 
 solr.py, test_all.py


 I've taken the original solr.py code and extended it to include higher-level 
 functions.
   * Requires python 2.3+
   * Supports SSL (https://) schema
   * Conforms (mostly) to PEP 8 -- the Python Style Guide
   * Provides a high-level results object with implicit data type conversion
   * Supports batching of update commands

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-766) Remove python client from 1.3 distribution

2008-09-10 Thread Mike Klaas (JIRA)
Remove python client from 1.3 distribution
--

 Key: SOLR-766
 URL: https://issues.apache.org/jira/browse/SOLR-766
 Project: Solr
  Issue Type: Task
  Components: clients - python
Affects Versions: 1.3
Reporter: Mike Klaas
Assignee: Mike Klaas
Priority: Blocker
 Fix For: 1.3


see solr-dev thread:

http://mail-archives.apache.org/mod_mbox/lucene-solr-dev/200809.mbox/[EMAIL 
PROTECTED]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-766) Remove python client from 1.3 distribution

2008-09-10 Thread Mike Klaas (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12630004#action_12630004
 ] 

Mike Klaas commented on SOLR-766:
-

JIRA seems to be not allowing me to upload a patch.  Here is the text of the 
proposed README:

Note: As of version 1.3, Solr no longer comes bundled with a Python client.  
The existing client
was not sufficiently maintained or tested as development of Solr progressed, 
and committers
felt that the code was not up to our usual high standards of release.

The client bundled with previous versions of Solr will continue to be available 
indefinitely at:
http://svn.apache.org/viewvc/lucene/solr/tags/release-1.2.0/client/python/

Please see http://wiki.apache.org/solr/SolPython for information on third-party 
Solr python
clients.



 Remove python client from 1.3 distribution
 --

 Key: SOLR-766
 URL: https://issues.apache.org/jira/browse/SOLR-766
 Project: Solr
  Issue Type: Task
  Components: clients - python
Affects Versions: 1.3
Reporter: Mike Klaas
Assignee: Mike Klaas
Priority: Blocker
 Fix For: 1.3


 see solr-dev thread:
 http://mail-archives.apache.org/mod_mbox/lucene-solr-dev/200809.mbox/[EMAIL 
 PROTECTED]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-766) Remove python client from 1.3 distribution

2008-09-10 Thread Mike Klaas (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike Klaas updated SOLR-766:


Attachment: SOLR-766.patch

 Remove python client from 1.3 distribution
 --

 Key: SOLR-766
 URL: https://issues.apache.org/jira/browse/SOLR-766
 Project: Solr
  Issue Type: Task
  Components: clients - python
Affects Versions: 1.3
Reporter: Mike Klaas
Assignee: Mike Klaas
Priority: Blocker
 Fix For: 1.3

 Attachments: SOLR-766.patch


 see solr-dev thread:
 http://mail-archives.apache.org/mod_mbox/lucene-solr-dev/200809.mbox/[EMAIL 
 PROTECTED]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-767) logged hits should be num matching docs, not num returned

2008-09-10 Thread Yonik Seeley (JIRA)
logged hits should be num matching docs, not num returned
---

 Key: SOLR-767
 URL: https://issues.apache.org/jira/browse/SOLR-767
 Project: Solr
  Issue Type: Bug
Affects Versions: 1.3
Reporter: Yonik Seeley
Priority: Minor
 Fix For: 1.3




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-768) Set last_index_time in full-import command

2008-09-10 Thread Wojtek Piaseczny (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wojtek Piaseczny updated SOLR-768:
--

Attachment: SOLR-768.patch

Patch sets last index time before doing a full-import

 Set last_index_time in full-import command
 --

 Key: SOLR-768
 URL: https://issues.apache.org/jira/browse/SOLR-768
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4

 Attachments: SOLR-768.patch


 Variables are being resolved before executing the full-import command, but 
 the last_index_time is not set (not read from file).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-768) Set last_index_time in full-import command

2008-09-10 Thread Wojtek Piaseczny (JIRA)
Set last_index_time in full-import command
--

 Key: SOLR-768
 URL: https://issues.apache.org/jira/browse/SOLR-768
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4
 Attachments: SOLR-768.patch

Variables are being resolved before executing the full-import command, but the 
last_index_time is not set (not read from file).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-651) A SearchComponent for fetching TF-IDF values

2008-09-10 Thread Grant Ingersoll (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Ingersoll updated SOLR-651:
-

Attachment: SOLR-651.patch

Addresses Noble's thoughts.

 A SearchComponent for fetching TF-IDF values
 

 Key: SOLR-651
 URL: https://issues.apache.org/jira/browse/SOLR-651
 Project: Solr
  Issue Type: New Feature
Affects Versions: 1.3
Reporter: Noble Paul
Assignee: Grant Ingersoll
Priority: Minor
 Fix For: 1.4

 Attachments: SOLR-651.patch, SOLR-651.patch


 A SearchComponent that can return TF-IDF vector for any given document in the 
 SOLR index
 Query : A Document Number / a query identifying a Document
 Response :  A Map of term vs.TF-IDF value of every term in the Selected
 Document
 Why ?
 Most of the Machine Learning Algorithms work on TFIDF representation of
 documents, hence adding a Request Handler proving the TFIDF representation
 will pave the way for incorporating Learning Paradigms to SOLR framework.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (SOLR-769) Support Document and Search Result clustering

2008-09-10 Thread Grant Ingersoll (JIRA)
Support Document and Search Result clustering
-

 Key: SOLR-769
 URL: https://issues.apache.org/jira/browse/SOLR-769
 Project: Solr
  Issue Type: New Feature
Reporter: Grant Ingersoll
Assignee: Grant Ingersoll
Priority: Minor


Clustering is a useful tool for working with documents and search results, 
similar to the notion of dynamic faceting.  Carrot2 
(http://project.carrot2.org/) is a nice, BSD-licensed, library for doing search 
results clustering.  Mahout (http://lucene.apache.org/mahout) is well suited 
for whole-corpus clustering.  

The patch I lays out a contrib module that starts off w/ an integration of a 
SearchComponent for doing clustering and an implementation using Carrot.  In 
search results mode, it will use the DocList as the input for the cluster.   
While Carrot2 comes w/ a Solr input component, it is not the same as the 
SearchComponent that I have in that the Carrot example actually submits a query 
to Solr, whereas my SearchComponent is just chained into the Component list and 
uses the ResponseBuilder to add in the cluster results.

While not fully fleshed out yet, the collection based mode will take in a list 
of ids or just use the whole collection and will produce clusters.  Since this 
is a longer, typically offline task, there will need to be some type of storage 
mechanism (and replication??) for the clusters.  I _may_ push this off to a 
separate JIRA issue, but I at least want to present the use case as part of the 
design of this component/contrib.  It may even make sense that we split this 
out, such that the building piece is something like an UpdateProcessor and then 
the SearchComponent just acts as a lookup mechanism.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-768) Set last_index_time in full-import command

2008-09-10 Thread Noble Paul (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Noble Paul updated SOLR-768:


Attachment: SOLR-768.patch

makes sense. 
This value can be set irrespective of the the command. So it is moved to 
runCmd()

 Set last_index_time in full-import command
 --

 Key: SOLR-768
 URL: https://issues.apache.org/jira/browse/SOLR-768
 Project: Solr
  Issue Type: Improvement
  Components: contrib - DataImportHandler
Affects Versions: 1.3
Reporter: Wojtek Piaseczny
 Fix For: 1.4

 Attachments: SOLR-768.patch, SOLR-768.patch


 Variables are being resolved before executing the full-import command, but 
 the last_index_time is not set (not read from file).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.