[jira] [Commented] (SOLR-2155) Geospatial search using geohash prefixes

2013-03-01 Thread Sandeep Tucknat (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590810#comment-13590810
 ] 

Sandeep Tucknat commented on SOLR-2155:
---

Once again, thanks for the prompt response AND the information! It makes sense, 
you have a forward index of grid cells to documents, no geo comparisons 
required at run time. I won't be able to attend the conference but will 
definitely look forward to your presentation!

> Geospatial search using geohash prefixes
> 
>
> Key: SOLR-2155
> URL: https://issues.apache.org/jira/browse/SOLR-2155
> Project: Solr
>  Issue Type: Improvement
>Reporter: David Smiley
>Assignee: David Smiley
> Attachments: GeoHashPrefixFilter.patch, GeoHashPrefixFilter.patch, 
> GeoHashPrefixFilter.patch, Solr2155-1.0.2-project.zip, 
> Solr2155-1.0.3-project.zip, Solr2155-1.0.4-project.zip, 
> Solr2155-for-1.0.2-3.x-port.patch, 
> SOLR-2155_GeoHashPrefixFilter_with_sorting_no_poly.patch, SOLR.2155.p3.patch, 
> SOLR.2155.p3tests.patch
>
>
> {panel:title=NOTICE} The status of this issue is a plugin for Solr 3.x 
> located here: https://github.com/dsmiley/SOLR-2155.  Look at the introductory 
> readme and download the plugin .jar file.  Lucene 4's new spatial module is 
> largely based on this code.  The Solr 4 glue for it should come very soon but 
> as of this writing it's hosted temporarily at https://github.com/spatial4j.  
> For more information on using SOLR-2155 with Solr 3, see 
> http://wiki.apache.org/solr/SpatialSearch#SOLR-2155  This JIRA issue is 
> closed because it won't be committed in its current form.
> {panel}
> There currently isn't a solution in Solr for doing geospatial filtering on 
> documents that have a variable number of points.  This scenario occurs when 
> there is location extraction (i.e. via a "gazateer") occurring on free text.  
> None, one, or many geospatial locations might be extracted from any given 
> document and users want to limit their search results to those occurring in a 
> user-specified area.
> I've implemented this by furthering the GeoHash based work in Lucene/Solr 
> with a geohash prefix based filter.  A geohash refers to a lat-lon box on the 
> earth.  Each successive character added further subdivides the box into a 4x8 
> (or 8x4 depending on the even/odd length of the geohash) grid.  The first 
> step in this scheme is figuring out which geohash grid squares cover the 
> user's search query.  I've added various extra methods to GeoHashUtils (and 
> added tests) to assist in this purpose.  The next step is an actual Lucene 
> Filter, GeoHashPrefixFilter, that uses these geohash prefixes in 
> TermsEnum.seek() to skip to relevant grid squares in the index.  Once a 
> matching geohash grid is found, the points therein are compared against the 
> user's query to see if it matches.  I created an abstraction GeoShape 
> extended by subclasses named PointDistance... and CartesianBox to support 
> different queried shapes so that the filter need not care about these details.
> This work was presented at LuceneRevolution in Boston on October 8th.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2155) Geospatial search using geohash prefixes

2013-03-01 Thread Sandeep Tucknat (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590679#comment-13590679
 ] 

Sandeep Tucknat commented on SOLR-2155:
---

First of all, thanks for the prompt response! It feels good to see you are 
supporting the approach we took in the interim :) I was just thinking that in 
order to do the ranking, the filter has to go through all the values of the 
field and it shouldn't be hard to persist this information and return to the 
client. We'll be waiting for that optimization to come through! Many thanks! 
let me know if I can help in any way (3 weeks in spatial or solr).

> Geospatial search using geohash prefixes
> 
>
> Key: SOLR-2155
> URL: https://issues.apache.org/jira/browse/SOLR-2155
> Project: Solr
>  Issue Type: Improvement
>Reporter: David Smiley
>Assignee: David Smiley
> Attachments: GeoHashPrefixFilter.patch, GeoHashPrefixFilter.patch, 
> GeoHashPrefixFilter.patch, Solr2155-1.0.2-project.zip, 
> Solr2155-1.0.3-project.zip, Solr2155-1.0.4-project.zip, 
> Solr2155-for-1.0.2-3.x-port.patch, 
> SOLR-2155_GeoHashPrefixFilter_with_sorting_no_poly.patch, SOLR.2155.p3.patch, 
> SOLR.2155.p3tests.patch
>
>
> {panel:title=NOTICE} The status of this issue is a plugin for Solr 3.x 
> located here: https://github.com/dsmiley/SOLR-2155.  Look at the introductory 
> readme and download the plugin .jar file.  Lucene 4's new spatial module is 
> largely based on this code.  The Solr 4 glue for it should come very soon but 
> as of this writing it's hosted temporarily at https://github.com/spatial4j.  
> For more information on using SOLR-2155 with Solr 3, see 
> http://wiki.apache.org/solr/SpatialSearch#SOLR-2155  This JIRA issue is 
> closed because it won't be committed in its current form.
> {panel}
> There currently isn't a solution in Solr for doing geospatial filtering on 
> documents that have a variable number of points.  This scenario occurs when 
> there is location extraction (i.e. via a "gazateer") occurring on free text.  
> None, one, or many geospatial locations might be extracted from any given 
> document and users want to limit their search results to those occurring in a 
> user-specified area.
> I've implemented this by furthering the GeoHash based work in Lucene/Solr 
> with a geohash prefix based filter.  A geohash refers to a lat-lon box on the 
> earth.  Each successive character added further subdivides the box into a 4x8 
> (or 8x4 depending on the even/odd length of the geohash) grid.  The first 
> step in this scheme is figuring out which geohash grid squares cover the 
> user's search query.  I've added various extra methods to GeoHashUtils (and 
> added tests) to assist in this purpose.  The next step is an actual Lucene 
> Filter, GeoHashPrefixFilter, that uses these geohash prefixes in 
> TermsEnum.seek() to skip to relevant grid squares in the index.  Once a 
> matching geohash grid is found, the points therein are compared against the 
> user's query to see if it matches.  I created an abstraction GeoShape 
> extended by subclasses named PointDistance... and CartesianBox to support 
> different queried shapes so that the filter need not care about these details.
> This work was presented at LuceneRevolution in Boston on October 8th.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Comment Edited] (SOLR-3955) Return only matched multiValued field

2013-03-01 Thread Sandeep Tucknat (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590645#comment-13590645
 ] 

Sandeep Tucknat edited comment on SOLR-3955 at 3/1/13 3:43 PM:
---

This is especially important in a spatial search since there's an important 
business case of finding the branches/locations for an entity within a spatial 
filtering query. While the multi-valued spatial field implementation provides 
for filtering and scoring, it does not return this information to the client at 
the moment.


PS : I am relatively new to Solr and SE in general but have years of Java 
coding and debugging experience. I'd love to help resolve this if someone can 
point me in the right direction and something more than 'hook it up to the 
debugger and start looking' would be appreciated.

  was (Author: mathakuna):
This is especially important in a spatial search since there's an important 
business case of finding the branches/locations for an entity within a spatial 
filtering query. While the multi-valued spatial field implementation provides 
for filtering and scoring, it does not return this information to the client at 
the moment.
  
> Return only matched multiValued field
> -
>
> Key: SOLR-3955
> URL: https://issues.apache.org/jira/browse/SOLR-3955
> Project: Solr
>  Issue Type: Improvement
>  Components: search
>Affects Versions: 4.0
>Reporter: Dotan Cohen
>  Labels: features
>
> Assuming a multivalued, stored and indexed field named "comment". When 
> performing a search, it would be very helpful if there were a way to return 
> only the values of "comment" which contain the match. For example:
> When searching for "gold" instead of getting this result:
> 
> 
> Theres a lady whos sure
> all that glitters is gold
> and shes buying a stairway to heaven
> 
> 
> I would prefer to get this result:
> 
> 
> all that glitters is gold
> 
> 
> (psuedo-XML from memory, may not be accurate but illustrates the point)
> Thanks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3955) Return only matched multiValued field

2013-03-01 Thread Sandeep Tucknat (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590645#comment-13590645
 ] 

Sandeep Tucknat commented on SOLR-3955:
---

This is especially important in a spatial search since there's an important 
business case of finding the branches/locations for an entity within a spatial 
filtering query. While the multi-valued spatial field implementation provides 
for filtering and scoring, it does not return this information to the client at 
the moment.

> Return only matched multiValued field
> -
>
> Key: SOLR-3955
> URL: https://issues.apache.org/jira/browse/SOLR-3955
> Project: Solr
>  Issue Type: Improvement
>  Components: search
>Affects Versions: 4.0
>Reporter: Dotan Cohen
>  Labels: features
>
> Assuming a multivalued, stored and indexed field named "comment". When 
> performing a search, it would be very helpful if there were a way to return 
> only the values of "comment" which contain the match. For example:
> When searching for "gold" instead of getting this result:
> 
> 
> Theres a lady whos sure
> all that glitters is gold
> and shes buying a stairway to heaven
> 
> 
> I would prefer to get this result:
> 
> 
> all that glitters is gold
> 
> 
> (psuedo-XML from memory, may not be accurate but illustrates the point)
> Thanks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2155) Geospatial search using geohash prefixes

2013-03-01 Thread Sandeep Tucknat (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590636#comment-13590636
 ] 

Sandeep Tucknat commented on SOLR-2155:
---

I have a similar requirement to Sujan. I am doing a filtering spatial query, 
trying to find businesses (with multiple locations stored in a multi-valued 
field) available within a radius of a given point. I also need to know how many 
locations are actually within the radius as well as which one is the closest. 
Wondering if that's possible with the Solr 3 or 4 spatial implementation.

> Geospatial search using geohash prefixes
> 
>
> Key: SOLR-2155
> URL: https://issues.apache.org/jira/browse/SOLR-2155
> Project: Solr
>  Issue Type: Improvement
>Reporter: David Smiley
>Assignee: David Smiley
> Attachments: GeoHashPrefixFilter.patch, GeoHashPrefixFilter.patch, 
> GeoHashPrefixFilter.patch, Solr2155-1.0.2-project.zip, 
> Solr2155-1.0.3-project.zip, Solr2155-1.0.4-project.zip, 
> Solr2155-for-1.0.2-3.x-port.patch, 
> SOLR-2155_GeoHashPrefixFilter_with_sorting_no_poly.patch, SOLR.2155.p3.patch, 
> SOLR.2155.p3tests.patch
>
>
> {panel:title=NOTICE} The status of this issue is a plugin for Solr 3.x 
> located here: https://github.com/dsmiley/SOLR-2155.  Look at the introductory 
> readme and download the plugin .jar file.  Lucene 4's new spatial module is 
> largely based on this code.  The Solr 4 glue for it should come very soon but 
> as of this writing it's hosted temporarily at https://github.com/spatial4j.  
> For more information on using SOLR-2155 with Solr 3, see 
> http://wiki.apache.org/solr/SpatialSearch#SOLR-2155  This JIRA issue is 
> closed because it won't be committed in its current form.
> {panel}
> There currently isn't a solution in Solr for doing geospatial filtering on 
> documents that have a variable number of points.  This scenario occurs when 
> there is location extraction (i.e. via a "gazateer") occurring on free text.  
> None, one, or many geospatial locations might be extracted from any given 
> document and users want to limit their search results to those occurring in a 
> user-specified area.
> I've implemented this by furthering the GeoHash based work in Lucene/Solr 
> with a geohash prefix based filter.  A geohash refers to a lat-lon box on the 
> earth.  Each successive character added further subdivides the box into a 4x8 
> (or 8x4 depending on the even/odd length of the geohash) grid.  The first 
> step in this scheme is figuring out which geohash grid squares cover the 
> user's search query.  I've added various extra methods to GeoHashUtils (and 
> added tests) to assist in this purpose.  The next step is an actual Lucene 
> Filter, GeoHashPrefixFilter, that uses these geohash prefixes in 
> TermsEnum.seek() to skip to relevant grid squares in the index.  Once a 
> matching geohash grid is found, the points therein are compared against the 
> user's query to see if it matches.  I created an abstraction GeoShape 
> extended by subclasses named PointDistance... and CartesianBox to support 
> different queried shapes so that the filter need not care about these details.
> This work was presented at LuceneRevolution in Boston on October 8th.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org