[ 
https://issues.apache.org/jira/browse/LUCENE-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14527103#comment-14527103
 ] 

Nicholas Knize edited comment on LUCENE-6450 at 5/4/15 8:00 PM:
----------------------------------------------------------------

Was out last week but had some time this weekend to add TermsEnum logic to 
visit only those ranges along the SFC that represent the bounding box.

Updated patch attached - this code currently exists in sandbox.  Benchmarks 
(using luceneutil thanks to [~mikemccand] for adding geo benchmarking) are 
below:

Data Set:  60M points of Planet OSM GPS data 
(http://wiki.openstreetmap.org/wiki/File:World-gps-points-120604-2048.png)

*QuadPrefixTree* 
Parameters: distErrPct: 0, pruneLeafyBranches: true, pointsOnly: true, level: 29

Index Time: 2449.08 sec
Index Size: 13G
Mean Query Time: 0.066 sec

*PackedQuadPrefixTree* 
Parameters: distErrPct: 0, pruneLeafyBranches: true, pointsOnly: true, level: 29

Index Time:  1945.288 sec
Index Size: 11G
Mean Query Time:  0.058 sec

*GeoHashPrefixTree*

Index Time:  695.079 sec
Index Size: 4.2G
Mean Query Time:  0.071 sec

*GeoPointField*

Index Time:  180.872 sec
Index Size: 1.8G
Mean Query Time:  0.107 sec

Hardware: 8 core System76 Ubuntu 14.10 laptop w/ 16GB memory



was (Author: nknize):
Was out last week but had some time this weekend to add TermsEnum logic to 
visit only those ranges along the SFC that represent the bounding box.

Updated patch attached - this code currently exists in sandbox.  Benchmarks 
(using luceneutil thanks to [~mikemccand] for adding geo benchmarking) are 
below:

Data Set:  60M points of Planet OSM GPS data 
(http://wiki.openstreetmap.org/wiki/File:World-gps-points-120604-2048.png)

*QuadPrefixTree* 
Parameters: distErrPct: 0, pruneLeafyBranches: true, pointsOnly: true, level: 29

Index Time: 2449.08 sec
Index Size: 13G
Mean Query Time: 0.066 sec

*PackedQuadPrefixTree* 
Parameters: distErrPct: 0, pruneLeafyBranches: true, pointsOnly: true, level: 29

Index Time:  1945.288 sec
Index Size: 11G
Mean Query Time:  0.058 sec

*GeoHashPrefixTree*

Index Time:  695.079 sec
Index Size: 4.2G
Mean Query Time:  0.071 sec

*GeoPointField*

Index Time:  180.872 sec
Index Size: 1.8G
Mean Query Time:  0.107 sec



> Add simple encoded GeoPointField type to core
> ---------------------------------------------
>
>                 Key: LUCENE-6450
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6450
>             Project: Lucene - Core
>          Issue Type: New Feature
>    Affects Versions: Trunk, 5.x
>            Reporter: Nicholas Knize
>            Priority: Minor
>         Attachments: LUCENE-6450-5x.patch, LUCENE-6450-TRUNK.patch, 
> LUCENE-6450.patch, LUCENE-6450.patch, LUCENE-6450.patch, LUCENE-6450.patch
>
>
> At the moment all spatial capabilities, including basic point based indexing 
> and querying, require the lucene-spatial module. The spatial module, designed 
> to handle all things geo, requires dependency overhead (s4j, jts) to provide 
> spatial rigor for even the most simplistic spatial search use-cases (e.g., 
> lat/lon bounding box, point in poly, distance search). This feature trims the 
> overhead by adding a new GeoPointField type to core along with 
> GeoBoundingBoxQuery and GeoPolygonQuery classes to the .search package. This 
> field is intended as a straightforward lightweight type for the most basic 
> geo point use-cases without the overhead. 
> The field uses simple bit twiddling operations (currently morton hashing) to 
> encode lat/lon into a single long term.  The queries leverage simple 
> multi-phase filtering that starts by leveraging NumericRangeQuery to reduce 
> candidate terms deferring the more expensive mathematics to the smaller 
> candidate sets.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to