[ https://issues.apache.org/jira/browse/LUCENE-5205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073323#comment-14073323 ]
ASF GitHub Bot commented on LUCENE-5205:
----------------------------------------

GitHub user tballison opened a pull request:

    https://github.com/apache/lucene-solr/pull/68

    Lucene5205

    LUCENE-5205
    1) merge from trunk
    2) roll in March 10, 2014 LUCENE-5205 patch for improved stopword handling
    3) roll in SOLR-5410

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tballison/lucene-solr lucene5205

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/lucene-solr/pull/68.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #68

----
commit 3687d27902c3d993291a9f169f1c4a338c417327
Author: Uwe Schindler <uschind...@apache.org>
Date:   2014-06-11T17:50:45Z

    SOLR-5940: post.jar reports back detailed error in case of error responses

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1601970 13f79535-47bb-0310-9956-ffa450edef68

commit 4f2da71473619def348518402cc567f429047cc0
Author: Joel Bernstein <jbern...@apache.org>
Date:   2014-06-11T19:35:19Z

    SOLR-6150: Improving AnalyticsMergeStrategyTest

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1601997 13f79535-47bb-0310-9956-ffa450edef68

commit 109c4c47679a193ac3ca3a4a449d759dbad59725
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-12T11:18:33Z

    SOLR-6056: Don't publish recovery state until recovery runs to avoid overwhelming the overseer state queue

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602123 13f79535-47bb-0310-9956-ffa450edef68

commit d553138492454798b9abeff7e610f0e8f3ddfb8b
Author: Michael McCandless <mikemcc...@apache.org>
Date:   2014-06-12T11:54:20Z

    fix typo

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602131 13f79535-47bb-0310-9956-ffa450edef68

commit 4dd3197621324234e77e741fd843c4d76df07719
Author: Noble Paul <no...@apache.org>
Date:   2014-06-12T12:18:21Z

    SOLR-6048 the assert was not really failing the test

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602138 13f79535-47bb-0310-9956-ffa450edef68

commit 2cdb0941446628663849f56ffbe4b42c62d00e0c
Author: Shai Erera <sh...@apache.org>
Date:   2014-06-12T12:26:20Z

    add comments to clarify code

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602140 13f79535-47bb-0310-9956-ffa450edef68

commit d1274853919c1c9867e8e71117ff1303b6cc8816
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-12T15:45:08Z

    Fix typo, rf is actually 3 in code

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602210 13f79535-47bb-0310-9956-ffa450edef68

commit 0b9f7edd3109467052137004d36abb7f793e5835
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-12T19:40:36Z

    LUCENE-5748: Add SORTED_NUMERIC docvalues type

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602277 13f79535-47bb-0310-9956-ffa450edef68

commit e2f2c2fdaa77b4c17f6922fb9c5e25b02563855a
Author: Uwe Schindler <uschind...@apache.org>
Date:   2014-06-13T08:54:20Z

    LUCENE-5754: Allow "$" as part of variable and function names in expressions module

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602344 13f79535-47bb-0310-9956-ffa450edef68

commit 40137f9162350a6281e0d3fba99898fd66be28b2
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-13T11:39:43Z

    LUCENE-5695: DocIdSet implements Accountable.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602387 13f79535-47bb-0310-9956-ffa450edef68

commit ccf0a812d1644e70b33157d5c33b34e78889f327
Author: Simon Willnauer <sim...@apache.org>
Date:   2014-06-13T11:41:19Z

    LUCENE-5756: Implement Accountable from IndexWriter

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602388 13f79535-47bb-0310-9956-ffa450edef68

commit 0114c4e7292aa261996688b4f0813622d3ff99b3
Author: Simon Willnauer <sim...@apache.org>
Date:   2014-06-13T11:49:54Z

    Add Import Layout Table to idea codestyle

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602389 13f79535-47bb-0310-9956-ffa450edef68

commit 0e92dc55e6293c26c020550742e2272547589df7
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-13T20:41:17Z

    LUCENE-5757: move RamUsageEstimator reflector to test-framework

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602515 13f79535-47bb-0310-9956-ffa450edef68

commit 912e74424411c9055371924f403c0f66535c3066
Author: Chris M. Hostetter <hoss...@apache.org>
Date:   2014-06-13T21:15:50Z

    SOLR-5426: Fixed a bug in ReverseWildCardFilter that could cause InvalidTokenOffsetsException when highlighting

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602525 13f79535-47bb-0310-9956-ffa450edef68

commit e9cb1382808cdd8f04dd837ce7fc473ed1e4a0b2
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-13T21:55:20Z

    LUCENE-5760: Speed up BufferedIndexInput.randomAccessSlice

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602530 13f79535-47bb-0310-9956-ffa450edef68

commit 2df9ee28785e35e6e102b647b29e79660c10083f
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-14T14:46:10Z

    SOLR-6161: SolrDispatchFilter should throw java.lang.Error back even if wrapped in another exception

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602590 13f79535-47bb-0310-9956-ffa450edef68

commit 0b567b02453725acb47ed451b518a6103cc7a898
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-14T14:57:30Z

    SOLR-6153: ReplicationHandler backup response format should contain backup name

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602592 13f79535-47bb-0310-9956-ffa450edef68

commit 2ca52f0e5949f85427616ee2d3c6dfb836a80ff8
Author: Alan Woodward <romseyg...@apache.org>
Date:   2014-06-16T09:17:23Z

    SOLR-6169: Properly remove deprecated CoreAdminHandler handleAlias action

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602825 13f79535-47bb-0310-9956-ffa450edef68

commit 9fe4454fdd1160a4df87b3f65512a00bf822c5d1
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-16T11:07:19Z

    LUCENE-5762: Disable old codecs as much as possible

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602845 13f79535-47bb-0310-9956-ffa450edef68

commit 4525f7ab6e5289a1f5357a577b1fb5dca7f6d46f
Author: Dawid Weiss <dwe...@apache.org>
Date:   2014-06-16T12:17:57Z

    SOLR-6151: Intermittent TestReplicationHandlerBackup failures.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602854 13f79535-47bb-0310-9956-ffa450edef68

commit 7c506afc3bda28d056389e7679588cf8e6be0b94
Author: David Wayne Smiley <dsmi...@apache.org>
Date:   2014-06-16T12:38:43Z

    LUCENE-5648: (NumberRangePrefixTree) Bug-fix in initIter optimization. Re-index required.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602857 13f79535-47bb-0310-9956-ffa450edef68

commit 730c70a59d26f2bc5aedc474c317c277b4c14b9e
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-16T13:14:01Z

    LUCENE-5761: Remove DiskDocValuesFormat

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602862 13f79535-47bb-0310-9956-ffa450edef68

commit a55cbab679d5a88dbb36be7bf08d4dd27d39f602
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T14:22:02Z

    LUCENE-5759: Add PackedInts.unsignedBitsRequired.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602873 13f79535-47bb-0310-9956-ffa450edef68

commit c412a51bc386cb8636a96daaa4bf7ed4c63101bd
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T14:25:12Z

    LUCENE-5764: Add tests to DocIdSet.ramBytesUsed.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602876 13f79535-47bb-0310-9956-ffa450edef68

commit 83c64d82de99a9180bb03f7648aca10412a513eb
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T14:41:15Z

    LUCENE-5765: Add tests to OrdinalMap.ramBytesUsed.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602880 13f79535-47bb-0310-9956-ffa450edef68

commit a28e98b142418347d260662e1b8c14f86536f9d5
Author: Timothy Potter <thelabd...@apache.org>
Date:   2014-06-16T16:44:15Z

    SOLR-6157: Added some logging and re-opened the socket proxy to try to figure out why this test is hanging; reenabling temporarily to see if these changes help diagnose the cause of the hang.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602924 13f79535-47bb-0310-9956-ffa450edef68

commit 5664fdd77a785b1c620527e1ecebb6659c457580
Author: Timothy Potter <thelabd...@apache.org>
Date:   2014-06-16T18:28:07Z

    SOLR-6015: Moving change note to 4.9 bugfix section.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602950 13f79535-47bb-0310-9956-ffa450edef68

commit f233dd7698ea7f2134e86bc696868f7116d6140e
Author: Timothy Potter <thelabd...@apache.org>
Date:   2014-06-16T18:36:08Z

    SOLR-5956: Use getInstanceDir instead of getRawInstanceDir as that was causing issues when trying to create a backup of an index on Linux when solr.solr.home is a symbolic link.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602953 13f79535-47bb-0310-9956-ffa450edef68

commit c1e8bb94e81eaa3c3188e81966ce311b624c0e96
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T20:33:44Z

    LUCENE-5767: OrdinalMap optimizations.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602997 13f79535-47bb-0310-9956-ffa450edef68

commit 2b985b580148ecb9f969a918c715f6488d92577b
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-16T21:56:05Z

    LUCENE-5768: hunspell condition checks with character classes were buggy

    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1603007 13f79535-47bb-0310-9956-ffa450edef68

----

> [PATCH] SpanQueryParser with recursion, analysis and syntax very similar to classic QueryParser
> -----------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-5205
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5205
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/queryparser
>            Reporter: Tim Allison
>              Labels: patch
>             Fix For: 4.9
>
>         Attachments: LUCENE-5205-cleanup-tests.patch,
>              LUCENE-5205-date-pkg-prvt.patch, LUCENE-5205.patch.gz, LUCENE-5205.patch.gz,
>              LUCENE-5205_dateTestReInitPkgPrvt.patch,
>              LUCENE-5205_improve_stop_word_handling.patch,
>              LUCENE-5205_smallTestMods.patch, LUCENE_5205.patch,
>              SpanQueryParser_v1.patch.gz, patch.txt
>
>
> This parser extends QueryParserBase and includes functionality from:
> * Classic QueryParser: most of its syntax
> * SurroundQueryParser: recursive parsing for "near" and "not" clauses
> * ComplexPhraseQueryParser: can handle "near" queries that include multiterms (wildcard, fuzzy, regex, prefix)
> * AnalyzingQueryParser: has an option to analyze multiterms
>
> At a high level, there's a first-pass BooleanQuery/field parser, and then a span query parser handles all terminal nodes and phrases.
>
> Same as classic syntax:
> * term: test
> * fuzzy: roam~0.8, roam~2
> * wildcard: te?t, test*, t*st
> * regex: /[mb]oat/
> * phrase: "jakarta apache"
> * phrase with slop: "jakarta apache"~3
> * default "or" clause: jakarta apache
> * grouping "or" clause: (jakarta apache)
> * boolean and +/-: (lucene OR apache) NOT jakarta; +lucene +apache -jakarta
> * multiple fields: title:lucene author:hatcher
>
> Main additions in SpanQueryParser syntax vs. classic syntax:
> * Can require "in order" for phrases with slop with the ~> operator: "jakarta apache"~>3
> * Can specify "not near": "fever bieber"!~3,10 :: find "fever" but not if "bieber" appears within 3 words before or 10 words after it.
> * Fully recursive phrasal queries with [ and ]; as in: [[jakarta apache]~3 lucene]~>4 :: find "jakarta" within 3 words of "apache", and that hit has to be within four words before "lucene"
> * Can also use [] for single-level phrasal queries instead of " as in: [jakarta apache]
> * Can use "or grouping" clauses in phrasal queries: "apache (lucene solr)"~3 :: find "apache" and then either "lucene" or "solr" within three words.
> * Can use multiterms in phrasal queries: "jakarta~1 ap*che"~2
> * Did I mention full recursion: [[jakarta~1 ap*che]~2 (solr~ /l[ou]+[cs][en]+/)]~10 :: find something like "jakarta" within two words of "ap*che", and that hit has to be within ten words of something like "solr" or that "lucene" regex.
> * Can require at least x number of hits at boolean level: apache AND (lucene solr tika)~2
> * Can use negative-only query: -jakarta :: find all docs that don't contain "jakarta"
> * Can use an edit distance > 2 for fuzzy query via SlowFuzzyQuery (beware of potential performance issues!).
>
> Trivial additions:
> * Can specify prefix length in fuzzy queries: jakarta~1,2 (edit distance = 1, prefix = 2)
> * Can specify Optimal String Alignment (OSA) vs. Levenshtein for distance <= 2: jakarta~1 (OSA) vs. jakarta~>1 (Levenshtein)
>
> This parser can be very useful for concordance tasks (see also LUCENE-5317 and LUCENE-5318) and for analytical search.
> Until LUCENE-2878 is closed, this might have a use for fans of SpanQuery.
> Most of the documentation is in the javadoc for SpanQueryParser.
> Any and all feedback is welcome. Thank you.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
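The "not near" operator in the description above ("fever bieber"!~3,10) can be read as a filter over token positions. The following is only a sketch of that stated behaviour in plain Java, not the SpanQueryParser implementation; the class name NotNearSketch and the method notNear are hypothetical and exist only for this illustration.

```java
import java.util.ArrayList;
import java.util.List;

final class NotNearSketch {

    /**
     * Returns the positions of `target` in `tokens` for which `excluded`
     * does NOT occur within `before` tokens before or `after` tokens after.
     * Illustrative only: real span queries work on index positions, not lists.
     */
    static List<Integer> notNear(List<String> tokens, String target,
                                 String excluded, int before, int after) {
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < tokens.size(); i++) {
            if (!tokens.get(i).equals(target)) {
                continue;
            }
            // Clamp the exclusion window to the bounds of the document.
            int from = Math.max(0, i - before);
            int to = Math.min(tokens.size() - 1, i + after);
            boolean blocked = false;
            for (int j = from; j <= to; j++) {
                if (j != i && tokens.get(j).equals(excluded)) {
                    blocked = true;
                    break;
                }
            }
            if (!blocked) {
                hits.add(i);
            }
        }
        return hits;
    }
}
```

With before = 3 and after = 10, the token list [bieber, a, b, c, fever] yields [4], because "bieber" sits four positions before "fever"; dropping one filler token so that "bieber" is three positions before yields an empty list.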