[ 
https://issues.apache.org/jira/browse/LUCENE-5205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073323#comment-14073323
 ] 

ASF GitHub Bot commented on LUCENE-5205:
----------------------------------------

GitHub user tballison opened a pull request:

    https://github.com/apache/lucene-solr/pull/68

    Lucene5205

    LUCENE-5205
    1) merge from trunk
    2) roll in March 10, 2014 LUCENE-5205 patch for improved stopword handling
    3) roll in SOLR-5410

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tballison/lucene-solr lucene5205

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/lucene-solr/pull/68.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #68
    
----
commit 3687d27902c3d993291a9f169f1c4a338c417327
Author: Uwe Schindler <uschind...@apache.org>
Date:   2014-06-11T17:50:45Z

    SOLR-5940: post.jar reports back detailed error in case of error responses
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1601970 
13f79535-47bb-0310-9956-ffa450edef68

commit 4f2da71473619def348518402cc567f429047cc0
Author: Joel Bernstein <jbern...@apache.org>
Date:   2014-06-11T19:35:19Z

     SOLR-6150: Improving AnalyticsMergeStrategyTest
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1601997 
13f79535-47bb-0310-9956-ffa450edef68

commit 109c4c47679a193ac3ca3a4a449d759dbad59725
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-12T11:18:33Z

    SOLR-6056: Don't publish recovery state until recovery runs to avoid 
overwhelming the overseer state queue
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602123 
13f79535-47bb-0310-9956-ffa450edef68

commit d553138492454798b9abeff7e610f0e8f3ddfb8b
Author: Michael McCandless <mikemcc...@apache.org>
Date:   2014-06-12T11:54:20Z

    fix typo
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602131 
13f79535-47bb-0310-9956-ffa450edef68

commit 4dd3197621324234e77e741fd843c4d76df07719
Author: Noble Paul <no...@apache.org>
Date:   2014-06-12T12:18:21Z

    SOLR-6048 the assert was not really failing the test
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602138 
13f79535-47bb-0310-9956-ffa450edef68

commit 2cdb0941446628663849f56ffbe4b42c62d00e0c
Author: Shai Erera <sh...@apache.org>
Date:   2014-06-12T12:26:20Z

    add comments to clarify code
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602140 
13f79535-47bb-0310-9956-ffa450edef68

commit d1274853919c1c9867e8e71117ff1303b6cc8816
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-12T15:45:08Z

    Fix typo, rf is actually 3 in code
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602210 
13f79535-47bb-0310-9956-ffa450edef68

commit 0b9f7edd3109467052137004d36abb7f793e5835
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-12T19:40:36Z

    LUCENE-5748: Add SORTED_NUMERIC docvalues type
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602277 
13f79535-47bb-0310-9956-ffa450edef68

commit e2f2c2fdaa77b4c17f6922fb9c5e25b02563855a
Author: Uwe Schindler <uschind...@apache.org>
Date:   2014-06-13T08:54:20Z

    LUCENE-5754: Allow "$" as part of variable and function names in 
expressions module
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602344 
13f79535-47bb-0310-9956-ffa450edef68

commit 40137f9162350a6281e0d3fba99898fd66be28b2
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-13T11:39:43Z

    LUCENE-5695: DocIdSet implements Accountable.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602387 
13f79535-47bb-0310-9956-ffa450edef68

commit ccf0a812d1644e70b33157d5c33b34e78889f327
Author: Simon Willnauer <sim...@apache.org>
Date:   2014-06-13T11:41:19Z

    LUCENE-5756: Implement Accountable from IndexWriter
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602388 
13f79535-47bb-0310-9956-ffa450edef68

commit 0114c4e7292aa261996688b4f0813622d3ff99b3
Author: Simon Willnauer <sim...@apache.org>
Date:   2014-06-13T11:49:54Z

    Add Import Layout Table to idea codestyle
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602389 
13f79535-47bb-0310-9956-ffa450edef68

commit 0e92dc55e6293c26c020550742e2272547589df7
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-13T20:41:17Z

    LUCENE-5757: move RamUsageEstimator reflector to test-framework
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602515 
13f79535-47bb-0310-9956-ffa450edef68

commit 912e74424411c9055371924f403c0f66535c3066
Author: Chris M. Hostetter <hoss...@apache.org>
Date:   2014-06-13T21:15:50Z

    SOLR-5426: Fixed a bug in ReverseWildCardFilter that could cause 
InvalidTokenOffsetsException when highlighting
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602525 
13f79535-47bb-0310-9956-ffa450edef68

commit e9cb1382808cdd8f04dd837ce7fc473ed1e4a0b2
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-13T21:55:20Z

    LUCENE-5760: Speed up BufferedIndexInput.randomAccessSlice
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602530 
13f79535-47bb-0310-9956-ffa450edef68

commit 2df9ee28785e35e6e102b647b29e79660c10083f
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-14T14:46:10Z

    SOLR-6161: SolrDispatchFilter should throw java.lang.Error back even if 
wrapped in another exception
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602590 
13f79535-47bb-0310-9956-ffa450edef68

commit 0b567b02453725acb47ed451b518a6103cc7a898
Author: shalin Shekhar Mangar <sha...@apache.org>
Date:   2014-06-14T14:57:30Z

    SOLR-6153: ReplicationHandler backup response format should contain backup 
name
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602592 
13f79535-47bb-0310-9956-ffa450edef68

commit 2ca52f0e5949f85427616ee2d3c6dfb836a80ff8
Author: Alan Woodward <romseyg...@apache.org>
Date:   2014-06-16T09:17:23Z

    SOLR-6169: Properly remove deprecated CoreAdminHandler handleAlias action
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602825 
13f79535-47bb-0310-9956-ffa450edef68

commit 9fe4454fdd1160a4df87b3f65512a00bf822c5d1
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-16T11:07:19Z

    LUCENE-5762: Disable old codecs as much as possible
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602845 
13f79535-47bb-0310-9956-ffa450edef68

commit 4525f7ab6e5289a1f5357a577b1fb5dca7f6d46f
Author: Dawid Weiss <dwe...@apache.org>
Date:   2014-06-16T12:17:57Z

    SOLR-6151: Intermittent TestReplicationHandlerBackup failures.
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602854 
13f79535-47bb-0310-9956-ffa450edef68

commit 7c506afc3bda28d056389e7679588cf8e6be0b94
Author: David Wayne Smiley <dsmi...@apache.org>
Date:   2014-06-16T12:38:43Z

    LUCENE-5648: (NumberRangePrefixTree) Bug-fix in initIter optimization. 
Re-index required.
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602857 
13f79535-47bb-0310-9956-ffa450edef68

commit 730c70a59d26f2bc5aedc474c317c277b4c14b9e
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-16T13:14:01Z

    LUCENE-5761: Remove DiskDocValuesFormat
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602862 
13f79535-47bb-0310-9956-ffa450edef68

commit a55cbab679d5a88dbb36be7bf08d4dd27d39f602
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T14:22:02Z

    LUCENE-5759: Add PackedInts.unsignedBitsRequired.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602873 
13f79535-47bb-0310-9956-ffa450edef68

commit c412a51bc386cb8636a96daaa4bf7ed4c63101bd
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T14:25:12Z

    LUCENE-5764: Add tests to DocIdSet.ramBytesUsed.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602876 
13f79535-47bb-0310-9956-ffa450edef68

commit 83c64d82de99a9180bb03f7648aca10412a513eb
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T14:41:15Z

    LUCENE-5765: Add tests to OrdinalMap.ramBytesUsed.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602880 
13f79535-47bb-0310-9956-ffa450edef68

commit a28e98b142418347d260662e1b8c14f86536f9d5
Author: Timothy Potter <thelabd...@apache.org>
Date:   2014-06-16T16:44:15Z

    SOLR-6157: Added some logging and re-opened the socket proxy to try to 
figure out why this test is hanging; reenabling temporarily to see if these 
changes help diagnose the cause of the hang.
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602924 
13f79535-47bb-0310-9956-ffa450edef68

commit 5664fdd77a785b1c620527e1ecebb6659c457580
Author: Timothy Potter <thelabd...@apache.org>
Date:   2014-06-16T18:28:07Z

    SOLR-6015: Moving change note to 4.9 bugfix section.
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602950 
13f79535-47bb-0310-9956-ffa450edef68

commit f233dd7698ea7f2134e86bc696868f7116d6140e
Author: Timothy Potter <thelabd...@apache.org>
Date:   2014-06-16T18:36:08Z

    SOLR-5956: Use getInstanceDir instead of getRawInstanceDir as that was 
causing issues when trying to create a backup of an index on Linux when 
solr.solr.home is a symbolic link.
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602953 
13f79535-47bb-0310-9956-ffa450edef68

commit c1e8bb94e81eaa3c3188e81966ce311b624c0e96
Author: Adrien Grand <jpou...@apache.org>
Date:   2014-06-16T20:33:44Z

    LUCENE-5767: OrdinalMap optimizations.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602997 
13f79535-47bb-0310-9956-ffa450edef68

commit 2b985b580148ecb9f969a918c715f6488d92577b
Author: Robert Muir <rm...@apache.org>
Date:   2014-06-16T21:56:05Z

    LUCENE-5768: hunspell condition checks with character classes were buggy
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1603007 
13f79535-47bb-0310-9956-ffa450edef68

----


> [PATCH] SpanQueryParser with recursion, analysis and syntax very similar to 
> classic QueryParser
> -----------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-5205
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5205
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/queryparser
>            Reporter: Tim Allison
>              Labels: patch
>             Fix For: 4.9
>
>         Attachments: LUCENE-5205-cleanup-tests.patch, 
> LUCENE-5205-date-pkg-prvt.patch, LUCENE-5205.patch.gz, LUCENE-5205.patch.gz, 
> LUCENE-5205_dateTestReInitPkgPrvt.patch, 
> LUCENE-5205_improve_stop_word_handling.patch, 
> LUCENE-5205_smallTestMods.patch, LUCENE_5205.patch, 
> SpanQueryParser_v1.patch.gz, patch.txt
>
>
> This parser extends QueryParserBase and includes functionality from:
> * Classic QueryParser: most of its syntax
> * SurroundQueryParser: recursive parsing for "near" and "not" clauses.
> * ComplexPhraseQueryParser: can handle "near" queries that include multiterms 
> (wildcard, fuzzy, regex, prefix),
> * AnalyzingQueryParser: has an option to analyze multiterms.
> At a high level, there's a first pass BooleanQuery/field parser and then a 
> span query parser handles all terminal nodes and phrases.
> Same as classic syntax:
> * term: test 
> * fuzzy: roam~0.8, roam~2
> * wildcard: te?t, test*, t*st
> * regex: /\[mb\]oat/
> * phrase: "jakarta apache"
> * phrase with slop: "jakarta apache"~3
> * default "or" clause: jakarta apache
> * grouping "or" clause: (jakarta apache)
> * boolean and +/-: (lucene OR apache) NOT jakarta; +lucene +apache -jakarta
> * multiple fields: title:lucene author:hatcher
>  
> Main additions in SpanQueryParser syntax vs. classic syntax:
> * Can require "in order" for phrases with slop with the \~> operator: 
> "jakarta apache"\~>3
> * Can specify "not near": "fever bieber"!\~3,10 ::
>     find "fever" but not if "bieber" appears within 3 words before or 10 
> words after it.
> * Fully recursive phrasal queries with \[ and \]; as in: \[\[jakarta 
> apache\]~3 lucene\]\~>4 :: 
>     find "jakarta" within 3 words of "apache", and that hit has to be within 
> four words before "lucene"
> * Can also use \[\] for single level phrasal queries instead of " as in: 
> \[jakarta apache\]
> * Can use "or grouping" clauses in phrasal queries: "apache (lucene solr)"\~3 
> :: find "apache" and then either "lucene" or "solr" within three words.
> * Can use multiterms in phrasal queries: "jakarta\~1 ap*che"\~2
> * Did I mention full recursion: \[\[jakarta\~1 ap*che\]\~2 (solr~ 
> /l\[ou\]\+\[cs\]\[en\]\+/)]\~10 :: Find something like "jakarta" within two 
> words of "ap*che" and that hit has to be within ten words of something like 
> "solr" or that "lucene" regex.
> * Can require at least x number of hits at boolean level: "apache AND (lucene 
> solr tika)~2
> * Can use negative only query: -jakarta :: Find all docs that don't contain 
> "jakarta"
> * Can use an edit distance > 2 for fuzzy query via SlowFuzzyQuery (beware of 
> potential performance issues!).
> Trivial additions:
> * Can specify prefix length in fuzzy queries: jakarta~1,2 (edit distance =1, 
> prefix =2)
> * Can specifiy Optimal String Alignment (OSA) vs Levenshtein for distance 
> <=2: (jakarta~1 (OSA) vs jakarta~>1(Levenshtein)
> This parser can be very useful for concordance tasks (see also LUCENE-5317 
> and LUCENE-5318) and for analytical search.  
> Until LUCENE-2878 is closed, this might have a use for fans of SpanQuery.
> Most of the documentation is in the javadoc for SpanQueryParser.
> Any and all feedback is welcome.  Thank you.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to