Re: [VOTE] Lucene logo contest, third time's a charm

2020-09-01 Thread Steve Rowe
D (binding) -- Steve > On Sep 1, 2020, at 4:21 PM, Ryan Ernst wrote: > > Dear Lucene and Solr developers! > > Sorry for the multiple threads. This should be the last one. > > In February a contest was started to design a new logo for Lucene > [jira-issue]. The initial attempt [first-vote] to

Re: [VOTE] Lucene logo contest, here we go again

2020-09-01 Thread Steve Rowe
D (binding) -- Steve > On Aug 31, 2020, at 8:26 PM, Ryan Ernst wrote: > > Dear Lucene and Solr developers! > > In February a contest was started to design a new logo for Lucene > [jira-issue]. The initial attempt [first-vote] to call a vote resulted in > some confusion on the rules, as well

Re: [VOTE] Lucene logo contest

2020-06-16 Thread Steve Rowe
C. The current Lucene logo -- Steve > On Jun 15, 2020, at 6:08 PM, Ryan Ernst wrote: > > Dear Lucene and Solr developers! > > In February a contest was started to design a new logo for Lucene [1]. That > contest concluded, and I am now (admittedly a little late!) calling a vote. > > The entr

[ANNOUNCE] Apache Lucene 6.6.3 released

2018-03-07 Thread Steve Rowe
7 March 2018, Apache Lucene™ 6.6.3 available The Lucene PMC is pleased to announce the release of Apache Lucene 6.6.3. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires f

Re: Maven snapshots

2018-01-11 Thread Steve Rowe
> On Jan 11, 2018, at 11:18 AM, Terry Smith wrote: > > Steve, > > Thanks for looking into this. I see the artifacts for 7.2.1-SNAPSHOT, > 7.3.0-SNAPSHOT, and 8.0.0-SNAPSHOT are now available so things are looking > good. Cool, thanks for checking on it. > On Tue, Jan 9, 2018 at 4:38 PM, Uwe S

Re: Maven snapshots

2018-01-09 Thread Steve Rowe
napshots.https m2.repository.url=https://repository.apache.org/content/repositories/snapshots skipTests=true > On Jan 9, 2018, at 2:24 PM, Steve Rowe wrote: > > Hi Terry, > > Thanks for the heads-up about this problem. > > There are ASF Jenkins jobs that regularly build tho

Re: Maven snapshots

2018-01-09 Thread Steve Rowe
Hi Terry, Thanks for the heads-up about this problem. There are ASF Jenkins jobs that regularly build those snapshots - see the jobs with “Maven” in their names here: . I’ll look into the cause of the long lags and report back. -- Steve www.lucid

Re: Lucene config issue cannot run demo

2017-11-10 Thread Steve Rowe
Hi Mike, Just above the line you give, there is a discussion of “setting up your Java CLASSPATH” - you need to do this first. Assuming you’re on Windows (because your email includes “Sent from Mail for Windows 10”), you’ll need to do something like the following before invoking the java comman

[ANNOUNCE] Apache Lucene 5.5.5 released

2017-10-24 Thread Steve Rowe
24 October 2017, Apache Lucene™ 5.5.5 available The Lucene PMC is pleased to announce the release of Apache Lucene 5.5.5. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that require

[ANNOUNCE] Apache Lucene 7.0.1 released

2017-10-06 Thread Steve Rowe
6 October 2017, Apache Lucene™ 7.0.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 7.0.1 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires

Re: Tokens produced by Shingle filter are not added in the query

2017-07-24 Thread Steve Rowe
gt;>query = parser.parse(queryStr); >>System.out.println("\nQuery is"); >>System.out.print(query.toString()); >>} >> } > > > Output: >> Tokens are : >> cup, cup board, board >> Query is n >> n:cup n:b

Re: Tokens produced by Shingle filter are not added in the query

2017-07-24 Thread Steve Rowe
Hi hariram, There may be other problems, but at a minimum you have two different analysis classes here. You’re printing the output stream from one (CustomSynynymAnalyzer, the source of which is not shown in your email), but constructing a query from a different one (CustomAnalyzer). -- Steve

Re: mvn snapshot releases

2017-07-05 Thread Steve Rowe
Hi Terry, They’re there now. These are produced by a regularly-scheduled Jenkins job , which runs every day or two (judging from recent build history). -- Steve www.lucidworks.com > On Jul 5, 2017, at 11:12 AM, Terry Smith wrote: > >

Re: email field - analyzed and not analyzed in single field using custom analyzer

2017-06-15 Thread Steve Rowe
Hi Kumaran, WordDelimiterGraphFilter with PRESERVE_ORIGINAL should do what you want: . Here’s a test I added to TestWordDelimiterGraphFilter.java that passed for me:

Re: Lucene synonym for multi-words and query parsers

2017-04-12 Thread Steve Rowe
Hi Nicolas, Classic QueryParser and SimpleQueryParser should work for you (see below). Some work has been done on StandardQueryParser (see ), but that work is not ready yet. AFAIK nobody has worked on enabling multi-term analysis in ComplexP

Re: A flush exception in lucene 4.10.0

2017-03-09 Thread Steve Rowe
Maybe (though it was committed in Lucene 4.5)? Robert Muir pointed to this issue as fixing , which contains a similar stack track to yours. -- Steve www.lucidworks.com > On Mar 9, 2017, at 6:

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
Steve www.lucidworks.com > On Feb 4, 2017, at 12:46 AM, aurelian rosca wrote: > > i am subscribed to the @dev list now. > > On Fri, Feb 3, 2017 at 10:26 PM, Steve Rowe wrote: > >> Hi Aurelian, >> >> Your response to the dev@ list required moderation, likely b

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
Steve www.lucidworks.com > On Feb 3, 2017, at 4:52 PM, Steve Rowe wrote: > > Hi Scott, > > FYI the average number of MODERATE emails per day per mailing list is roughly > *1* (I’ve received about 50 in the last 2 months over the 3 mailing lists > I've been moderating up to this

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
17, at 4:34 PM, scott cote wrote: > > Let me ask if I can get some cycles to do this. > > I’m interested but I have to check first. > > SCott > > scott.c...@lucidworks.com > > >> On Feb 3, 2017, at 3:14 PM, Steve Rowe wrote: >> >> FYI I’m h

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
Feb 3, 2017, at 3:26 PM, Steve Rowe wrote: > > Hi Aurelian, > > Your response to the dev@ list required moderation, likely because you’re not > subscribed to the dev@ list with the email address you used to respond. > Please first go subscribe to the dev@ list with the e

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
rements and i will let you know if I will have questions. Should i be > online each day or just 5days/week. > Pe 03.02.2017 22:07, "Steve Rowe" a scris: > >> Great! We only needed one new volunteer on each of the two lists, so we >> should be all set now. >>

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
Great! We only needed one new volunteer on each of the two lists, so we should be all set now. I’ll go make an INFRA JIRA requesting the moderator changes. -- Steve www.lucidworks.com > On Feb 3, 2017, at 3:02 PM, aurelian rosca wrote: > > Both. > Pe 03.02.2017 21:59, "Ste

Re: Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
PM, aurelian rosca wrote: > > Seems to be an easy job. I am in. > > On Feb 3, 2017 9:13 PM, "Steve Rowe" wrote: > >> Hello subscribers to dev@l.a.o and java-user@l.a.o: >> >> We need to replace a moderator who no longer wishes to do the job on t

Call for MODERATORs on the dev and java-user mailing lists

2017-02-03 Thread Steve Rowe
Hello subscribers to dev@l.a.o and java-user@l.a.o: We need to replace a moderator who no longer wishes to do the job on these two mailing lists. If anyone is interested in being a MODERATOR, please reply back to this thread. Being a moderator is really easy, the main chunk of the responsibili

Re: Too long token is not handled properly?

2016-11-14 Thread Steve Rowe
Hi Alexey, > On Nov 14, 2016, at 3:49 AM, Alexey Makeev wrote: > > But, please correct me if I wrong, this change of semantics (which has > implications from the user point of view) was a workaround for a performance > problem? I there was't the performance problem, it would be better to keep

Re: Too long token is not handled properly?

2016-11-11 Thread Steve Rowe
Hi Alexey, The behavior you mention is an intentional change from the behavior in Lucene 4.9.0 and earlier, when tokens longer than maxTokenLenth were silently ignored: see LUCENE-5897[1] and LUCENE-5400[2]. The new behavior is as follows: Token matching rules are no longer allowed to match ag

Re: POS tagging in Lucene

2016-10-18 Thread Steve Rowe
Hi Niki, > On Oct 18, 2016, at 7:27 AM, Niki Pavlopoulou wrote: > > Hi all, > > I am using Lucene and OpenNLP for POS tagging. I would like to support > biGrams with POS tags as well. For example, I would like something like > that: > > Input: (I[PRP], am[VBP], using[VBG], Lucene[NNP]) > Outpu

Re: null Query from MultiFieldQueryParser.getFieldQuery

2016-10-04 Thread Steve Rowe
or the fix. > > I locally applied the patch on branch_6_2 (because that is closest to my > current 6.2.1 dependency) and built Lucene from there. > Using the outcome in my application, the problem observed there is fixed. > > Best regards, > Oliver > > -----Ursprüngliche

Re: null Query from MultiFieldQueryParser.getFieldQuery

2016-09-30 Thread Steve Rowe
Hi Oliver, Thanks for reporting and for the analysis, this is a bug. See , where I’ve put up a patch with a fix that treats all non-BooleanQuery queries opaquely (like TermQuery), and adds a test for the SynonymQuery case that fails without the

[ANNOUNCE] Apache Lucene 5.5.2 released

2016-06-25 Thread Steve Rowe
25 June 2016, Apache Lucene™ 5.5.2 available The Lucene PMC is pleased to announce the release of Apache Lucene 5.5.2 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-t

Re: Some questions about StandardTokenizer and UNICODE Regular Expressions

2016-06-16 Thread Steve Rowe
Hi dr, Unicode’s character property model is described here: . Wikipedia has a description of Unicode character properties: JFlex allows you to refer to the set of characters that have a given Unicode

[ANNOUNCE] Apache Lucene 6.0.1 released

2016-05-28 Thread Steve Rowe
28 May 2016, Apache Lucene™ 6.0.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 6.0.1 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires ful

Re: QueryParser with CustomAnalyzer wrongly uses PatternReplaceCharFilter

2016-04-28 Thread Steve Rowe
Classic QueryParser splits on whitespace and then sends the chunks to the analyzer one at a time. See . -- Steve www.lucidworks.com > On Apr 28, 2016, at 5:54 AM, Bahaa Eldesouky wrote: > > I am using org.apache.lucene.queryparser.classic.Qu

Re: Cannot comment on Jira issues

2016-04-22 Thread Steve Rowe
Mạnh, I’ve added you to the LUCENE and SOLR projects as a contributor, so you should now be able to create and comment on issues. -- Steve www.lucidworks.com > On Apr 22, 2016, at 6:18 AM, Đạt Cao Mạnh wrote: > > Thanks uwe, my account at jira is : "caomanhdat" > > On Fri, Apr 22, 2016 at 5:

Re: 5.3.1 artifacts in maven central

2015-09-30 Thread Steve Rowe
issing steps > On Sep 30, 2015 6:42 PM, "Steve Rowe" wrote: > > > I pulled the maven publishing stuff out into its own page to declutter > the > > main ReleaseTodo page. I don’t think it’s a great idea to put an example > > of a subset of the required steps he

Re: 5.3.1 artifacts in maven central

2015-09-30 Thread Steve Rowe
I pulled the maven publishing stuff out into its own page to declutter the main ReleaseTodo page. I don’t think it’s a great idea to put an example of a subset of the required steps here - Noble, your example leaves out two following steps: after staging you have to go close the repository and

Re: tokenize into sentences/sentence splitter

2015-09-23 Thread Steve Rowe
ence as a document? Basically the field for sentence and the field for > terms should be in the same index. > > Thanks > > > > On 23/09/2015 19:08, Steve Rowe wrote: >> Hi Ziqi, >> >> Lucene has support for sentence chunking - see SegmentingTokenizerB

Re: tokenize into sentences/sentence splitter

2015-09-23 Thread Steve Rowe
Hi Ziqi, Lucene has support for sentence chunking - see SegmentingTokenizerBase, implemeented in ThaiTokenizer and HMMChineseTokenizer. There is an example in that class’s tests that creates tokens out of individual sentences: TestSegmentingTokenizerBase.WholeSentenceTokenizer. However, it

Re: Request to be added to the ContributorsGroup

2015-09-02 Thread Steve Rowe
Hi Charlie, I’ve added your account name to the ContributorsGroup page on the Lucene wiki, so you should now have edit privileges. This mailing list sets the Reply-To header to the mailing list itself, so even if I use Reply-All, replies won’t go to you directly. I’ve manually CC’d you on thi

Re: StandardTokenizer#setMaxTokenLength

2015-07-20 Thread Steve Rowe
> Regards > > On Fri, Jul 17, 2015 at 4:40 PM, Steve Rowe wrote: > >> Hi Piotr, >> >> Thanks for reporting! >> >> See https://issues.apache.org/jira/browse/LUCENE-6682 >> >> Steve >> www.lucidworks.com >> >>> On

Re: StandardTokenizer#setMaxTokenLength

2015-07-17 Thread Steve Rowe
Hi Piotr, Thanks for reporting! See https://issues.apache.org/jira/browse/LUCENE-6682 Steve www.lucidworks.com > On Jul 16, 2015, at 4:47 AM, Piotr Idzikowski > wrote: > > Hello. > I am developing own analyzer based on StandardAnalyzer. > I realized that tokenizer.setMaxTokenLength is called

Re: does Lucene 5 provide a direct way to do paging on search result

2015-06-22 Thread Steve Rowe
Hi solmaz, IndexSearcher.searchAfter() (several variants) can be used to do (deep) paging - here’s one of them: Steve >

Re: Wiki edit rights

2015-05-28 Thread Steve Rowe
Hi Lee, I’ve added your username to the Lucene-java wiki’s ContributorsGroup page, so you should be able to edit now. Steve > On May 28, 2015, at 6:24 PM, Lee Hinman wrote: > > > Hi Java-user mailing list, > > Please add me to the ContributorsGroup wiki page so I can edit the wiki > > wiki

Re: Request to be added to ContributorsGroup

2015-03-07 Thread Steve Rowe
Welcome Aihua, We can add you to the lucene-java wiki ContributorsGroup after you create an account there and tell us what your username is. Steve > On Mar 6, 2015, at 8:42 PM, Aihua Liu wrote: > > Hi, > I recently started to work on some Lucene related project, so I'm pretty new > to it. >

Re: Request to be added to the ContributorsGroup

2015-02-10 Thread Steve Rowe
Hi Charlie, You need to create an account on the wiki and tell us your account name. Steve > On Feb 10, 2015, at 3:46 AM, Charlie Picorini > wrote: > > Dear Lucene Team, > > Please add me to the contributorsGroup so that I can add IntraCherche which > is actually based on Lucene. > > Kind r

Re: Does StandardTokenizer remove punctuation (in Lucene 4.1)

2014-10-02 Thread Steve Rowe
break iterator used by DefaultICUTokenizerConfig also ignores punctuation. You can find its grammar at: lucene/analysis/icu/src/data/uax29/Default.rbbi Steve On Oct 1, 2014, at 4:22 PM, Paul Taylor wrote: > On 01/10/2014 18:42, Steve Rowe wrote: >> Paul, >> >>

Re: Does StandardTokenizer remove punctuation (in Lucene 4.1)

2014-10-01 Thread Steve Rowe
you think it'd be possible (read: relatively easy) to create an >> analyzer (or a modification of the standard one's lexer) so that >> punctuation is returned as a separate token type? >> >> Dawid >> >> >> On Wed, Oct 1, 2014 at 7:01 AM, Steve Row

Re: Does StandardTokenizer remove punctuation (in Lucene 4.1)

2014-09-30 Thread Steve Rowe
Hi Paul, StandardTokenizer implements the Word Boundaries rules in the Unicode Text Segmentation Standard Annex UAX#29 - here’s the relevant section for Unicode 6.1.0, which is the version supported by Lucene 4.1.0: . Only those

Re: NOTICE: Seeking Moderators for java-user@lucene

2014-09-30 Thread Steve Rowe
Please keep me on the list of moderators. My inattention this past week is temporary and non-vacation-related. - Steve On Sep 30, 2014, at 12:51 PM, Chris Hostetter wrote: > > Hey folks, > > I was on facation for the psat 7 days - 6 days ago someone sent an email > directly to the java-user

Re: Request to be added to contributor's group

2014-08-01 Thread Steve Rowe
Vitaliy, I’ve added you to the ContributorsGroup page, so you should now be able to edit. - Steve On Aug 1, 2014, at 10:37 AM, Vitaliy Verbenko wrote: > Hi Steve, > > I'm already subscribed and my username is VitaliyVerbenko > > Regards > > On 8/1/2014 5:10 P

Re: Request to be added to contributor's group

2014-08-01 Thread Steve Rowe
Hi Vitaliy, First, you should subscribe to the lucene-java wiki, and then tell us your wiki username, so that we can add you to the ContributorsGroup page, which will enable you to make edits. Steve On Aug 1, 2014, at 9:23 AM, Vitaliy Verbenko wrote: > Dear team at Apache, > > I'd like to

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Steve Rowe
On Jul 23, 2014, at 7:43 PM, Milind wrote: >>> input=esl2.gbr >>> output=[esl2.gb][r] >>> >>> This is a bug, which was fixed in Lucene 4.7 - see < > https://issues.apache.org/jira/browse/LUCENE-5391> > > BTW, I changed the POM dependency to 4.7.1, but I'm still seeing the same > output. I

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Steve Rowe
se. I'm not sure if that would work > though. Since I'm using the MultiFieldQueryParser and that takes in a > single Analyzer. > > > On Wed, Jul 23, 2014 at 3:29 PM, Steve Rowe wrote: > >> Hi Milind, >> >> On Jul 23, 2014, at 1:49 PM, Milind wrote:

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Steve Rowe
Hi Milind, On Jul 23, 2014, at 1:49 PM, Milind wrote: > The UAX29URLEmailAnalyzer analyzer in Lucene 4.4 is not working as I > expected. Is this a bug in the analyzer or is this working as designed? > > If I use the UAX29URLEmailAnalyzer, it tokenizes the following strings as >input=bwl-es

Re: Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Steve Rowe
Sign me up: sar...@gmail.com Steve On Jul 23, 2014, at 1:02 PM, Chris Hostetter wrote: > > We're doing some housekeeping of the moderators of this list, and looking for > any new folks that would like to volunteer. (we currently have 3 active > moderators, 1-2 additional mods would be helpfu

Re: Migration Lucene 3=>4: IndexSearcher.setDefaultFieldSortScoring(..)

2014-07-18 Thread Steve Rowe
Hi Christian, I found an entry about this in the 4.0-ALPHA “Changes in backwards compatibility policy” section of Lucene’s CHANGES.txt (html version): : LUCENE-3514: IndexSea

Re: ShingleAnalyzerWrapper question

2014-06-11 Thread Steve Rowe
You should give sw rather than analyzer in the IndexWriter actor. Steve www.lucidworks.com On Jun 11, 2014 2:24 AM, "Manjula Wijewickrema" wrote: > Hi, > > In my programme, I can index and search a document based on unigrams. I > modified the code as follows to obtain the results based on bigra

Re: ASCIIFoldingFilterFactory

2014-06-05 Thread Steve Rowe
Hi Michael, Questions about Solr should go to the Solr user mailing list, rather than this list, which is for Lucene users - see for how to subscribe. I’ve never heard of ASCIIFoldingExpansionFilterFactory, but ASCIIFoldingFilterFactory has a new

Re: Problem running demo

2014-04-22 Thread Steve Rowe
Hi Joe, The demo text assumes the user will download the *binary* release, which contains the prebuilt jars, rather than build those 4 jars. The source release contains a file named lucene/BUILD.txt, which contains compilation instructions (‘ant’), though it does not appear to tell the user ho

[ANNOUNCE] Apache Lucene 4.7.1 released

2014-04-02 Thread Steve Rowe
April 2014, Apache Lucene™ 4.7.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.7.1 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-

Re: Lucene 4.7 intermittently not applying query filter

2014-03-28 Thread Steve Rowe
Hi Jamie, What does EmailFilter do? Why is the expanded form "required for the UAX29URLEmailTokenizer"? Seems like an exact match would work on the email address alone, without the expanded components? Do you have an example of a query that reproducibly matches more documents than it shoul

Re: Lucene 4.7 intermittently not applying query filter

2014-03-28 Thread Steve Rowe
gt; > I am busy trying to isolate the issue, since the code is running in a wider > system among other complexities. > > Jamie > > On 2014/03/28, 4:08 PM, Steve Rowe wrote: >> Hi Jamie, >> >> What does EmailFilter do? >> >> Why is the expanded for

Re: Extending StandardTokenizer Jflex to not split on '/'

2014-02-17 Thread Steve Rowe
Sorry, Diego, the generated scanner diff doesn't tell me anything. Since I was able to successfully make changes to the open source and get the desired behavior, I'm guessing you're: a) not using the same (versions of) tools as me; b) not using the same (version of the) source as me; or c) not tes

Re: Extending StandardTokenizer Jflex to not split on '/'

2014-02-14 Thread Steve Rowe
Welcome Diego, I think you’re right about MidLetter - adding a char to it should disable splitting on that char, as long as there is a letter on one side or the other. (If you’d like that behavior to be extended to numeric digits, you should use MidNumLet instead.) I tested this by adding “/“

Re: Using Lucene to index large source code repository

2014-01-27 Thread Steve Rowe
OpenGrok uses Lucene to index large source code repositories: https://github.com/OpenGrok/OpenGrok On Jan 27, 2014, at 9:59 AM, henrik sorensen wrote: > I have just started looking at Lucene but I wanted to ask if Lucene can be > used to index large source code repository. > > Looking at the

Re: Phrase indexing and searching

2013-12-23 Thread Steve Rowe
Hi Manjula, Sounds like ShingleFilter will do what you want: < http://lucene.apache.org/core/4_6_0/analyzers-common/org/apache/lucene/analysis/shingle/ShingleFilter.html > Steve www.lucidworks.com On Dec 22, 2013 11:25 PM, "Manjula Wijewickrema" wrote: > Dear All, > > My Lucene programme is abl

Re: Unable to find JAR for lucene-contrib

2013-09-11 Thread Steve Rowe
Hi Abhinav, There never has been a lucene-contrib jar - instead, each so-called contrib (as of v4.0, these are instead referred to as modules) is packaged as its own jar. Is there some contrib/module/feature/class in particular you're looking for? Steve On Sep 10, 2013, at 10:31 PM, Abhinav

[ANNOUNCE] Apache Lucene 4.4 released

2013-07-23 Thread Steve Rowe
July 2013, Apache Lucene™ 4.4 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.4 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text sea

Re: Request for addition of ThomasMurphy to ContributorsGroup

2013-06-04 Thread Steve Rowe
I added ThomasMurphy to the lucene-java ContributorsGroup wiki page. - Steve On Jun 4, 2013, at 1:33 PM, Thomas R. Murphy wrote: > Hello. I, ThomasMurphy on the wiki, would like to be a member of > ContributorsGroup. - To unsu

Re: requesting to be added to the ContributorsGroup wiki page.

2013-04-19 Thread Steve Rowe
Lukas, this seems weird to me: why are you, Lukas Fedorowicz, asking for permission for "MartinSchmidt" to edit the wiki? Why not "LukasFedorowicz"? Steve On Apr 19, 2013, at 4:15 AM, Lukas Fedorowicz wrote: > Please add the user "MartinSchmidt" to the ContributorsGroup >

Re: [ANNOUNCE] Wiki editing change

2013-03-25 Thread Steve Rowe
g thing. Steve On Mar 25, 2013, at 4:01 PM, Erick Erickson wrote: > Steve: > > Where are you finding the logons? I meant to help, not cause more work, > maybe I can do a better job if I do the right thing ... > > > On Mon, Mar 25, 2013 at 3:24 PM, Steve Rowe wrote:

Re: [ANNOUNCE] Wiki editing change

2013-03-25 Thread Steve Rowe
w to contributors >> Tom Burton-West >> tburtonw at umich dot edu >> >> Tom >> >> On Mon, Mar 25, 2013 at 9:05 AM, Steve Rowe wrote: >> >>> >>> On Mar 25, 2013, at 8:49 AM, Rafał Kuć wrote: >>>> Could you add RafalKuc to cont

Re: [ANNOUNCE] Wiki editing change

2013-03-25 Thread Steve Rowe
On Mar 25, 2013, at 8:49 AM, Rafał Kuć wrote: > Could you add RafalKuc to contributors ? Thanks :) Added to ContributorsGroup. On Mar 25, 2013, at 8:47 AM, Adrien Grand wrote: > Can you add 'jpountz' to the ContributorsGroup? Thank you! Added to ContributorsGroup.

Re: [ANNOUNCE] Wiki editing change

2013-03-25 Thread Steve Rowe
On Mar 25, 2013, at 4:09 AM, Simon Willnauer wrote: > please add me to the list "simonwillnauer" Added to AdminGroup. On Mar 25, 2013, at 4:42 AM, Andrzej Bialecki wrote: > Please add AndrzejBialecki to the ContributorsGroup. Thanks! Added to AdminGroup.

[ANNOUNCE] Wiki editing change

2013-03-24 Thread Steve Rowe
The wiki at http://wiki.apache.org/lucene-java/ has come under attack by spammers more frequently of late, so the PMC has decided to lock it down in an attempt to reduce the work involved in tracking and removing spam. From now on, only people who appear on http://wiki.apache.org/lucene-java/Co

Re: Migrating SnowballAnalyzer to 4.1

2013-03-15 Thread Steve Rowe
Hi Robert, On Mar 15, 2013, at 11:29 AM, Robert Muir wrote: > 2013/2/28 Steve Rowe : >> EnglishAnalyzer has used PorterStemmer instead of the English Snowball >> stemmer since it was created in 2010 as part of LUCENE-2055[2]. I think >> this is an oversight: EnglishAnalyz

Re: Getting documents from suggestions

2013-03-14 Thread Steve Rowe
Hi Bratislav, LUCENE-4517 sounds like what you want: : "Suggesters: allow to pass a user-defined predicate/filter to the completion searcher" There's a patch there, against Lucene trunk from about 5 months ago, so if you want to give it a try

Re: Loading lucene_solr_4_1_0 into IntelliJ

2013-03-05 Thread Steve Rowe
> > Thanks for all your help. > > Cheers, > > - Chris > > > > > > -Original Message- > From: Steve Rowe > To: java-user@lucene.apache.org > Sent: Tue, 5 Mar 2013 15:23 > Subject: Re: Loading lucene_solr_4_1_0 into IntelliJ > >

Re: Loading lucene_solr_4_1_0 into IntelliJ

2013-03-05 Thread Steve Rowe
Hi Chris, Those steps sound correct to me. On Mar 5, 2013, at 9:58 AM, Chris Bamford wrote: > Thanks for all your help here. I just tried it all again and this time I get > "Cannot Open Project /Users/cbamford/projects/lucene_solr_4_1_0 contains no > IntelliJ IDEA project" when I do File > O

Re: Migrating SnowballAnalyzer to 4.1

2013-02-28 Thread Steve Rowe
p. One more question: > Is EnglishAnalyzer a drop-in replacement for SnowballAnalyzer("English", > ...), in terms > of stemming? > > > Thanks again > Peng > > PS > Sorry for the Thread Hijacking. Will behave the next time. > >> -Original Message

Re: Migrating SnowballAnalyzer to 4.1

2013-02-28 Thread Steve Rowe
Hi Peng, Take a look at the release docs: In particular, in the API Javadocs section, the analyzers-common documentation has a large list of per-language analyzers. EnglishAnalyzer is under the org.apache.lucene.analysis.en package:

Re: More questions on BlockJoinQuery

2013-02-28 Thread Steve Rowe
Sorry, I meant to say "in the directory navigation dialog that comes up, choose the *directory* containing Lucene and Solr (*not* a proejct file)". - Steve On Feb 28, 2013, at 9:22 AM, Steve Rowe wrote: > Chris, > > You shouldn't use File > New Project, which wi

Re: More questions on BlockJoinQuery

2013-02-28 Thread Steve Rowe
wo, just not sure what! (My Project SDK is > correctly set to java 1.6.) > > Please can someone tell me what I need to do... > > Thanks > > - Chris > > > > > > > > > -Original Message- > From: Steve Rowe > To: java-user@lucen

Re: More questions on BlockJoinQuery

2013-02-20 Thread Steve Rowe
last step fails with: > > Buildfile: /Users/cbamford/projects/lucene-4.1.0/build.xml > > BUILD FAILED > Target "idea" does not exist in the project "lucene". > > Total time: 0 seconds > > > What have I done wrong? > > Thanks! > >

Re: More questions on BlockJoinQuery

2013-02-20 Thread Steve Rowe
Hi Chris, This mailing list is fine for discussing IntelliJ and Maven issues as they relate to Lucene. You'll need Ant v1.8.2+ to bootstrap things. 'ant idea' at the top level will produce an IntelliJ project you can open - see for mo

Re: Wildcard in a text field

2013-02-08 Thread Steve Rowe
Hi Nicolas, For trailing '*' only: On the query side, you can use a front-side EdgeNGramTokenFilter with a large max gram size, followed by a PatternReplaceFilter with pattern "(.*)" and replacement "$1*". Steve On Feb 8, 2013, at 10:14 AM, Nicolas Roduit wrote: > For instance, I have a lis

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread Steve Rowe
Hi saisantoshi, Check out the documentation: - particularly the "File Formats" link under "Reference Documents". Steve On Jan 24, 2013, at 11:41 AM, saisantoshi wrote: > Is there any doc on how many files that lucene generates during indexing >

Re: IndexWriter.optimize() is removed in 4.0?

2013-01-23 Thread Steve Rowe
Hi Sai, Check out Simon Willnauer's blog post about this: Steve On Jan 23, 2013, at 4:49 PM, saisantoshi wrote: > There is no optimize() method in 4.0. I looked at the 3.6 docs and it did > mention the followin

[ANNOUNCE] Apache Lucene 4.1 released

2013-01-22 Thread Steve Rowe
January 2013, Apache Lucene™ 4.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.1. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text

Re: Lucene release source code

2013-01-14 Thread Steve Rowe
Hi Igor, You can find all recent released source code under , e.g. Alternatively, each release has a source distribution. For v4.0.0, click on the Download buttons for 4.0 fro

Re: Upgrade Lucene to latest version (4.0) from 2.4.0

2013-01-09 Thread Steve Rowe
Of course you're free to do as you like - who will stop you? :) The problem is the lack of a single place to look for detailed guidance on handling a long-distance upgrade like that. But it's difficult to generalize here: the possible range in the level of difficulty involved is vast, depending

Re: Upgrade Lucene to latest version (4.0) from 2.4.0

2013-01-09 Thread Steve Rowe
I don't think there is a migration guide from 2.X to 3.X, other than the specific information in the release notes. If you start reading CHANGES.txt at version 3.0.0, and then each later release's notes after that, especially the sections "Changes in backwards compatibility policy", e.g. for 3.

Re: Upgrade Lucene to latest version (4.0) from 2.4.0

2013-01-09 Thread Steve Rowe
Sai, For the transition from 2.X to 3.X, I recommend compiling your code against the latest 2.9.X version (2.9.4), looking at the deprecation messages, and making changes until these are all addressed and compilation no longer produces deprecation messages. Once that's done, your code should c

Re: Is StandardAnalyzer good enough for multi languages...

2013-01-08 Thread Steve Rowe
y has been enhanced in to-be-released Lucene/Solr 4.1: <https://issues.apache.org/jira/browse/SOLR-4123> - you can provide per-script RuleBasedBreakIterator specification files at runtime. On Jan 9, 2013, at 12:12 AM, Trejkaz wrote: > On Wed, Jan 9, 2013 at 10:57 AM, Steve Rowe wrote: >&g

Re: Cannot instantiate SPI class

2013-01-08 Thread Steve Rowe
ath instead? it doesn't make much sense to me as they both should use > the same class loader. > > unless of course, Lucene 4 is using a different class-loader to load these > classes. (does it?) > > as an aside -- Lucene 3.6 was running properly in that same environment

Re: Cannot instantiate SPI class

2013-01-08 Thread Steve Rowe
Hi Igal, Sounds like you don't have lucene-codecs-4.0.0.jar in Railo's classpath. Steve On Jan 8, 2013, at 10:53 PM, Igal @ getRailo.org wrote: > I'm trying to access Lucene4 from Railo (an open-source application server) > > when I try to create an IndexWriterConfig I get the error: Cannot

Re: Is StandardAnalyzer good enough for multi languages...

2013-01-08 Thread Steve Rowe
Trejkaz (and maybe Sai too): ICUTokenizer in Lucene's icu module may be be of interest to you, along with the token filters in that same module. - Steve On Jan 8, 2013, at 6:43 PM, Trejkaz wrote: > On Wed, Jan 9, 2013 at 6:30 AM, saisantoshi wrote: >> DoesLucene StandardAnalyzer work for all

Re: how to add attributes to a field just like term's payload ?

2013-01-06 Thread Steve Rowe
Hi wiggify, Lucene doesn't have direct support for what you want. However, you can store a custom map in the index when you commit: < https://lucene.apache.org/core/4_0_0/core/org/apache/lucene/index/IndexWriter.html#commit%28java.util.Map%29 >. It will be your responsibility to associate that i

Re: TokenStream: How to get token text?

2012-12-25 Thread Steve Rowe
Hi Dima, Did you see my response to your earlier email? I think it's what you're looking for: http://markmail.org/message/jdcjxauj4odyuv7e Steve On Dec 25, 2012, at 1:17 PM, dokondr wrote: > Hello, > Please, help. I am lost in TokenStream / Token / Analyzer API. > I am trying to figure out

Re: Lucene 4.0 scalability and performance.

2012-12-23 Thread Steve Rowe
Hi Vitaly, Anything by Tom Burton-West should interest you - he works on the HathiTrust digital library project , which currently indexes 7TB of full-length books, e.g.: "Practical Relevance Ranking for 10 Million Books" (paper) INEX 2012, September 2012, Rome, Italy

  1   2   >