Hello
Is it possible to exclude numbers using StandardAnalyzer just like
SimpleAnalyzer?
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Hello
I have the following code to highlight text:
public String highlight(String text, String query ) throws IOException {
TermQuery query = new TermQuery(new Term("f", query));
QueryScorer scorer = new QueryScorer(query);
SimpleHTMLFormatter formatter =
new SimpleHTMLForm
Hi
Sorry about the double posting; it was a mistake.
I am working on an application that searches documents with Lucene.
The documents are taken from a database and the basic field is in HTML.
The index and searching works fine with Greek characters. I am using
the lucene to take the ID a
Hi
Is it possible to highlight more than one term with Highlighter, but
with a different style for each term?
1st term with SimpleHTMLFormatter("", "");
2nd term with SimpleHTMLFormatter("", "");
..
n-th term with SimpleHTMLFormatter("", "");
or for the following code
SimpleHTMLFormatter
Hello
I am using Highlighter to highlight query terms in documents retrieved
from a database after a Lucene search finds them.
My problem is that when I display the full document, the highlighter works
fine for most documents, but if the document is huge the highlighter
returns only a part of documen
it works!! Thanks
I believe Highlighter.setMaxDocBytesToAnalyze(int byteCount) should be used
for this.
On Mon, Aug 11, 2008 at 11:40 AM, <[EMAIL PROTECTED]> wrote:
> Hello
>
> I am using Highlighter to highlight query terms in documents retrieved from a
> database after a Lucene search finds them.
I work out where to place the brackets if the query was something
like:
Marketing AND Smith OR Davies OR Management OR Business AND
Science?
Can anyone suggest an effective way to group clauses so that the order of
Boolean precedence is preserved?
Thanks very much.
Jim
Thanks Ian. How would you achieve the logic of the query below using
BooleanQuery and BooleanClause.Occur? How would you achieve the grouping
effect?
(Marketing AND Smith) OR Davies
Thanks a lot.
Jim
On 3 August 2011 14:54, Ian Lea wrote:
> I don't think there is an easy way. Brac
But the query parser doesn't seem to do that for me with the input Marketing
AND Smith OR Davies. The query parser gives me 3 clauses. 1 must clause for
the term Marketing, 1 must clause for the term Smith and 1 should clause for
the term Davies. e.g. +Marketing +Smith SHOULD Davies. What I would l
es
I want my program to work out that this should be grouped as the following
(as AND has higher precedence than OR):
(Marketing AND Smith) OR Davies.
I'm effectively looking for an algorithm that will properly group any number
of terms..
Thanks
Jim
On 4 August 2011 16:47, Chris
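A minimal sketch of the grouping asked about above, in plain Java with no Lucene dependency. It rests on assumptions that are not from the thread: the input is a flat, well-formed expression of single-word terms separated by uppercase AND / OR, with no NOT and no existing brackets. The class name PrecedenceGrouper is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: brackets a flat expression so that AND binds tighter than OR. */
final class PrecedenceGrouper {

    /** Returns the expression with every multi-term AND run wrapped in brackets. */
    static String group(String expression) {
        List<List<String>> orOperands = new ArrayList<>(); // one entry per OR operand
        List<String> andRun = new ArrayList<>();           // terms joined by AND so far

        for (String token : expression.trim().split("\\s+")) {
            if (token.equals("OR")) {
                // OR has the lowest precedence, so it closes the current AND run.
                orOperands.add(andRun);
                andRun = new ArrayList<>();
            } else if (!token.equals("AND")) {
                // Anything that is not an operator is treated as a term (assumption).
                andRun.add(token);
            }
        }
        orOperands.add(andRun);

        List<String> parts = new ArrayList<>();
        for (List<String> run : orOperands) {
            String joined = String.join(" AND ", run);
            // Brackets are only needed when an AND run sits next to an OR.
            boolean needsBrackets = run.size() > 1 && orOperands.size() > 1;
            parts.add(needsBrackets ? "(" + joined + ")" : joined);
        }
        return String.join(" OR ", parts);
    }
}
```

For the query from the thread, PrecedenceGrouper.group("Marketing AND Smith OR Davies") returns "(Marketing AND Smith) OR Davies". In BooleanQuery terms, each bracketed group would presumably become a nested BooleanQuery whose clauses are all MUST, added to the outer query as a SHOULD clause.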
Brilliant, that looks perfect. We're currently using an older version of
Lucene in which this was an experimental class. Looks like we should
upgrade.
Thanks
Jim
On 5 August 2011 02:10, Trejkaz wrote:
> On Fri, Aug 5, 2011 at 1:57 AM, Jim Swainston
> wrote:
> > So if
possible that the mirror you are using may
not have replicated the release yet. If that is the case, please try
another mirror. This also applies to Maven access.
Thanks,
Jim Ferenczi
27 March 2017 - Apache Lucene™ 6.5.0 available
The Lucene PMC is pleased to announce the release of Apache Lucene 6.5.0.
Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires full
27 April 2017, Apache Lucene™ 6.5.1 available
The Lucene PMC is pleased to announce the release of Apache Lucene 6.5.1.
Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires full-
15 January 2018, Apache Lucene™ 7.2.1 available
The Lucene PMC is pleased to announce the release of Apache Lucene 7.2.1.
Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires fu
24 September 2018, Apache Lucene™ 7.5.0 available
The Lucene PMC is pleased to announce the release of Apache Lucene 7.5.0.
Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires
11 February 2019, Apache Lucene™ 7.7.0 available
The Lucene PMC is pleased to announce the release of Apache Lucene 7.7.0.
Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires f
14 March 2019, Apache Lucene™ 8.0.0 available
The Lucene PMC is pleased to announce the release of Apache Lucene 8.0.0.
Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires full
I vote option A (PMC vote)
On Wed, 17 June 2020 at 14:24, Felix Kirchner <
felix.kirch...@uni-wuerzburg.de> wrote:
> A
>
> non-PMC
>
> On 16.06.2020 at 00:08, Ryan Ernst wrote:
> > Dear Lucene and Solr developers!
> >
> > In February a contest was started to design a new logo for Lucene [1].
>
A1 (binding)
On Thu, 3 Sept. 2020 at 07:09, Noble Paul wrote:
> A1, A2, D binding
>
> On Thu, Sep 3, 2020 at 7:22 AM Jason Gerlowski
> wrote:
> >
> > A1, A2, D (binding)
> >
> > On Wed, Sep 2, 2020 at 10:47 AM Michael McCandless
> > wrote:
> > >
> > > A2, A1, C5, D (binding)
> > >
> > > Than
I would like to be able to handle the following:
"/\d\d\d{4} \\d\\d/ office"
Where / indicates a regex expression phrase.
One option is extending MultiFieldQueryParser and catching the phrase within
getFieldQuery evaluating whether /, the regex identifier, is present and
then returning a SpanNe
By custom phrase query class I was trying to ask if it would be possible, or
even a good idea, to create a modified PhraseQuery class that is more
efficient that span queries (as I only want to use it for phrases). This
class might have multiple possible terms generated from a regex at a certain
po
Thanks for the advice Chris. What I am working on now is extracting the
matching phrases. The current code for MultiPhraseQuery and SpanQueries
just return all matching terms, not matching phrases. I implemented some
code matching up the TermPositions, but this is pretty slow. Is there any
way
as a guide. If someone could show me how to add and retrieve fields
(assuming that is the correct way to go), I would greatly
appreciate it.
thanks in advance jim s
Thanks to Karl and Donna, I followed your suggestions and was able to get a
test driver (modified demo code) working, thanks again.
jim s
added but update the
master index on the main application host.
I read blurbs about merge and think maybe I might be able to do something
with that, but I am unsure how that would work or if there is another way.
Any ideas or suggestions would be very much appreciated.
"Lucene has no concept of "document identity" in that you can index
the same document 15 times in a row and Lucene will have 15 entries. "
Is this true? Whenever I run the demo indexing logic, documents that are already
indexed are skipped. What am I missing?
OK I get it, thanks for the info, just told my boss the opposite, guess I
will have to send another email.
thanks again
jim s
- Original Message -
From: "Erick Erickson" <[EMAIL PROTECTED]>
To:
Sent: Saturday, April 21, 2007 8:32 AM
Subject: Re: Merging Indeces
aster index or index locally and then merge
a tiny index into the master index?
Thanks to any and all that take the time to advise me.
jim s.
Does anyone know how to add an OpenOffice document to a Lucene index? Is there a
parser for OpenOffice?
thanks in advance
jim s.
Is it possible to index CAD formats such as AutoCAD or CGM? I know some
commercial products (Excalibur) claim to be able to do that. If so, what about
TIFF?
thanks
jim s
thank you for the reply, I knew the answer but was compelled to ask anyway.
CAD files like AutoCad/ProE/CaTia do contain some useful text and it is
possible to get at that and
index it. But mostly it's vectors, and there is not much a text engine can do
with vectors.
thanks again.
here. It is kind of important that I learn how to
add different file types like OO or AutoCAD, so we can make a build (with
Lucene) or buy call.
Thanks to all that try to help me out
Jim S
P.S. If I get it working I will be happy to post the code.
code up a Reader that just spits out "Here I am" a few
hundred times and see what happens. LOL.
thank you for the reply and advice.
jim s
- Original Message -
From: "Andrzej Bialecki" <[EMAIL PROTECTED]>
To:
Sent: Friday, May 25, 2007 1:10 PM
Subject: R
r easy method to extract
text from a PDF file, I would certainly like to know. I can live without
OpenOffice (for a while), but not being able to index PDF would be a Lucene
show stopper.
thanks
jim s
e and
got
java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser
Googling to try to find the jar that contains AFMParser.
thanks for the help
jim s
- Original Message -
From: "Chris Hostetter" <[EMAIL PROTECTED]>
To:
Sent: Wednesday, June 06, 2007 5:33 PM
Subject: Re: Indexing PDF document
:
OK, thanks. I found the FontBox jar on the net, but now I see the jars included with
PDFBox. I expected them to be in /lib, not /external; my bad. Thanks again.
jim s
- Original Message -
From: "Ben Litchfield" <[EMAIL PROTECTED]>
To:
Sent: Wednesday, June 06, 2007 6:1
I am trying to index msword documents. I’ve got things working but I do not
think I am doing things properly.
To index msword docs I use an extractor to extract the text. Then I write
the text to a .txt file and index that using an HTMLDocument object. Seems
to me that since I have the text
many thanks I will try that, thanks again!
jim s
- Original Message -
From: "Donna L Gresh" <[EMAIL PROTECTED]>
To:
Sent: Friday, June 08, 2007 12:52 PM
Subject: Re: Indexing MSword Documents
I do this exact thing. "text" (the second input to the Field co
taking the time to reply
jim s
- Original Message -
From: "Mathieu Lecarme" <[EMAIL PROTECTED]>
To:
Sent: Friday, June 08, 2007 12:48 PM
Subject: Re: Indexing MSword Documents
Why not use Document?
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightl
Thanks, the approach you and Donna Gresh suggested worked out fine. I now have
a much better understanding of the Document class.
Here is the create-Document code in case another newbie is interested. As
more MIME types are added I will expand the if
thanks again
jim s
public class
a bitset (you do this by
traversing the index directly for each word). Call these bitsets WBS(n)
(word bit set n. n spans 1..m where m is the total number of words you want
to count against)
Finally: to get a count per word bit-wise AND each WBS(n) with FRSBS and count
up the 1s
Jim Powers
On Sun
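The final step described above can be sketched with java.util.BitSet. This is an illustration only: the class and method names (WordCounts, countPerWord) are hypothetical, each bit position is assumed to stand for one document, and building the bit sets from the index is not shown.

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper for the per-word counting step. */
final class WordCounts {

    /**
     * For each word, ANDs its bit set (WBS(n) in the message) with the
     * result bit set (FRSBS in the message) and counts the bits left set.
     */
    static Map<String, Integer> countPerWord(Map<String, BitSet> wordBitSets, BitSet resultBitSet) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Map.Entry<String, BitSet> entry : wordBitSets.entrySet()) {
            // BitSet.and() modifies the receiver, so work on a copy.
            BitSet intersection = (BitSet) entry.getValue().clone();
            intersection.and(resultBitSet);
            // cardinality() is the number of 1 bits, i.e. matching documents.
            counts.put(entry.getKey(), intersection.cardinality());
        }
        return counts;
    }
}
```

Cloning each word bit set keeps the originals reusable across result sets, at the cost of one temporary copy per word.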
mmon-build.xml, and build-deprecated.xml??
Jim
= new IndexSearcher(current.getDirectory());
}
}
Jim Wilson
Colorado Springs, CO
719-266-4431 (Home)
719-661-6768 (Cell)
[EMAIL PROTECTED]
IM:jwilsonsprings
Registered Linux User # 302849
o the
IndexWriter constructor would cause it to remove the old index files once
it had created the new ones.
Have I misunderstood something? Or done something incorrect?
Thanks.
--Jim
=====
Jim Coble
Head, Instructional Technology Applicat
Hi,
I am new to lucene. I was following the example, IndexFiles, from the
Lucene demo package. However, one thing I find surprising is that the
write.lock file is left over, even when the IndexWriter#close() method is
called. Below are details.
Environment:
OS X Yosemite
java version "1.8.0_72"
Jav
Never mind, folks. Just found an answer:
http://www.gossamer-threads.com/lists/lucene/java-user/110708
On Sun, May 22, 2016 at 10:20 AM, Jianbao (Jim) Tao
wrote:
> Hi,
>
> I am new to lucene. I was following the example, IndexFiles, from the
> lucene demo package. However, one
yone point me in the right direction?
Thanks
Jim
Thanks for your prompt response! You ask about the use case. We have a series
of similar intranet sites, each represented by a separate tomcat application
instance using the same code base but with different start-up parameters. The
intranets all provide a common search function based on the sam
: Setting the COMMIT lock timeout.
On Tuesday 14 March 2006 10:52, Jim Bedford-roberts wrote:
> Admittedly we could have developed a single central search component,
> but given the way the code has evolved our current approach is simplest
> for us. With separate application instances shari
I'm still not clear
about how best to employ the codec. Has anyone had any experience with it?
Thanks for any and all insights.
Best regards,
Jim Beale
The information contained in this email message, including any attachments, is
intended solely for use by the individual or entity named abo
't find anything useful on the
usual places.
I suppose I could setup wireshark to see what is happening but I'm hoping that
someone has a better suggestion.
Thanks in advance for any help!
Best regards,
Jim Beale
hibu.com
2201 Renaissance Boulevard, King of Prussia, PA, 19406
Office: 610
r the full cases.
Thanks for any info/guidance.
Cheers,
Jim Beale
Lead Developer
Hibu.com
our log files.
I have to admit that I didn't understand what you meant by query enrichment.
Where can I find more information about that?
Thanks!
Jim Beale
-Original Me
Oh, sorry!
Jim Beale
-Original Message-
From: Chris Hostetter [mailto:hossman_luc...@fucit.org]
Sent: Tuesday, July 23, 2013 1:42 PM
To: java-user@lucene.apache.org
Subject: Re