Big problem with solr in an official server.

2010-04-19 Thread Ariel
ory in the server and it is considerably affecting the performance of the server. Could you help me, please? Thanks in advance. Regards Ariel

How to delete documents from an index and how to reset de remote multisearcher so the deleted docs not being shown in the search results ???

2009-09-11 Thread Ariel
deleted don't appear in the search results when they are deleted ??? I hope you can help me. Regards Ariel

Problem with ranking in lucene

2009-04-09 Thread Ariel
and I am searching with MultifieldQuery in both fields, title and content. I am using the same analyzer for indexing and searching. I hope you can help me. Thanks in advance. Regards Ariel

How Can I make an analyzer that ignore the numbers o the texts ???

2009-04-08 Thread Ariel
Hi everybody: I would like to know how I can make an analyzer that ignores the numbers in the text, the same way stop words are ignored. For example, the terms 3.8, 100, 4.15 and 4,33 should not be added to the index. How can I do that? Regards Ariel
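
A minimal sketch of the filtering rule asked about above, in plain Java with no Lucene dependency. It only shows how the numeric terms from the example (3.8, 100, 4.15, 4,33) could be recognised and dropped from a token list. The class and method names are hypothetical; in a real index the same test would presumably live inside a custom token filter wrapped by the analyzer, which is an assumption and not something stated in the thread.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical helper: drops purely numeric tokens, the way stop words are dropped. */
public class NumericTokenFilterSketch {

    // Digits with optional '.' or ',' separated digit groups: 100, 3.8, 4.15, 4,33.
    // Mixed tokens such as "b2b" do not match and are therefore kept.
    private static final Pattern NUMERIC = Pattern.compile("[0-9]+([.,][0-9]+)*");

    /** Returns true when the whole token is a number in the sense above. */
    public static boolean isNumeric(String token) {
        return NUMERIC.matcher(token).matches();
    }

    /** Returns a new list containing only the non-numeric tokens, in order. */
    public static List<String> dropNumericTokens(List<String> tokens) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!isNumeric(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("rate", "3.8", "of", "100", "4.15", "4,33", "b2b");
        System.out.println(dropNumericTokens(tokens));
    }
}
```

The pattern is deliberately strict: a token is dropped only if it is a number as a whole, so terms that merely contain digits stay searchable.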

How can I change that lucene use by default the AND operator between terms ???

2009-04-08 Thread Ariel
When I do a search, Lucene internally uses the OR operator between terms by default. How can I make Lucene use the AND operator between terms by default? Regards Ariel

Re: How to search a phrase using quotes in a query ???

2009-04-07 Thread Ariel
pFilter, LowerCaseFilter, StandardFilter and StandardTokenizer. So I don't know why, when I do a search like "the bank of america", the search results don't return the documents that have the exact phrase "the bank of america". Could you help me, please? Regards A

Re: How to search a phrase using quotes in a query ???

2009-04-06 Thread Ariel
Well, I have Luke for Lucene, and the index has been built fine. The field where I am searching is the content field. I am using the same analyzer at query time and at indexing time: the SnowBall English Analyzer. I will submit the code snippet later. Regards Ariel On Mon, Apr 6, 2009 at 4:37 PM, Erick

How to search a phrase using quotes in a query ???

2009-04-06 Thread Ariel
zer for English ??? Could you help me with this, please? Regards Ariel

Re: How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Ariel
How can I see the senses of a word with WordNet? And how could I select the most popular ones? Is there a way to make queries that ignore the synonyms I have added to the index? I hope you can help me. Regards Ariel On Wed, Feb 4, 2009 at 7:46 PM, Manu Konchady wrote: > > > > -

Re: How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Ariel
nd code > examples, because your question is unanswerable as it > stands. > > Best > Erick > > On Wed, Feb 4, 2009 at 3:27 PM, Ariel wrote: > > > Hi every body: > > > > I am using wordnet to index my document taking in account the synonyms > > with

How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Ariel
nd bird are not synonyms, then why are the documents that contain "bird" retrieved? Could you help me solve that problem? How do you index your documents using WordNet? Thanks in advance. Regards Ariel

Re: Default and optimal use of RAMDirectory

2009-01-05 Thread Ariel
Do you mean that the people who think that using RAMDirectory is going to speed up the indexing process are wrong? On Sun, Dec 21, 2008 at 10:22 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > Let me add to that that I clearly recall having a hard time getting the > tests for th

Re: How to search documents taking in account the dates ???

2008-12-18 Thread Ariel
Thank you, it works very well. Regards Ariel On Thu, Dec 18, 2008 at 8:22 AM, Erick Erickson wrote: > Use the setSort that takes an array of Sort objects... > > On Thu, Dec 18, 2008 at 8:11 AM, Ariel wrote: > > > What I am doing is this: > > > >

Re: How to search documents taking in account the dates ???

2008-12-18 Thread Ariel
What I am doing is this: Sort sort = new Sort(); sort.setSort("year", true); hits = searcher.search(pquery,sort); How must I write my code to sort first by date and then by score? Greetings Ariel On Thu, Dec 18, 2008 at 4:48 AM, Ian Lea wrote:
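
The ordering asked for above (by year first, then by score within the same year) can be sketched in plain Java as a two-level comparator. This is only an illustration of the intended result order, not Lucene code: the Hit class and all other names are hypothetical stand-ins. In Lucene itself the follow-up reply in this thread points to the setSort overload that takes an array. In the 2.x API that array holds SortField objects, so something along the lines of a "year" sort field followed by SortField.FIELD_SCORE is a plausible starting point, to be checked against the version in use.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

/** Hypothetical illustration of "sort by year, then by score"; not a Lucene class. */
public class YearThenScoreSketch {

    /** Stand-in for a search hit carrying the two sort keys. */
    public static final class Hit {
        public final String id;
        public final int year;
        public final float score;

        public Hit(String id, int year, float score) {
            this.id = id;
            this.year = year;
            this.score = score;
        }
    }

    // Newest year first (mirrors the reverse=true flag in the question);
    // hits from the same year are ordered by descending score.
    public static final Comparator<Hit> YEAR_THEN_SCORE =
            Comparator.comparingInt((Hit h) -> h.year).reversed()
                    .thenComparing(Comparator.comparingDouble((Hit h) -> h.score).reversed());

    /** Returns the hit ids in sorted order without modifying the input list. */
    public static List<String> sortedIds(List<Hit> hits) {
        List<Hit> copy = new ArrayList<>(hits);
        copy.sort(YEAR_THEN_SCORE);
        List<String> ids = new ArrayList<>();
        for (Hit h : copy) {
            ids.add(h.id);
        }
        return ids;
    }

    public static void main(String[] args) {
        List<Hit> hits = Arrays.asList(
                new Hit("a", 2002, 0.5f),
                new Hit("b", 2003, 0.2f),
                new Hit("c", 2003, 0.9f),
                new Hit("d", 2002, 0.7f));
        System.out.println(sortedIds(hits));
    }
}
```

The score only breaks ties between hits of the same year, which is the behaviour the question describes as missing when sorting by year alone.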

Re: How to search documents taking in account the dates ???

2008-12-17 Thread Ariel
Hi: This solution has a problem. The results are sorted by the year criterion, but I need them to be sorted by the scoring criterion too, after they are sorted by year. How can I do this? I hope you can help me. Greetings Ariel On Wed, Nov 19, 2008 at 5:28 PM, Erick Erickson wrote: > W

Re: I would want to know more about the lucene implementation in C++

2008-12-08 Thread Ariel
h > > > > - Original Message > > From: Ariel <[EMAIL PROTECTED]> > > To: lucene user > > Sent: Tuesday, December 2, 2008 2:13:08 PM > > Subject: I would want to know more about the lucene implementation in C++ > > > > Hi everybody: >

I would want to know more about the lucene implementation in C++

2008-12-02 Thread Ariel
Hi everybody: I have seen that the Lucene project for C++ has been abandoned. Could you tell me if there is another similar implementation of Java Lucene in C++?

Re: How to search documents taking in account the dates ???

2008-11-19 Thread Ariel
s. I hope you can help me. Regards Thanks in advance Ariel On Wed, Nov 19, 2008 at 3:58 PM, Ian Lea <[EMAIL PROTECTED]> wrote: > Are you using one of the search methods that includes sorting? If > not, then do. If you are, then you need to tell us exactly what you > are doin

Re: How to search documents taking in account the dates ???

2008-11-19 Thread Ariel
it is supposed that Lucene does a lexicographic sort, but this is not happening. Could you tell me what I'm doing wrong? I hope you can help me. Regards On Wed, Nov 19, 2008 at 11:56 AM, Ariel <[EMAIL PROTECTED]> wrote: > Thanks, that was very helpful, but I have a question w

Re: How to search documents taking in account the dates ???

2008-11-19 Thread Ariel
sort criteria. How could I fix that problem ??? Greetings Ariel On Wed, Nov 19, 2008 at 11:09 AM, Ian Lea <[EMAIL PROTECTED]> wrote: > Hi - sounds like you need a range query. > > http://lucene.apache.org/java/2_3_2/queryparsersyntax.html#Range%20Searches > > > -- > Ian.

How to search documents taking in account the dates ???

2008-11-19 Thread Ariel
after a year, for example documents before the year 2002 or after the year 2003. Is it possible to do that with Lucene? Regards Ariel

What is the percent of size of lucene's index ?

2008-07-23 Thread Ariel
? Is there any official Apache Lucene document that says that? I hope somebody can help me. Thanks. Ariel

Re: How to make documents clustering and topic classification with lucene

2008-07-07 Thread Ariel
Hi everybody: Do you have any idea how to do document clustering and topic classification using Lucene? Is there any way to do this? Please, I need help. Thanks everybody. Ariel

How to make documents clustering and topic classification with lucene

2008-07-07 Thread Ariel
Hi everybody: Do you have any idea how to do document clustering and topic classification using Lucene? Is there any way to do this? Please, I need help. Thanks everybody. Ariel

Re: boosting relevance of certain documents

2008-04-25 Thread Jonathan Ariel
n Fri, Apr 25, 2008 at 2:30 PM, Daniel Freudenberger < [EMAIL PROTECTED]> wrote: > I'm using the StandardAnalyzer - hope this answers your question (I'm > quite > new to the lucene thing) > > -Original Message- > From: Jonathan Ariel [mailto:[EMAIL PROTECTED]

Re: boosting relevance of certain documents

2008-04-25 Thread Jonathan Ariel
How are you analyzing the searchable field? On Fri, Apr 25, 2008 at 12:49 PM, Daniel Freudenberger < [EMAIL PROTECTED]> wrote: > Hello, > > > > I'm using lucene within a new project and I'm not sure about how to solve > the following problem: My index consists of the two attributes "id" and > "se

MoreLikeThis patch to support boost factor

2008-04-23 Thread Jonathan Ariel
This is a patch I made to be able to boost the terms with a specific factor besides the relevancy returned by MoreLikeThis. This is helpful when having more than one MoreLikeThis in the query, so words in the field A (e.g. Title) can be boosted more than words in the field B (e.g. Description). Any f

Re: MoreLikeThis over a subset of documents

2008-04-23 Thread Jonathan Ariel
hange MoreLikeThis to receive a set of term frequencies, instead of an IndexReader, and use that to do all the processing. Does anyone know if a document contains the term frequencies for its fields? On Wed, Apr 23, 2008 at 7:46 AM, Karl Wettin <[EMAIL PROTECTED]> wrote: > Jonathan Ariel wrote: >

Re: MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
in category A, only add the text to the > catA field. Now do MoreLikeThis on catA. This assumes you know the > categories at index time, of course. > Redundant but workable. > > -Glen > > 2008/4/22 Jonathan Ariel <[EMAIL PROTECTED]>: > > Is there any way to execute a M

Re: MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
I could have up to 2 million documents and growing. On Tue, Apr 22, 2008 at 7:29 PM, Karl Wettin <[EMAIL PROTECTED]> wrote: > Jonathan Ariel wrote: > > Is there any way to execute a MoreLikeThis over a subset of documents? I > > need to retrieve a set of interesting keyw

Re: MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
ties to > Query moreQuery = mlt.like( target); > BooleanQuery bq = new BooleanQuery(); > bq.add(moreQuery, BooleanClause.Occur.MUST); > Query restrictQuery = new TermQuery(new Term("Category", "A")); > bq.add(restrictQuery, BooleanClause.Occur.MUST); > Hits hits = is.search

MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
Is there any way to execute a MoreLikeThis over a subset of documents? I need to retrieve a set of interesting keywords from a subset of documents and not the entire index (imagine that my index has documents categorized as A, B and C and I just want to work with those categorized as A). Right now

How to obtain the freq term vector of a field from a remote index ?

2008-02-28 Thread Ariel
? How can I do it? Any help is appreciated. Greetings Ariel

MoreLikeThis queries

2008-02-22 Thread Jonathan Ariel
Hi, I'm trying to use MoreLikeThis but I can't find how to make a MoreLikeThis query that will return related documents given a document and some conditions, like country field in the related documents should be 1, etc. Is there any documentation on how to do this kind of query? Thanks, Jonat

MoreLikeThis jar doesn't contain classes

2008-02-22 Thread Jonathan Ariel
Hi, I've downloaded Lucene 2.3.0 binaries and in the contrib folder I can see the Similarity package, but inside the Jar there are no classes! Downloading the sources I ran into the same issue. Am I doing something wrong? Where should I get the MoreLikeThis classes from? Thanks! Jonathan

Re: Why is lucene so slow indexing in nfs file system ?

2008-01-10 Thread Ariel
Thanks for your suggestions. I'm sorry, but I would like to know what you mean by "SAN" and "FC". Another thing: I have visited the Lucene home page and the 2.3 version is not released there. Could you tell me where the download link is? Thank

Re: Why is lucene so slow indexing in nfs file system ?

2008-01-10 Thread Ariel
with the principal one, that is faster than if I had just one thread indexing the documents, isn't it? Your suggestions are very useful. I hope you can help me. Greetings Ariel On Jan 10, 2008 10:21 AM, Erick Erickson <[EMAIL PROTECTED]> wrote: > This seems really clunky. Especially

Re: Why is lucene so slow indexing in nfs file system ?

2008-01-10 Thread Ariel
do some things to test it. Ariel On Jan 10, 2008 8:45 AM, Ariel <[EMAIL PROTECTED]> wrote: > Thanks all you for yours answers, I going to change a few things in my > application and make tests. > One thing I haven't find another good pdfToText converter like pdfBox Do > yo

Re: Why is lucene so slow indexing in nfs file system ?

2008-01-10 Thread Ariel
Thanks to all of you for your answers. I am going to change a few things in my application and run tests. One thing: I haven't found another good pdfToText converter like pdfBox. Do you know of any faster one? Greetings Thanks for your answers Ariel On Jan 9, 2008 11:08 PM, Otis Gospodnetic

Why is lucene so slow indexing in nfs file system ?

2008-01-09 Thread Ariel
Hi: I have seen the post in http://www.mail-archive.com/[EMAIL PROTECTED]/msg12700.html and I am implementing a similar application in a distributed environment, a cluster of only 5 nodes. The operating system I use is Linux (CentOS), so I am using the NFS file system too to access the home director

Re: How to build your custom termfreq vector an add it to the field ?

2007-11-08 Thread Ariel
lly attempts to provide Lucene with a new scoring > mechanism. > > You might also have a look at Lemur (http://www.lemurproject.org/) > which is much more academically focused. > > Cheers, > Grant > > > On Nov 7, 2007, at 12:49 PM, Ariel wrote: > > > Then if

Re: How to build your custom termfreq vector an add it to the field ?

2007-11-07 Thread Ariel
Then if I want to use another scoring formula, must I implement my own Query/Weight/Scorer? For example, instead of cosine distance, leiderbage distance or .. another. I'm studying the Query/Weight/Scorer classes to find out how to do that, but there is not much documentation about that. I have seen I

Re: How to change the similarity function of lucene

2007-09-28 Thread Ariel
the term vector in a document, How can I do it ? Greetings Ariel On 9/24/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > > Perhaps you can explain in what way you want to make it more > powerful? There are possibilities to do: > 1. Change the Similarity class (a call b

How to change the similarity function of lucene

2007-09-24 Thread Ariel
Hi everybody: I would like to know how to change the similarity function of Lucene to extend the possibilities of searching and make it more powerful. Has somebody done this before? Could you help me, please? I don't know how complex this might be. I hope you can help me. Greetings Ariel

Re: How to get documents similar to other document ?

2007-09-11 Thread Ariel
lso > search this archive for MoreLikeThis. > > > On Sep 11, 2007, at 11:50 AM, Ariel wrote: > > > Hi every body: > > > > My question is if there is an api function of lucene to obtain similar > > documents to other document comparing the term frequence ve

How to get documents similar to other document ?

2007-09-11 Thread Ariel
be a very useful functionality of the Lucene API. I am using Lucene version 1.9. I hope you can help me. Greetings. Ariel

Re: Indexing

2007-08-22 Thread Jonathan Ariel
RAMDir, > but that's "an exercise for the reader", > > You could also define the problem away and just live > with a 5 minute latency. > > Best > Erick > > On 8/22/07, Jonathan Ariel <[EMAIL PROTECTED]> wrote: > > > > Hi, > > I'm ne

Indexing

2007-08-22 Thread Jonathan Ariel
Hi, I'm new to this list. So first of all Hello to everyone! So right now I have a little issue I would like to discuss with you. Suppose that you are in a really big application where the data in your database is updated really fast. I reindex lucene every 5 min but since my application lists ev

One index per user or one index per day?

2007-02-26 Thread ariel goldberg
better without testing, but does anyone have a gut reaction? Any advice would be greatly appreciated! Thanks, Ariel

Re: Full disk space during indexing process with 120 gb of free disk space

2006-12-05 Thread Ariel Isaac Romero Cartaya
Here is my source code where I convert pdf files to text for indexing. I got this source code from the Lucene in Action examples and adapted it for my convenience. I hope you can help me fix this problem. Anyway, if you know another, more efficient way to do it, please tell me how: import java.i

Full disk space during indexing process with 120 gb of free disk space

2006-12-04 Thread Ariel Isaac Romero Cartaya
Hi everybody: I am getting a problem during the indexing process. I am indexing large amounts of text, most of it in pdf format, and I am using PDFBox version 0.6. The free space on the hard disk before the indexing process begins is around 120 GB, but incredibly, even when my lucene index doesn't have y

Re: Big problem with big indexes

2006-10-17 Thread Ariel Isaac Romero Cartaya
Here are pieces of my source code: First of all, I search in all the indexes given a query String with a parallel searcher. As you can see I make a multi field query. Then you can see the index format I use; I store all the fields in the index. My index is optimized. public Hits search

Re: Big problem with big indexes

2006-10-16 Thread Ariel Isaac Romero Cartaya
f this helps much, but from your description, I think there's a fundamental, correctable problem because nobody would use the product if it gave such poor search times. And lots of people use it. Best Erick On 10/11/06, Ariel Isaac Romero Cartaya <[EMAIL PROTECTED]> wrote: > > Hi e

Big problem with big indexes

2006-10-11 Thread Ariel Isaac Romero Cartaya
Hi everybody: I have a big problem making parallel searches in big indexes. I have indexed over 60 000 articles with Lucene, and I have distributed the indexes across 10 computer nodes so that each index does not exceed 60 MB in size. I make parallel searches in those indexes but I get the search resu

RE: graphically representing an index

2006-09-01 Thread SOMMERIA KLEIN Ariel Ext VIACCESS-BU_DRM
Hi Andrzej, Thanks for the tip, it does what I want. You are right, though, it's of limited use for helping the user access data. But I'm sure it will come in handy for my own analysis. Best, Ariel -Original Message- From: Andrzej Bialecki [mailto:[EMAIL PROTECTED] En

graphically representing an index

2006-08-31 Thread SOMMERIA KLEIN Ariel Ext VIACCESS-BU_DRM
Hi all, I'm a newbie with Lucene and I'm looking to implement the following: I want to index posts from a forum, and, rather than proposing a search on the contents, graphically represent the contents of the index. More precisely, I would like to have a list of the most popular words, with a number

Re: How to merge lucene indexes ???

2006-05-15 Thread Ariel Isaac Romero
ged into master index before, that's why I can't put the boolean parameter true. I really need help, please. I'm open to any suggestion. On 5/15/06, Daniel Naber <[EMAIL PROTECTED]> wrote: On Montag 15 Mai 2006 19:51, Ariel Isaac Romero wrote: > IndexReader indexTo

How to merge lucene indexes ???

2006-05-15 Thread Ariel Isaac Romero
Hi everybody: I need to know how to merge one index into another. I have a master index to which other indexes are added from other nodes. I want to merge the indexes from the other nodes into the master index, so I made this method: public void merge(String MasterIndexDir, String IndexTo

How do I know the memory size of my RAMDirectory ?

2006-05-09 Thread Ariel Isaac Romero
index in RAM memory. Please, could you help me? Ariel