Is there a SubstringTransformer?

2010-08-23 Thread Gonzalo Payo Navarro
Hi everyone! I need to get the first 100 chars of a string-type field, but I am not able to find something like a SubstringTransformer, therefore I am using the RegexTransformer, but I suspect that it eats a lot of time on indexation time. So, in short, I need something like a SubstringTransforme

Re: Solr Hangs up after couple of hours

2010-08-23 Thread Bill Au
It would be very useful if you can take a threads dump while Solr is hanging. That will give indication where/why Solr is hanging. Bill On Mon, Aug 23, 2010 at 9:32 PM, Manepalli, Kalyan < kalyan.manepa...@orbitz.com> wrote: > Hi all, > I am facing a peculiar problem with Solr querying. D

Re: Solr jam after all my jvm thread pool hang in blocked state

2010-08-23 Thread Bill Au
It would be helpful it you can attached a threads dump. BIll On Mon, Aug 23, 2010 at 6:00 PM, AlexxelA wrote: > > I, > > I'm running solr 1.3 in production for now 1 year and i never had any > problem with it since 2 weeks. It happen 6-7 times a day, all of my thread > but one are in a blocked

SolrJ addField with Reader

2010-08-23 Thread Bojan Vukojevic
I am using SolrJ with embedded Solr server and some documents have a lot of text. Solr will be running on a small device with very limited memory. In my tests I cannot process more than 3MB of text (in a body) with 64MB heap. According to Java there is about 30MB free memory before I call server.a

Solr Hangs up after couple of hours

2010-08-23 Thread Manepalli, Kalyan
Hi all, I am facing a peculiar problem with Solr querying. During our indexing process we analyze the existing index. For this we query the index. We found that the solr server just hangs on a arbitrary query. If we access the admin/stats.jsp, it again resumes executing the queries. The t

Why it's boosted up?

2010-08-23 Thread 朱炎詹
In Lucene's web page, there's a paragraph: "Indexing time boosts are preprocessed for storage efficiency and written to the directory (when writing the document) in a single byte (!) as follows: For each field of a document, all boosts of that field (i.e. all boosts under the same field name i

Re: Doing Shingle but also keep special single word

2010-08-23 Thread 朱炎詹
Thanks! I'll give more effort to understand your suggestion & that Norm thing. - Original Message - From: "MitchK" To: Sent: Tuesday, August 24, 2010 5:28 AM Subject: Re: Doing Shingle but also keep special single word No, I mean that you use an additional field (indexed) for sear

Re: Doing Shingle but also keep special single word

2010-08-23 Thread 朱炎詹
The request is from our business team, they wish user of our product can type in partial string of a word that exists in title or body field. But now I also doubt if this request is really necessary? Scott - Original Message - From: "Ahmet Arslan" To: Sent: Monday, August 23, 2010

Re: ANNOUNCE: Stump Hoss @ Lucene Revolution

2010-08-23 Thread Chris Hostetter
: I have a couple of questions I would like to through your way. : : Is there a place where one can sign up for this. Heh sure, all the details were in my email... : >http://bit.ly/stump-hoss ...and... : > type of solution I might come up with under pressure, please email

about readercycle script

2010-08-23 Thread Koji Sekiguchi
I'm working on SOLR-2046 and realized that readercycle script might be looking for old(?) Solr response format, therefore, today it always fails: https://issues.apache.org/jira/browse/SOLR-2046 Since I've looked for issues regarding readercycle in jira and maling list archives so far, nobody com

Re: lucene + solr: corrupt index

2010-08-23 Thread ANurag
Thx Koji, I tried 2.9.3 and it works :-) On Mon, Aug 23, 2010 at 6:15 PM, Koji Sekiguchi wrote: >  (10/08/24 10:02), ANurag wrote: >> >> Hi, >> I am using lucene 3.0 jars and built a lucene index with 200 >> documents. The index files were then copied over to my solr 1.4.1 >> installation. I get

Re: lucene + solr: corrupt index

2010-08-23 Thread Koji Sekiguchi
(10/08/24 10:02), ANurag wrote: Hi, I am using lucene 3.0 jars and built a lucene index with 200 documents. The index files were then copied over to my solr 1.4.1 installation. I get the following error every time I start SOLR: What could I be doing wrong? Solr 1.4 can read Lucene 2.9 index or

lucene + solr: corrupt index

2010-08-23 Thread ANurag
Hi, I am using lucene 3.0 jars and built a lucene index with 200 documents. The index files were then copied over to my solr 1.4.1 installation. I get the following error every time I start SOLR: What could I be doing wrong? SEVERE: Could not start SOLR. Check solr/home property java.lang.RuntimeE

Solr jam after all my jvm thread pool hang in blocked state

2010-08-23 Thread AlexxelA
I, I'm running solr 1.3 in production for now 1 year and i never had any problem with it since 2 weeks. It happen 6-7 times a day, all of my thread but one are in a blocked state. All thread that are blocked are waiting on the Console monitor owned by the "Runnable" thread. We did not changed

Re: ANNOUNCE: Stump Hoss @ Lucene Revolution

2010-08-23 Thread Israel Ekpo
Chris, I have a couple of questions I would like to through your way. Is there a place where one can sign up for this. Is sounds very interesting. On Mon, Aug 23, 2010 at 4:49 PM, Chris Hostetter wrote: > > Hey everybody, > > As you (hopefully) have heard by now, Lucid Imagination is sponsorin

minMergeDocs supported ?

2010-08-23 Thread stockii
Heya: IS minMergeDocs SUPPORTED IN soLR ? -- View this message in context: http://lucene.472066.n3.nabble.com/minMergeDocs-supported-tp1302856p1302856.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to use synonms on a faceted field with multiple words

2010-08-23 Thread Chris Hostetter
: A quick and dirty work around using Solr 1.4 is to replace spaces in the synonm file with : some other character/pattern. I used ## (i.e. video => digital##media). Then add the : solr.PatternReplaceFilterFactory after the synonm filter to replace pattern with space. : This works, but I'd lo

Re: Solrj ContentStreamUpdateRequest Slow

2010-08-23 Thread Chris Hostetter
: ContentStreamUpdateRequest req = new : ContentStreamUpdateRequest("/update/extract"); : : System.out.println("setting params..."); : req.setParam("stream.url", fileName); : req.setParam("literal.content_id", solrId); ContentStreamUpdateRequest exists so that you can stream content

Re: Doing Shingle but also keep special single word

2010-08-23 Thread MitchK
No, I mean that you use an additional field (indexed) for searching (i.e. whitespace-tokenized, so every word - seperated by a whitespace - becomes to a token . So you have got two fields (shingle-token-field and single-token-field). So you can search accross both fields. This provides several ben

ANNOUNCE: Stump Hoss @ Lucene Revolution

2010-08-23 Thread Chris Hostetter
Hey everybody, As you (hopefully) have heard by now, Lucid Imagination is sponsoring a Lucene/Solr conference in Boston about 6 weeks from now. We've got a lot of really great speakers lined up to give some really interesting technical talks, so I offered to do something a little bit differe

Re: Problem in setting the request writer in SolrJ (wiki page wrong?)

2010-08-23 Thread Ryan McKinley
Note that the 'setRequestWriter' is not part of the SolrServer API, it is on the CommonsHttpSolrServer: http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/impl/CommonsHttpSolrServer.html#setRequestWriter%28org.apache.solr.client.solrj.request.RequestWriter%29 If you are using EmbeddedS

Re: Proper Escaping of Ampersands

2010-08-23 Thread Yonik Seeley
I'd recommend going back to the "textgen" field type as defined in the example schema. Your move of the StopFilter is what is causing the problem. At index time, the "s" gets removed (because the StopFilter is now after the WDF). But a query of "at&s" is transformed into "at s" (the s isn't removed

Re: Proper Escaping of Ampersands

2010-08-23 Thread Chris Hostetter
: The document is indexed correctly, a search for "at s" found it and all : fields looked great ("at&s and not for example, at&s). : : As my stopword list does not contain "at" or "&" or "&", I don't : quite understand, why my result is found, when I disable the : stopword-list. My stopwordlist ca

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-23 Thread stockii
thx, for your help. now it works fine. its very simple when you kno how :D haha i try bernds suggest =) -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1296175.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Tokenising on Each Letter

2010-08-23 Thread Scottie
Nikolas, thanks a lot for that, I've just gave it a quick test and it definitely seems to work for the examples I've gave. Thanks again, Scott From: Nikolas Tautenhahn [via Lucene] Sent: Monday, August 23, 2010 3:14 PM To: Scottie Subject: Re: Tokenising on Each Letter Hi Scottie, > Cou

Re: SolrException log

2010-08-23 Thread Bastian Spitzer
I dont seem to find a decent documentation on how those parameters actually work. this is the default, example block: 1 0 so do i have to increase the maxCommitsToKeep to a value of 2 when i add a maxCommitAge Parameter? Or will 1 still be enough? D

Re: help refactoring from 3.x to 4.x

2010-08-23 Thread Ryan McKinley
On Mon, Aug 23, 2010 at 7:00 AM, Michael McCandless wrote: > Spooky that you see incorrect results!  The code looks correct.  What > are the specifics on when it produces an invalid result? Figured this out -- the above code is not invalid, however i tried versions that movedthe utf8ToString() th

Re: SolrException log

2010-08-23 Thread Bastian Spitzer
Hi Tommaso, Thanks for your Reply. The Solr Files are on local disk, on a reiserfs. I'll try to set a Deletion Policy and report back if that solved the problem, thank you for the hint. cheers, Bastian -Ursprüngliche Nachricht- Von: Tommaso Teofili [mailto:tommaso.teof...@gmail.com] Ge

Re: Tokenising on Each Letter

2010-08-23 Thread Nikolas Tautenhahn
Hi Scottie, > Could you elaborate about N gram for me, based on my schema? just a quick reply: > positionIncrementGap="100"> > > > > > generateNumberParts="0" catenateWords="1" catenateNumbers="0" catenateAll="0" > splitOnCaseChange="1" splitOnNumerics="0

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-23 Thread stockii
"ant package" BUILD FAILED run program "perl" ... it`s necessary to install perl on my computer ?! -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1291992.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Tokenising on Each Letter

2010-08-23 Thread Scottie
Probably a good idea to post the relevant information! I guess I thought it would be a really obvious answer but it seems its a bit more complex ;) It seems you may be correct about the catenat

Re: SolrException log

2010-08-23 Thread Tommaso Teofili
Hi Bastian, this seems to be related to IO and file deletion (optimization compacts and removes index files), are you running Solr on NFS or a distributed file system? You could set a propert IndexDeletionPolicy (SolrDeletionPolicy) in solrconfig.xml to handle this. My 2 cents, Tommaso 2010/8/11 B

Problem in setting the request writer in SolrJ (wiki page wrong?)

2010-08-23 Thread Constantijn Visinescu
Hello, I'm using an embedded solrserver in my Java webapp, but as far as i can tell it's defaulting to sending updates in XML, which seems like a huge waste compared to sending it in Java binary format. According to this page: http://wiki.apache.org/solr/Solrj#Setting_the_RequestWriter I'm suppo

Re: Doing Shingle but also keep special single word

2010-08-23 Thread Ahmet Arslan
> 1. We have over ten million news articles to build into > Solr index. > 2. We copy several fields, such as title, author, body, > caption of attahed photos into a new field for default > search. > 3. We then wanna use shingle filter on this new field. > 4. We can't predict what new single-word no

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-23 Thread Drew Farris
On Sun, Aug 22, 2010 at 8:29 PM, stockii wrote: > > okay, thx. but it want work =( > > i checkout solr1.4.1 as dynamic web project into eclipse. startet jetty with > XDebug. In eclpise i add WebLogic exactly how the tutorial shows but eclipse > cannot connect =( > > any idea what im doing wrong ?

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-23 Thread Bernd Fehling
> > can nobody help me or want :D As already someone said: - install Eclipse - add Jetty Webapp Plugin to Eclipse - add svn plugin to Eclipse - download with svn the repository from trunk - change to lucene dir and run "ant package" - change to solr dir and run "ant dist" - setup with "Run conf

Re: How to get most indexed keyword from SOLR

2010-08-23 Thread Grijesh.singh
Hi Pawan, If u r using solr1.4 or latter version then u can see terms info by using terms request handler like http://localhost:8080/solr/terms/?terms.fl=text&terms.sort=count -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-get-most-indexed-keyword-from-SOLR-tp124055

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-23 Thread stockii
can nobody help me or want :D -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1288705.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: help refactoring from 3.x to 4.x

2010-08-23 Thread Michael McCandless
Spooky that you see incorrect results! The code looks correct. What are the specifics on when it produces an invalid result? Also spooky that you see it running slower -- how much slower? Did you rebuild the index in 4.x (if not, you are using the preflex codec)? And is the index otherwise ide

Re: Proper Escaping of Ampersands

2010-08-23 Thread Nikolas Tautenhahn
Hi Yonik, I got it working, but I think the Stopword Filter is not behaving as expected - (The document could be found when I disabled the stopword filter, details later in this mail...) On 20.08.2010 16:57, Yonik Seeley wrote > On Thu, Aug 19, 2010 at 11:33 AM, Nikolas Tautenhahn > wrote: >> Bu

Re: possible to have multiple elevation file?

2010-08-23 Thread Chamnap Chhorn
Hi, Here, I talk about QueryElevationComponent . Anyone has some idea? Thanks On Mon, Aug 23, 2010 at 3:10 PM, Chamnap Chhorn wrote: > Hi, > > I need multiple elevatio

Re: Autosuggest on PART of cityname

2010-08-23 Thread gwk
On 8/20/2010 7:04 PM, PeterKerk wrote: @Markus: thanks, will try to work with that. @Gijs: I've looked at the site and the search function on your homepage is EXACTLY what I need! Do you have some Solr code samples for me to study perhaps? (I just need the relevant fields in the schema.xml and

possible to have multiple elevation file?

2010-08-23 Thread Chamnap Chhorn
Hi, I need multiple elevation file for each site (around 200). I think one big elevation file is difficult to manage. How could I manage each elevation file differently? Thanks -- Chhorn Chamnap http://chamnapchhorn.blogspot.com/