Re: Testing Solr Cloud with ZooKeeper

2012-11-13 Thread darul
Looks like after timeout has finished, first solr instance respond



I was not waiting enough. Is it possible to reduce this *timeout* value ?

Thanks



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4020190.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-13 Thread darul
https://issues.apache.org/jira/browse/SOLR-3993 has been resolved.

Just few question, is it in trunk, I mean in main distrib downloadable on
main solr site.

Because I have downloaded it and get still same behaviour while running
first instance..or second shards.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4020118.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-09 Thread darul
Yes ku3ia, I read your thread yesterday and looks like we get same issue. I
wish Apache Con is nearly finished and expert can resolve this 
Thanks again to solr community,
Jul



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4019271.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-09 Thread darul
- Shards : 2
- ZooKeeper Cluster : 3
- One collection.

Here is how I run it and my scenario case:

In first console, I get first Node (first Shard) running on port 8983:





In second console, I get second Node (second Shard) running on port 8984:





Here I get just 2 nodes for my 2 shards running.

The I decide to add 2 replicates for each shard node.


and


Now everything is fine, a "robust" collection with 2 shards, 2 replicates
running. 

Result expected is here:

 

Then, I decide to stop the 2 last predicates running on port 7501/7502.

Results expected is here:
 

Then I now stop the 2 main instances running on port 8983/8983.

Restart the first one 8983:

I get a lot of this dump in console:


Why not, I start second one running on 8984, and get 



I do not understand why replicates are needed at this phase...first when I
started the first time, no need for replicates. And now, I would like
restart 2 main instances, and maybe start replicates later.

If I start both instances 7501/7502, everything is fine but not what I was
expected.

Any ideas,

Thanks again,

Jul



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4019257.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-08 Thread darul
Too illustrate:

 

Taking this example, 8983 and 8984 are Shard "owner", 7501/7502 just
replicates.

If I stop all instance, then restart 8983 or 8984 first, they won't run and
asked for replicates too be started...




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4019103.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-08 Thread darul
Thanks Otis, 

Indeed here too  zoo doc

 
, they advise to choose odd number of zk nodes this way "To create a
deployment that can tolerate the failure of F machines, you should count on
deploying 2xF+1 machines"...

Well, I just do not yet understand why after using replicate, I am not able
to restart solr instances if replicates are not running. (When I start them,
it is ok)

Do I need to erase all zookeeper config every time solr servers are
restarted...I mean send the conf again with bootstrap, looks like I am not
doing the right way ;)





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4019102.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-08 Thread darul
Hello again,

With the following config :

- 2 zookeeper ensemble
- 2 shards
- 2 main solr instances for the 2 shards
- I added 2, 3 replicates for fun.

While running and I stop one replicate, I see in admin ui graph updates
(replicate disabled/inactivated)...normal.

But if I stopped all solr instance and restart the first main instance
:8983, I always get it waiting for some replicates...is it useful ? Why
replicate are needed to run ? Can not access to admin anymore. 

Solution is to erase zookeeper data and start again, do you have any
solutions to avoid :



What if my replicates are really down in production and I restart everything
?

Another question, 2 shards means 2 zookeeper ensemble, 3 shards, 3 zookeeper
ensemble ?

Thanks,

Jul



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4019028.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-07 Thread darul
Yes instanceDir attribute point to new created core (with no conf dir) so it
is stranged...

but looks like I have played to much:



when I start main solr shard. I try everything again tomorrow and give you
feedback.





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4018909.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Testing Solr Cloud with ZooKeeper

2012-11-07 Thread darul
I reply to myself :


darul wrote
> 
*
> Few questions:
*
> - my both zookeeper have their own data directory, as usual, but I did not
> see so much change inside after indexing examples docs. Are data stored
> their or just 
/
> configuration (conf files) 
/
> is stored in zookeeper ensemble ? Can you confirmed 
/
> index data
/
>  are also stored in zookeeper cluster ? Or not ?

I read again and see "Solr embeds and uses Zookeeper as a repository for
cluster configuration and coordination", so meaning just configuration, not
index repository at all ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900p4018902.html
Sent from the Solr - User mailing list archive at Nabble.com.


Testing Solr Cloud with ZooKeeper

2012-11-07 Thread darul
Hello everyone,

Having used *Hadoop* (not in charge of deployment, just java code part) and
*Solr 3.6* (deployment and coding) this year, today I made the solr cloud
wiki.

Well, 

* I have deployed 2 zookeeper (not embedded) instances
* 2 solr instances with 2 shards (pointing to zookeeper nodes)
* 2 solr replicates

 successfully ...thank you for new administration ui, graph and co,
nice.

But I am still confused with all these new amazing features. (compared to
when I was using multicore and master/slave behaviour).

Here in cloud, I am lost (in translation too)

*Few questions:*
- my both zookeeper have their own data directory, as usual, but I did not
see so much change inside after indexing examples docs. Are data stored
their or just /configuration (conf files) /is stored in zookeeper ensemble ?
Can you confirmed /index data/ are also stored in zookeeper cluster ? Or not
?
- In my solr instances directory tree,  /solr/mycollection/ sometimes I have
an "index" or "index.20121107185908378" directory and tlog directory, what
is it used for, could you explain me why index directory sometimes looks
like a snapshot ? zookeeper should not store index, sorry I repeat myself,
or is it just a snaphot. what is tlog directory for ?
- Then, playing a little bit, I test following command
http://localhost:8983/solr/admin/collections?action=CREATE&name=myname&numShards=2&replicationFactor=1
and see it update configuration of core.xml and create "data" directory as
well, nice. But when I navigate to admin ui and check schema for instance,
where does this configuration come from ? I do not get any conf directory
for this core, does it take one by default

I have so much questions to ask.

Thanks,

Julien




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Testing-Solr-Cloud-with-ZooKeeper-tp4018900.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Easy question ? docs with empty geodata field

2012-10-22 Thread darul
Amit,

Your guess was perfect and result is what expected:

fq=-location_0_coordinate:[* TO *] to get docs with no geo data

Thx,

Jul



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751p4015067.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Easy question ? docs with empty geodata field

2012-10-20 Thread darul
Indeed, it would be nice we can use [* TO *]

Then, is it possible to deal with the following on solr 3.6:
geofieldname:[-90,-180 TO 90,180]





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751p4014944.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Easy question ? docs with empty geodata field

2012-10-20 Thread darul
Thank you Amit, 

I let you know on monday when at office because I do not have access to solr
from home...

But I guess I missed to use dynamic field in right way, a long time I do not
read my basics ;)





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751p4014943.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Easy question ? docs with empty geodata field

2012-10-19 Thread darul
Your idea looks great but with this schema info :

 


.




How can I use it ?

fq=location_coordinate:[1 to *] not working by instance





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751p4014779.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Easy question ? docs with empty geodata field

2012-10-19 Thread darul
sorry, I mean this field called "geodata" in my schema






--
View this message in context: 
http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751p4014752.html
Sent from the Solr - User mailing list archive at Nabble.com.


Easy question ? docs with empty geodata field

2012-10-19 Thread darul
Hello,

Looking to get all documents with empty geolocalisation field, I have not
found any way to do it, with ['' to *], 

geodata being a specific field, do you have any solution ?

Thanks,

Jul



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Charfilter keep "dates" but skeep "number"

2012-10-17 Thread darul
Thank you Erik,

Probably right to put this business rule above processing or via a
updateRequestProcessorChain.

Thanks again, I love this forum, so efficient.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Charfilter-keep-dates-but-skip-number-tp4014049p4014211.html
Sent from the Solr - User mailing list archive at Nabble.com.


Charfilter keep "dates" but skeep "number"

2012-10-16 Thread darul
Hello all,A long time I have not posted, but do not worry I am still using
Solr everyday and enjoy it.Here the details of my requirement:According to a
specific content with "dates", "number", (maybe number%), we would like to
*skip number* and *keep dates (+number%)* in indexation process. Do you see
a common way to achieve this with provided analysers or charfilter
(PatternReplaceCharFilterFactory...). I have used
PatternReplaceCharFilterFactory to skip number, but results are not relevant
for what we are looking for.Example (stupid one ;)):After processing, may
be:



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Charfilter-keep-dates-but-skeep-number-tp4014049.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: maxScore always returned

2012-06-05 Thread darul
maybe look into your solrconfig.xml file whether fl not set by default on
your request handler 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/maxScore-always-returned-tp3987727p3987733.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: searching and text highlighting

2012-04-18 Thread darul

rpc29y wrote
> 
> Good afternoon:
>  I would like to know if it can be indexed with SolR word documents or
> pdf.
> 
Yes, you may first look at Tika Solr processor.

rpc29y wrote
> 
>  If so how do I modify the solrconfig.xml to search these documents and
> highlight the found text?
> 
I guess you should first follow solr tutorial to know more about it, how
query parser work, how to define your schema and then you may use highlight
in right way.

http://wiki.apache.org/solr/HighlightingParameters




--
View this message in context: 
http://lucene.472066.n3.nabble.com/searching-and-text-highlighting-tp3917856p3919546.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Client-side failover with SolrJ

2012-03-27 Thread darul
I rediscover the world every day, thanks for this.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Client-side-failover-with-SolrJ-tp3858461p3860700.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: is the SolrJ call to add collection of documents a blocking function call ?

2012-03-20 Thread darul
Hmm nice feature Erik

--
View this message in context: 
http://lucene.472066.n3.nabble.com/is-the-SolrJ-call-to-add-collection-of-documents-a-blocking-function-call-tp3839387p3842232.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: XSLT Response Writer and content transformation

2012-03-07 Thread darul
Finally get success to make it while upgrading transformer to use Saxon, I
will give you details soon, it can be useful and is nice feature to get nice
rss feed.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/XSLT-Response-Writer-and-content-transformation-tp3800251p3807212.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: XSLT Response Writer and content transformation

2012-03-06 Thread darul
also tried :

  

  

to get my description content processed, but no success until now ;)

--
View this message in context: 
http://lucene.472066.n3.nabble.com/XSLT-Response-Writer-and-content-transformation-tp3800251p3804528.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: XSLT Response Writer and content transformation

2012-03-06 Thread darul
Well, by default solr distribution is using xalan ? 

- I have created my custom class mypackage.XsltCustomFunctions, with my
custom method processContent(), put jar in jetty/lib root directory.
- update my rss.xsl file by adding
xmlns:ev="xalan://mypackage.XsltCustomFunctions" in header
- try this syntax : 

get this nice exception java.lang.RuntimeException: getTransformer fails in
getContentType

I not get solr dev environnement to see where it comes from, have you any
idea. 

I do not think it is a classpath problem but a syntax one.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/XSLT-Response-Writer-and-content-transformation-tp3800251p3804473.html
Sent from the Solr - User mailing list archive at Nabble.com.


XSLT Response Writer and content transformation

2012-03-05 Thread darul
Hello,

Using native XSLT Response Writer, we may need to alter content before
processing xml solr output as a RSS Feed.

Example (trivial one...):


  bla bla bla 


After processing content:


  bla bla bla bla bla bla bla bla bla bla bla bla


Have you any ideas on how to implement a custom function in xslt or before
in XsltResponseWriter.

I would like get this code in a java class and call it for content
processing

Thanks,

Jul

--
View this message in context: 
http://lucene.472066.n3.nabble.com/XSLT-Response-Writer-and-content-transformation-tp3800251p3800251.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Fields, Facets, and Search Results

2012-02-22 Thread darul
And check your log file, you may have some errors at start of your server.

Due to some mistake, bad syntax in your schema file for example...

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fields-Facets-and-Search-Results-tp3765946p3767569.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Fields, Facets, and Search Results

2012-02-22 Thread darul
Well, you probably need to clear you index first..remove index director,
restart your server and try again.
Let me know if it works or not.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fields-Facets-and-Search-Results-tp3765946p3767537.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Fields, Facets, and Search Results

2012-02-22 Thread darul
Check you schema config file first.

It looks like you have missed copy of "section_text_content" field's content
to your default search field :

 
 text



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fields-Facets-and-Search-Results-tp3765946p3766084.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr logging

2012-02-20 Thread darul
This case explained here:

http://stackoverflow.com/questions/762918/how-to-configure-multiple-log4j-for-different-wars-in-a-single-ear

http://techcrawler.wordpress.com/



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760352.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr logging

2012-02-20 Thread darul
Hmm, I did not try to achieve this but interested if you find a way...

After I believe than having log4j config file outside war archive is a
better solution, if you may need to update its content for example.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760322.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr logging

2012-02-20 Thread darul
Yes, you can update your .war archive by adding/removing expected jars.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760285.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr logging

2012-02-20 Thread darul
I get similar questions in the past :)

http://lucene.472066.n3.nabble.com/Jetty-logging-td3476715.html#a3483146

wish it will help you.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760173.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Date formatting issue

2012-02-17 Thread darul
Expected date example :

2012-02-17T13:45:08.1Z

Date formatter:

-MM-dd'T'HH:mm:ss.SSS'Z'

Missing Zulu in yours ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Date-formatting-issue-tp3748591p3753704.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solrcore.properties

2012-02-07 Thread darul

Walter Underwood wrote
> 
> Looking at SOLR-1335 and the wiki, I'm not quite sure of the final
> behavior for this.
> 
> These properties are per-core, and not visible in other cores, right?
> 
> 

yes it is.


Walter Underwood wrote
> 
> 
> Are variables substituted in solr.xml, so I can swap in different
> properties files for dev, test, and prod? Like this:
> 
> 
> 
> If that does not work, what are the best practices for managing
> dev/test/prod configs for Solr?
> 
> 

As you can see here http://wiki.apache.org/solr/CoreAdmin I am not sure you
can set a property file to be loaded per core with this variable syntax.
Does someone may confirm ?

What we have made here is a maven project, some variable properties in
.properties or .xml solr configuration files. Then while generating project,
we use maven profile to generate dev/prod...distribution.

Wish it can help you,

Jul


--
View this message in context: 
http://lucene.472066.n3.nabble.com/solrcore-properties-tp3720446p3723212.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Sort by geoDist() syntax error on 3.5

2012-01-30 Thread darul
Is it readable now ?

Thanks Steve.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-by-geoDist-syntax-error-on-3-5-tp360p3700539.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Sort by geoDist() syntax error on 3.5

2012-01-30 Thread darul
Well, I guess I have found the problem in the following thread "geodist()
sort does not work if sfield parameter is enclosed in LocalParams"

http://www.lucidimagination.com/search/link?url=https://issues.apache.org/jira/browse/SOLR-2254

Correct syntax may be :




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-by-geoDist-syntax-error-on-3-5-tp360p3700020.html
Sent from the Solr - User mailing list archive at Nabble.com.


Sort by geoDist() syntax error on 3.5

2012-01-30 Thread darul
Hello,

Looking in threads to find the correct syntax, I have not found any
solutions to get right syntax on sorting by distance.

Here query with no sort :


With sort activated, it throws an exception :

I have tried:


No success.

Schema:


Thanks in advance, I really need it as you may imagine.

Jul

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-by-geoDist-syntax-error-on-3-5-tp360p360.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Currency field type

2012-01-26 Thread darul
Thank you Erik, I think about taking time to be more involved in solr
development.

In the meantime, I will choose to store prices and currency in a normalized
way.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Currency-field-type-tp3684682p3690076.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr replication

2012-01-25 Thread darul
I mean by grid the list of your instances :

String masterUrl = "http://masterUrl/core/...";
String[] slaveUrls = {"http://slaveUrl/core/...";,
"http://slaveUrl/core/..."}

Then use your business logic to use the correct one with Http solrJ facade.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/solr-replication-tp3687106p3687314.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr replication

2012-01-25 Thread darul
You may define your specific configuration as a Grid with all your solr
instances and then using SolrJ and CommonsHttpSolrServer choose the right
url depending on indexation or search task.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/solr-replication-tp3687106p3687208.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr replication

2012-01-25 Thread darul
Here is the way I see it (and implemented it), while using SolrJ api you have
to fire :

- Indexation commands to your /indexation solr instance/ (master) example :
http://myMaster:80/myCore/
- Query commands to your /search solr instance/ (slave). You may have
several slaves, and also find alternative as broker to make load balancing
betweeen each
http://mySlave1:80/myCore/
http://mySlave2:80/myCore/
...

You do not need any changes in code normally, replication is made
automatically and defined in your solrconfig.xml configuration file.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/solr-replication-tp3687106p3687168.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: analyzing stored fields (removing HTML tags)

2012-01-24 Thread darul
You probably may use a Sanitizer as we do here.

http://stackoverflow.com/questions/1947021/libs-for-html-sanitizing



--
View this message in context: 
http://lucene.472066.n3.nabble.com/analyzing-stored-fields-removing-HTML-tags-tp3685144p3685182.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: charFilter PatternReplaceCharFilterFactory and highlighting

2012-01-04 Thread darul
Well I guess may be a bug somewhere 

https://issues.apache.org/jira/browse/LUCENE-2208

--
View this message in context: 
http://lucene.472066.n3.nabble.com/charFilter-PatternReplaceCharFilterFactory-and-highlighting-tp3629699p3631571.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: charFilter PatternReplaceCharFilterFactory and highlighting

2012-01-04 Thread darul
Some of our path in indexed content may contains some words matching query,
what we do not expect, that is why I have applied a CharFilter to skip it.

Here is example of content before filtering :



After applying regexp filter I have provided in my previous thread, it
should look like, shouldn't it (skip links path in indexation ?) :



I have also made query tests and get no results matching when looking for
"*uploads*" or "*content*", what is our expected behaviour.

Problem is when I activate highlight and search for "*word*", it throws
exception.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/charFilter-PatternReplaceCharFilterFactory-and-highlighting-tp3629699p3631367.html
Sent from the Solr - User mailing list archive at Nabble.com.


charFilter PatternReplaceCharFilterFactory and highlighting

2012-01-03 Thread darul
Hello,

I wanted to use char filter PatternReplaceCharFilterFactory to avoid
specific content to be indexed. 

At the end I get many issues with highlights and offsets...so I remove it,
example :



Example of content :



My charfilter should clean it like :



I do not understand why offset of highlights are disturbed by charFilter
while it is defined in first, it may change content before highlight
processing occurs ?



Do you have any solutions, we really need charFilter feature ?

Thanks,

Jul





--
View this message in context: 
http://lucene.472066.n3.nabble.com/charFilter-PatternReplaceCharFilterFactory-and-highlighting-tp3629699p3629699.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: DeleteByQuery and date filter

2011-12-20 Thread darul
One of possible correct syntax may be :

(context:BACKOFFICE)AND(type:IDEA)AND(creationDate:[2011-12-01T00:00:00Z TO
2011-12-20T00:00:00Z])

...

--
View this message in context: 
http://lucene.472066.n3.nabble.com/DeleteByQuery-and-date-filter-tp3600739p3600769.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: DeleteByQuery and date filter

2011-12-20 Thread darul
I was thinking of this kind of reason but no. See my next reply. Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/DeleteByQuery-and-date-filter-tp3600739p3600757.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: DeleteByQuery and date filter

2011-12-20 Thread darul
Looks like millisecond in formatter are useless and make query failed.

Replace 

(context:BACKOFFICE)(type:IDEA)AND(creationDate:[2011-12-01T00:00:00*.000*Z
TO 2011-12-25T00:00:00*.000*Z])

By

(context:BACKOFFICE)(type:IDEA)AND(creationDate:[2011-12-01T00:00:00Z TO
2011-12-25T00:00:00Z])

Solved issue.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/DeleteByQuery-and-date-filter-tp3600739p3600755.html
Sent from the Solr - User mailing list archive at Nabble.com.


DeleteByQuery and date filter

2011-12-20 Thread darul
Hello,

I have the following issue when using deleteByQuery, it works fine with
simple filters:



and fail when using date filter



Can you help me ? I have tried a lot of syntax but not found the good one
yet...boring

Thanks,

Julien



--
View this message in context: 
http://lucene.472066.n3.nabble.com/DeleteByQuery-and-date-filter-tp3600739p3600739.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Copy in multivalued field and faceting

2011-12-19 Thread darul
The first case you mentioned is the one I am looking for. I do not want top
terms on a whole index but top terms for a specific query result set.

Faceting on my field appears being the only way to get relevant results of
top terms for documents that hit query. 

Thanks for LukeRequestHandler and TermsComponent features, I knew them also
but they do not meet my needs.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Copy-in-multivalued-field-and-faceting-tp3584819p3598485.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Copy in multivalued field and faceting

2011-12-19 Thread darul
Thank you all for these advices, you are obviously right that no need for any
copyField instructions to get what we expect. 

I will do some tests on using facet or LukeRequestHandler which seem much
more useful in my case.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Copy-in-multivalued-field-and-faceting-tp3584819p3597802.html
Sent from the Solr - User mailing list archive at Nabble.com.


Copy in multivalued field and faceting

2011-12-14 Thread darul
Hello,

Field for this scenario is "Title" and contains several words.

For a specific query, I would like get the top ten words by frequency in a
specific field.

My idea was the following:

- Title in my schema is stored/indexed in a specific field
- A copyField copy Title field content into a multivalued field. If my
multivalue field use a specific tokenizer which split words, does it fill
each word in each multivalued items ?
- If so, using faceting on this multivalue field, I will get top ten words,
correct ?

Example:

1) Title : this is my title
2) CopyField Title to specific multivalue field F1
3) F1 contains : {this, is, my, title}

My english

Thanks,

Jul

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Copy-in-multivalued-field-and-faceting-tp3584819p3584819.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr dismax scoring and weight

2011-11-23 Thread darul
Thanks a lot Erick for this explanation. Do you mean words are stored in
bytes, that's it ? 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-dismax-scoring-and-weight-tp3490096p3531917.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Jetty logging

2011-11-19 Thread darul
test

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Jetty-logging-tp3476715p3520897.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr dismax scoring and weight

2011-11-09 Thread darul
Thanks for the details, but what do you mean by normalization, can you
describe shortly the concepts behind ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-dismax-scoring-and-weight-tp3490096p3492986.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solr dismax scoring and weight

2011-11-08 Thread darul
title^1.1 body^1.0 comments^0.5

Could someone explain me how to understand following query debug, and how
score is computed. 

Here are 4 documents with "Idée" word in title, body or comments.

Results are in this order by score, I do not undestand why fourth document
is not second in the results.

First :
Title : *Idée *intéressante
Body: Solr fonctionne chez vous ou pas ?

Second:
Title : *Idée* du lundi 01112011
Body : Voici le contenu de mon *idée*
Comments:
- commentaire avec le mot *idée*
- bonne *idée *pour un début de semaine

Third :
Title : Une *idée *pas comme les autres d'avant
Body: Ah oui cette *idée *est intéressante

Fourth :
Title : *Idée *intéressante encore
Body: Solr fonctionne chez vous ou pas ?

For example what mean " (MATCH) weight(title:idé^1.1 in 0)" 




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-dismax-scoring-and-weight-tp3490096p3490096.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Jetty logging

2011-11-05 Thread darul
Thank for the reply Erik, I have to check my log4j config on monday as I
filtered on some solr packages growing logs for nothing...but without
filtering I remember I not see any traces of queries..I will check it again.
I remember seeing it in console so I am certainly wrong...again ;)

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Jetty-logging-tp3476715p3483146.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Jetty logging

2011-11-03 Thread darul
Well,  jetty is running as a unix service.

Here is run command :



jetty-logging.xml:




With this configuration I have  logs of jetty but no logs of log4j: exemple
"/logs/_mm_dd.stderrout.log"

2011-11-03 14:36:59.306:INFO::jetty-6.1-SNAPSHOT
Nov 3, 2011 2:36:59 PM org.apache.solr.core.SolrResourceLoader
locateSolrHome
INFO: JNDI not configured for solr (NoInitialContextEx)
Nov 3, 2011 2:36:59 PM org.apache.solr.core.SolrResourceLoader
locateSolrHome
INFO: using system property solr.solr.home: /opt/solr-slave/multicore
Nov 3, 2011 2:36:59 PM org.apache.solr.core.SolrResourceLoader 
INFO: Solr home set to '/opt/solr-slave/multicore/'
Nov 3, 2011 2:36:59 PM org.apache.solr.servlet.SolrDispatchFilter init
INFO: SolrDispatchFilter.init()
Nov 3, 2011 2:36:59 PM org.apache.solr.core.SolrResourceLoader
locateSolrHome
INFO: JNDI not configured for solr (NoInitialContextEx)
Nov 3, 2011 2:36:59 PM org.apache.solr.core.SolrResourceLoader
locateSolrHome
INFO: using system property solr.solr.home: /opt/solr-slave/multicore
Nov 3, 2011 2:36:59 PM org.apache.solr.core.CoreContainer$Initializer
initialize

I would like jetty use my resource/log4j.properties file :




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Jetty-logging-tp3476715p3477221.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: pingQuery problem ?

2011-11-03 Thread darul
One of my core had a missing ping request handler.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/pingQuery-problem-tp3476850p3476980.html
Sent from the Solr - User mailing list archive at Nabble.com.


pingQuery problem ?

2011-11-03 Thread darul
My solr instance works well, when calling ping page I get no problem :



But in logs, I see this error lines repeated, do you know how to solve this
?



solrconfig.xml



Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/pingQuery-problem-tp3476850p3476850.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: questions about autocommit & committing documents

2011-10-27 Thread darul
While sending documents with SolrJ Http API...at the end, I am never sure
documents are indexed. 

I would like to store them somewhere and resend them in case commit has
failed.

If commit occurred every 10 minutes for example, and 100 documents are
waiting to be commit, server crash or stop..this 100 documents won't be
indexed later because my business logic won't send them again...

Then I would like create a Job (cron) which look into a table or somewhere
for documents which may not have been indexed due a problem occurred during
commit process.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/questions-about-autocommit-committing-documents-tp1582487p3459089.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Get results ordered by field content starting with specific word

2011-10-27 Thread darul
Meaning I need to implement my own QueryParser ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Get-results-ordered-by-field-content-starting-with-specific-word-tp3455754p3459064.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Get results ordered by field content starting with specific word

2011-10-27 Thread darul
Well, at indexed time I can not touch because we do not have data to index
anymore.

To use SpanFirstQuery, I need to make a custom ParserQuery ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Get-results-ordered-by-field-content-starting-with-specific-word-tp3455754p3457167.html
Sent from the Solr - User mailing list archive at Nabble.com.


Get results ordered by field content starting with specific word

2011-10-26 Thread darul
I have seen many threads talking about it but not found any way on how to
resolve it.

In my schema 2 fields :



Results are sorted by field2 desc like in the following listing when looking
for "word1" as query pattern:



I would like to get Doc3 at the end because "word1" is not at the beginning
of the field content.

Have you any idea ? 

I have seen SpanNearQuery, tried FuzzySearch with no success etc...maybe
making a special QueryParserPlugin, but I am lost ;)

We use Solr 3.4

Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Get-results-ordered-by-field-content-starting-with-specific-word-tp3455754p3455754.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: questions about autocommit & committing documents

2011-10-25 Thread darul
I was not sure thank you.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/questions-about-autocommit-committing-documents-tp1582487p3450794.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: questions about autocommit & committing documents

2011-10-25 Thread darul
Well until now I was using SolrJ API to commit() (for each document added...)
changes but wonder in case of a production deployment it was not a best
solution to use AutoCommit feature instead.

With AutoCommit parameters, is it mandatory to remove commit() instruction
called on CommonsHttpSolrServer

try
{
   getServer().addBean(o);
   getServer().commit(); => to remove ?
...}
  
I just have another questions, I was looking all over the threads but not
found any solutions yet about how to get a CallbackHandler with all
documents commited. Is there a way simple way to achieve this ?

Thanks again Erick.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/questions-about-autocommit-committing-documents-tp1582487p3450739.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: questions about autocommit & committing documents

2011-10-23 Thread darul
May someone explain me different use case when both or only one AutoCommit
parameters is filled ?

I really need to understand it.

For example with these configurations :

  
  1 


or 

  
  1000  


or 

  
  1
  1000  


Thanks to everyone

--
View this message in context: 
http://lucene.472066.n3.nabble.com/questions-about-autocommit-committing-documents-tp1582487p3445607.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: questions about autocommit & committing documents

2011-10-22 Thread darul
Old entry but I try to configure auto commit.

I am still not sure to understand how Solr handles the commit process.

Does Solr really wait for 1 documents before send a commit ?

I was thinking, it will use maxTime and then commit a number of documents
less than 1.

Could you please correct this following scenario:
- 20 documents are added.
- After value of maxTime is reached, the 20 documents are committed because
less than 1 ?
- 2 documents are added.
- After value of maxTime is reached, only the first 1 documents are
committed. The next 1 will on next iteration of commit phase.

Is it the right way to understand both maxTime and maxDocs parameters ?

Thanks, 



> - If I enable  and set  at 1, does it mean that
> my new documents won't be avalable for searching until 10,000 new
> documents have been added?
> 
Yes, that's correct. However, you can do a commit explicitly, if you want to
do so. 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/questions-about-autocommit-committing-documents-tp1582487p3443838.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Query url escape caracters ?

2011-10-11 Thread darul
 *ClientUtils.toQueryString()* saved my life ! I was boring for hours before
finding solutions with my favourite search engine ;)
http://lucene.472066.n3.nabble.com/Does-SOLR-provide-a-java-class-to-perform-url-encoding-td842660.html

final SolrQuery newQuery = SolrQueryBuilder.buildQuery(queryParam);
.   
String queryUrl = url + "/select" + ClientUtils.toQueryString(newQuery,
false);

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Query-url-escape-caracters-tp3412912p3413178.html
Sent from the Solr - User mailing list archive at Nabble.com.


Newbie question

2011-10-11 Thread darul
If using CommonsHttpSolrServer query() method with parameter wt=json, when
retrieving QueryResponse, how to do to get JSON result output stream ?

I do not understand, I can get response.getResults() etc...but no way to
find just JSON output stream.

Thanks,

Jul

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Newbie-question-tp3413106p3413106.html
Sent from the Solr - User mailing list archive at Nabble.com.


Query url escape caracters ?

2011-10-11 Thread darul
Hello,

We use SolrJ for building and sending request to Solr server. (working well)

On the other part, we want to use HttpClient to request server and get
result in Json or Xml result ouput format.

Scenario:

- building SolrQuery object with SolrJ.
- getting parameters with SolrQuery.toString() method
- appending to my server url..
- sending with HttpClient

Problem is the following, for complex query parameters like the following
url :

http://192.168.10.233:8984/solr/mycore/select?q=test+&start=0&rows=5&qt=myrequestHandler&hl=true&hl.snippets=5&hl.fragsize=150&hl.fl=body%2C+title%2C+comments&fl=*%2C+score&sort=score+desc&facet.range=creation_date&facet.range=creation_date&f.creation_date.facet.range.start=2010-10-01T15%3A23%3A04.118Z&f.creation_date.facet.range.start=2010-10-01T15%3A23%3A04.118Z&f.creation_date.facet.range.end=2011-10-11T15%3A23%3A04.118Z&f.creation_date.facet.range.end=2011-10-11T15%3A23%3A04.118Z&f.creation_date.facet.range.gap=%2B10DAY&f.creation_date.facet.range.gap=%2B1MONTH&facet=true&facet.field=context&facet.field=type&facet.field=author&facet.field=commentCount&facet.query=commentCount%3A%5B*+TO+5%5D&facet.query=myfield%3A%5B5+TO+*%5D&wt=json&indent=on

I got this :

Can't parse value 2010-10-01T15%3A22%3A58.768Z for field:
creation_date/Powered by Jetty:/// 

How to get good caracter syntax, when pasting this url to browser, it is
ok...what syntax to use to send with httpclient. Here, ", " is replace by
"%2C+", etcmany caracters replaced

Extract of my code :

final SolrQuery newQuery = SolrQueryBuilder.buildQuery(.);

String queryUrl = urlServer + "/select?" + newQuery.toString();

HttpClient client = new HttpClient();

GetMethod method = null;

try
{
 URI uri = new URI(queryUrl);

  method = new GetMethod(uri.getEscapedURI());


Thanks,

Jul

sorry for poor english

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Query-url-escape-caracters-tp3412912p3412912.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: SolrJ Annotation for multiValued field

2011-10-04 Thread darul
well, another mistake, it works...sorry ;)

--
View this message in context: 
http://lucene.472066.n3.nabble.com/SolrJ-Annotation-for-multiValued-field-tp3390255p3392652.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Documents Indexed, SolrJ see nothing before long time

2011-10-04 Thread darul
Well, I guess, it is stupid to make +creation_date:[* TO NOW] filter



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Documents-Indexed-SolrJ-see-nothing-before-long-time-tp3389721p3392538.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Documents Indexed, SolrJ see nothing before long time

2011-10-04 Thread darul
Thank you Christopher, I have found one issue in my code when building a
query, thus I do not know why it is not working.

When I comment this line, I get right result count :

// solrQuery.setParam("fq", "+creation_date:[* TO NOW] +type:QUESTION");

Where creation_date is one Date field and type one String field.

I have tried those 2 lines which also make the query retrieve wrong count
whereas with curl working...:

solrQuery.addFilterQuery("creation_date:[* TO NOW]");
solrQuery.addFilterQuery("type:QUESTION");

When I remove filter on date, it works :

solrQuery.addFilterQuery("type:QUESTION");

any problem with adding 2 filters and one with a date ?

Is there any problem with this syntax for filter query and SolrJ ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Documents-Indexed-SolrJ-see-nothing-before-long-time-tp3389721p3392473.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Documents Indexed, SolrJ see nothing before long time

2011-10-03 Thread darul
Any SolrJ cache ? I am newbie ;)

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Documents-Indexed-SolrJ-see-nothing-before-long-time-tp3389721p3390942.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: SolrJ Annotation for multiValued field

2011-10-03 Thread darul
I will check tomorrow, but for a test case, I have put this code to verify
Field List mapping :

@Field("mymultivaluedfield") 
List items; 

public List getItems() {
   if (items == null) {
 items = new ArrayList();
 items.add("value1");
 
   }
  return items;
}

Is it normal it does not persist with this syntax.

A better may be to use myBean.setItems() ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/SolrJ-Annotation-for-multiValued-field-tp3390255p3390930.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Documents Indexed, SolrJ see nothing before long time

2011-10-03 Thread darul
3ms = 5 minutes ? Are you sure you are not mistaking...
3ms = 30s

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Documents-Indexed-SolrJ-see-nothing-before-long-time-tp3389721p3390515.html
Sent from the Solr - User mailing list archive at Nabble.com.


Documents Indexed, SolrJ see nothing before long time

2011-10-03 Thread darul
Hello,

While documents are indexed, I mean I can retrieve it with solr
administration console, it is taking too long, about 5 minutes before I can
see it using SolrJ API.

Do you have any Idea on how to resolve this please ?

Only specific configuration I have is :

 
 
 3 
   

Thanks,

Jul

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Documents-Indexed-SolrJ-see-nothing-before-long-time-tp3389721p3389721.html
Sent from the Solr - User mailing list archive at Nabble.com.


SolrJ Annotation for multiValued field

2011-10-03 Thread darul
Hello again,

Is it possible to persist simple list values in index using annotated bean ?

 

And in my Pojo :

@Field("one")
String myString
.


@Field("mymultivaluedfield")
List items;

Actually, nothing happends, content of this collection "ArrayList"
is not persisted into mymultivaluedfield field.

Is it the problem of nested field I have seen in many threads.

Any solutions to persist multivalue field with annotated bean ?

Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/SolrJ-Annotation-for-multiValued-field-tp3390255p3390255.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Documents Indexed, SolrJ see nothing before long time

2011-10-03 Thread darul
Or any Idea to see cache updated more quickly, I do not understand well how
caches are working in Solr.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Documents-Indexed-SolrJ-see-nothing-before-long-time-tp3389721p3389864.html
Sent from the Solr - User mailing list archive at Nabble.com.