On Thu, Dec 10, 2009 at 11:52 AM, Jason Rutherglen
jason.rutherg...@gmail.com wrote:
I assume there isn't one? Anything in the works?
Nope.
--
Regards,
Shalin Shekhar Mangar.
most likely go for a master/slave
deployment in production.
We use boxes with quad cores, 16 gig RAM, SCSI disks. YMMV.
--
Regards,
Shalin Shekhar Mangar.
street, chattanooga, FL
32212/address/listing/field
/doc
/add
You need to XML encode the value of the content field.
--
Regards,
Shalin Shekhar Mangar.
is 4GB RAM and 4 core x 2 CPUs.
are you suggesting us to increase the configuration?
4GB RAM for a 11GB index seems to be on the low side. It would be best to
benchmark performance on your data with the queries you expect to be made.
--
Regards,
Shalin Shekhar Mangar.
you represent this condition in fq paramenter of dismax
Are you saying that the above syntax does not work in an fq? Note, the or
should be in capitals.
--
Regards,
Shalin Shekhar Mangar.
to use a TokenFilter which remove
characters and just keeps the integer/long values. But you still won't be
able to use the LongField because that is not analyzed (so your token
filters will not be applied).
--
Regards,
Shalin Shekhar Mangar.
. What
do they signify?
See
http://wiki.apache.org/solr/UpdateXmlMessages#Optional_attributes_for_.22commit.22_and_.22optimize.22
--
Regards,
Shalin Shekhar Mangar.
?
--
Regards,
Shalin Shekhar Mangar.
://wiki.apache.org/solr/CoreAdmin
Or you could combine them into the same index. That is usually the easier
solution.
See
http://wiki.apache.org/solr/MultipleIndexes#Flattening_Data_Into_a_Single_Index
--
Regards,
Shalin Shekhar Mangar.
On Tue, Dec 8, 2009 at 11:48 AM, Jason Rutherglen
jason.rutherg...@gmail.com wrote:
If I've got multiple cores on a server, I guess I need multiple
rsyncd's running (if using the shell scripts)?
Yes. I'd highly recommend using the Java replication though.
--
Regards,
Shalin Shekhar Mangar.
( fieldA ) from table where fieldA IN
That sounds similar to faceting. See
http://wiki.apache.org/solr/SimpleFacetParameters
Perhaps you can give more details on what you want to achieve.
--
Regards,
Shalin Shekhar Mangar.
can search on all cores if schema.xml is same. See
http://wiki.apache.org/solr/DistributedSearch
If schema.xml is different, you can search on one core only. You can
denormalize and combine cores if you want to search on all of them.
--
Regards,
Shalin Shekhar Mangar.
cases to the patch in the future.
Thanks Ahmet, I've marked the issue for 1.5 so we do not forget about it.
I'll take a look at the patch soon.
--
Regards,
Shalin Shekhar Mangar.
haven't yet seen this issue elsewhere.
--
Regards,
Shalin Shekhar Mangar.
.
--
Regards,
Shalin Shekhar Mangar.
think I should execute solr queries twice ?
or is there any other workarounds
On Mon, Nov 30, 2009 at 3:07 PM, Shalin Shekhar Mangar
shalinman...@gmail.com wrote:
On Mon, Nov 30, 2009 at 2:26 PM, Mark N nipen.m...@gmail.com wrote:
field2=xyz we dont know until we run query1
Ah
--
Regards,
Shalin Shekhar Mangar.
)
clients able to do queries to the slave? If not, is that a strategy that I
should use to allow the website to continue to work during the pull/install?
Yes, queries can be done at any time, even in the middle of a snapinstall.
--
Regards,
Shalin Shekhar Mangar.
crawler, look at Nutch. Otherwise, you may need to build
something using Driods or Aperture.
http://lucene.apache.org/nutch/
http://incubator.apache.org/droids/
http://aperture.sourceforge.net/
--
Regards,
Shalin Shekhar Mangar.
to modify the
stored part as well, then only an UpdateRequestProcessor can do that. In
other words, the field's value after applying UpdateRequestProcessors is fed
into analyzers (for indexed field) and stored verbatim (for stored fields).
--
Regards,
Shalin Shekhar Mangar.
or the ending edge therefore you
can't match words in the middle of a phrase. Try using NGramFilterFactory
instead.
--
Regards,
Shalin Shekhar Mangar.
QParser.
--
Regards,
Shalin Shekhar Mangar.
instance, you can do writes concurrently without a problem.
--
Regards,
Shalin Shekhar Mangar.
because it wasn't
designed to be extensible. Please open a jira issue.
--
Regards,
Shalin Shekhar Mangar.
?
You need to provide urls of the cores in the distributed search request. It
will make HTTP calls to the specified cores but there is no way around that
right now.
http://wiki.apache.org/solr/DistributedSearch
Why do you want to search across cores on the same Solr?
--
Regards,
Shalin Shekhar
still needs LotsOfWork :)
--
Regards,
Shalin Shekhar Mangar.
that.
--
Regards,
Shalin Shekhar Mangar.
issues:
1. https://issues.apache.org/jira/browse/SOLR-1358
2. https://issues.apache.org/jira/browse/SOLR-1583
--
Regards,
Shalin Shekhar Mangar.
is just smaller and easier to embed and therefore Solr uses it
for testing and examples, though there are lots of shops which use the Jetty
shipped with Solr releases.
--
Regards,
Shalin Shekhar Mangar.
guess when you try to add documents and an IndexWriter is opened, the data
directory is created if it does not exist. Since it calls File#mkdirs, all
parent directories are also created. I don't think Solr creates those
directories by itself.
--
Regards,
Shalin Shekhar Mangar.
or the log4j
mailing lists.
--
Regards,
Shalin Shekhar Mangar.
compatible as long as your application does not
try to index or query them.
Modifying the field type of an existing field or adding/removing/modifying
tokenizers or filters on a field type is usually an incompatible change and
needs re-indexing of affected documents.
--
Regards,
Shalin Shekhar
for this or
is it published elsewhere ?
The war is not published as a maven artifact. Why would you need the war in
maven?
--
Regards,
Shalin Shekhar Mangar.
it. If your client really want
complete control you'd have to educate them on solr's query syntax (or
perhaps create a simpler query syntax) but I wouldn't suggest going that
way.
--
Regards,
Shalin Shekhar Mangar.
solution).
I see. If people find it generally useful, we could publish the war too.
Patches welcome :)
--
Regards,
Shalin Shekhar Mangar.
core instance
in memory after reading the configuration from disk.
--
Regards,
Shalin Shekhar Mangar.
making a patch to
the
artifact publishing process?
That'd be great. See http://wiki.apache.org/solr/HowToContribute
--
Regards,
Shalin Shekhar Mangar.
to return just one field from all documents? If yes, you can:
1. Query with q=*:*fl=name
2. Use TermsComponent - http://wiki.apache.org/solr/TermsComponent
--
Regards,
Shalin Shekhar Mangar.
://travel.aol.com/
Try typing something into the top most search box.
--
Regards,
Shalin Shekhar Mangar.
,
Shalin Shekhar Mangar.
spellcheck.build set to
true. This was a configuration eror on our part. I was wondering if the
spellcheck index being re-built for each query could have caused the above
exception to occur.
I don't know. Rebuilding the index for each query is not a good idea
anyways.
--
Regards,
Shalin Shekhar
, it depends. Each commit/optimize will re-create the spell check index
with those options. So, it is best if you test it out with your index,
queries and load.
--
Regards,
Shalin Shekhar Mangar.
Shekhar Mangar.
would be happy to contribute it, what should i do?
That'd be great. Please open an issue in Jira and attach a patch. See
http://wiki.apache.org/solr/HowToContribute
--
Regards,
Shalin Shekhar Mangar.
--
Regards,
Shalin Shekhar Mangar.
[0xabc0,0xb1028b90,0xe140)
PSPermGen total 131072K, used 44021K [0x8bc0, 0x93c0,
0xabc0)
object space 131072K, 33% used [0x8bc0,0x8e6fd460,0x93c0)
--
Lance Norskog
goks...@gmail.com
--
Regards,
Shalin Shekhar Mangar.
the
spellcheck.dictionary parameter.
--
Regards,
Shalin Shekhar Mangar.
to the same schema), then it can
be a worthwhile addition to Solrj. Can you open an issue? A patch would be
even better :)
--
Regards,
Shalin Shekhar Mangar.
Shekhar Mangar.
?
Any ideas on what the bottleneck should be? I think even a standard
drive can do writes of a bit of 40MB/s, and certainly reads over that.
Thoughts?
We are seeing transfer speeds of around 66 MBPS right now in the same data
center.
--
Regards,
Shalin Shekhar Mangar.
, thanks for the clarification. You need to add preserveOriginal=1 to
your index-time WDF configuration. This will index the original token as
well as the parts so that all of powershot, power and shot should
match powerShot. Make sure you re-index after making the changes.
--
Regards,
Shalin Shekhar
to the $deleteDocById flag.
I know this is a basic question but you are using Solr 1.4, aren't you?
--
Regards,
Shalin Shekhar Mangar.
,
Shalin Shekhar Mangar.
be $deleteDocByQuery and value
should be the Solr query to be used for deletion. You can write the
transformer in Java as well as Javascript.
--
Regards,
Shalin Shekhar Mangar.
the number 4 (the
type would depend on the sql type). Also, for javascript transformers to
work, you must use JDK 6 which has javascript support. Rest looks fine to
me.
--
Regards,
Shalin Shekhar Mangar.
configurations for above patch?
How to set master and slave?
Pravin, questions on specific patches are best asked on the Jira issue.
--
Regards,
Shalin Shekhar Mangar.
to http://wiki.apache.org/solr/PublicServers?
--
Regards,
Shalin Shekhar Mangar.
. If you are using
Solrj client, then you can use
QueryRequest#setMethod(SolrRequest.METHOD.POST)
--
Regards,
Shalin Shekhar Mangar.
On Wed, Oct 14, 2009 at 8:54 PM, Ankit Bhatnagar abhatna...@vantage.comwrote:
Solrj1.4 supports QueryRequest#setMethod(SolrRequest.METHOD.POST)
but Solrj1.3 does not.
I just checked the 1.3 release. It most definitely exists in 1.3
--
Regards,
Shalin Shekhar Mangar.
changing these parameters.
--
Regards,
Shalin Shekhar Mangar.
do writes of a bit of 40MB/s, and certainly reads over that.
Thoughts?
--
- Mark
http://www.lucidimagination.com
--
-
Noble Paul | Principal Engineer| AOL | http://aol.com
--
Regards,
Shalin Shekhar Mangar.
--
-
Noble Paul | Principal Engineer| AOL | http://aol.com
--
Regards,
Shalin Shekhar Mangar.
Lucene Specification Version: 2.9-dev
Lucene Implementation Version: 2.9-dev 794238 - 2009-07-15 18:05:08
This was built on 2009-07-28. The property injection stuff was committed to
trunk on 2009-09-27. Can you please try using the latest nightly?
--
Regards,
Shalin Shekhar Mangar.
://wiki.apache.org/solr/SolrReplication
--
Regards,
Shalin Shekhar Mangar.
does not limit documents. It is used for finding the number of
documents matching the query. In order to filter the result set you should
use filter query e.g. fq=Memory_s:1 GB
--
Regards,
Shalin Shekhar Mangar.
to see if it has any exceptions recorded?
--
Regards,
Shalin Shekhar Mangar.
public void withFieldsAndXpathStream() throws Exception {
Map entityAttrs = createMap(name, e, url, cd.xml,
XPathEntityProcessor.FOR_EACH, /catalog/cd, stream, true,
batchSize,1);
--
Regards,
Shalin Shekhar Mangar.
Index: contrib/dataimporthandler/src/test/java/org/apache/solr
=solr.RunExecutableListener
str name=exesnapshooter/str
str name=dirsolr/bin/str
bool name=waittrue/bool
/listener
/updateHandler
I'm not able to reproduce the problem. The snapshots are created regardless
of the spellchecker having buildOnCommit on or off.
--
Regards,
Shalin Shekhar Mangar.
= :ascending}])
Hope that helps.
--
Regards,
Shalin Shekhar Mangar.
be to use a
LocalSolrQueryRequest with qt as the handler name. See QuerySenderListener
for an example.
--
Regards,
Shalin Shekhar Mangar.
is the time at which the property file was written. This
is automatically added by Java's Properties class.
--
Regards,
Shalin Shekhar Mangar.
of Solr are you using? Also, any data
which can help us reproduce the problem would be nice.
--
Regards,
Shalin Shekhar Mangar.
last_index_time when indexing starts?
The last_index_time is recorded before any SQL queries are fired so I don't
think any rows could be missed. Some could be imported more than once
though.
--
Regards,
Shalin Shekhar Mangar.
problems.
Or you could escape the hyphen character. If you are using SolrJ, use
ClientUtils.escapeQueryChars on the query string.
--
Regards,
Shalin Shekhar Mangar.
such as name, instanceDir, dataDir, configName, schemaName
--
Regards,
Shalin Shekhar Mangar.
enable which can be true or false. You can control the
value (true/false) through a variable. So you can duplicate just the handle
instead of the complete solrconfig.xml
--
Regards,
Shalin Shekhar Mangar.
would be appreciated
I vaguely remember a bug which caused the slave to loop. Can you upgrade to
the latest nightly and see if that solves the problem?
--
Regards,
Shalin Shekhar Mangar.
to cores for our LotsOfCores effort.
http://wiki.apache.org/solr/LotsOfCores
--
Regards,
Shalin Shekhar Mangar.
--
-
Noble Paul | Principal Engineer| AOL | http://aol.com
--
Regards,
Shalin Shekhar Mangar.
/
/entity
/entity
/document
/dataConfig
-Jay
--
-
Noble Paul | Principal Engineer| AOL | http://aol.com
--
Regards,
Shalin Shekhar Mangar.
=/admin/cores shareSchema=true
core name=core0 instanceDir=./
property name=shardsParam
value=localhost:9990/core1,localhost:9990/core2,localhost:9990/core3,localhost:9990/core4/
/core
/cores
--
Regards,
Shalin Shekhar Mangar.
will be picked up.
--
Regards,
Shalin Shekhar Mangar.
this ?
It is possible but you have to take care to match Solr's schema with the
structure of documents in the Lucene index. The correct field names and
query-analyzers should be configured in schema.xml
--
Regards,
Shalin Shekhar Mangar.
want to create scalable and optimal search system.
Is this possible with solr?
Of course you can create a scalable and optimal search system with Solr. We
do that all the time ;)
--
Regards,
Shalin Shekhar Mangar.
fields as stored then you can write a program to read all
documents and write new ones with the same values.
--
Regards,
Shalin Shekhar Mangar.
to achieve? What kind of reports
are you looking for?
--
Regards,
Shalin Shekhar Mangar.
$MergeThread.run(ConcurrentMergeScheduler.java:291)
Oct 6, 2009 2:12:37 AM org.apache.solr.core.SolrCore execute
INFO: [] webapp=/build-2 path=/update params={} status=500 QTime=9110523
--
Regards,
Shalin Shekhar Mangar.
: it really is meant to take UTF-8 as input, even though it is
named ISOLatin1AccentFilter, isn't it?
See http://markmail.org/message/hi25u5iqusfu542b
--
Regards,
Shalin Shekhar Mangar.
matches.
http://lucene.apache.org/java/2_9_0/queryparsersyntax.html#Proximity%20Searches
Perhaps somebody else can weigh on your question #3 and #4.
--
Regards,
Shalin Shekhar Mangar.
a better understanding of what it
does.
maxCommitsToKeep is the number of commit points (a point-in-time snapshot of
the index) to keep from getting deleted. But deletion of commit points only
happens on startup or when someone calls commit/optimize.
--
Regards,
Shalin Shekhar Mangar.
independent of the entity name/ datasource. For me they are all
records of the same type.
That is very much possible out of the box.
--
Regards,
Shalin Shekhar Mangar.
, do let us know your feedback:
https://issues.apache.org/jira/browse/SOLR-785
--
Regards,
Shalin Shekhar Mangar.
want to plot on a bar chart?
I don't know of a reporting tool which can hook into Solr for creating such
things.
--
Regards,
Shalin Shekhar Mangar.
as an environment variable, you can
remove the default value.
--
Regards,
Shalin Shekhar Mangar.
for posting data to Solr.
--
Regards,
Shalin Shekhar Mangar.
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to
execute query: select * from newheader where ImportTime 'Wed Oct 07
Thanks for reporting the error. This seems to be a bug. I've opened an
issue:
https://issues.apache.org/jira/browse/SOLR-1496
--
Regards,
Shalin Shekhar Mangar.
are
not overwriting the records.
Yes, good point. That is a typical problem with sharded databases with
auto-increment primary key. If you do not have unique keys, you can
concatenate the shard name with the value of the primary key.
--
Regards,
Shalin Shekhar Mangar.
.
Also note that full-import deletes all existing documents. So if you write
such a program which changes DB conf details, make sure you invoke the
import command (new in Solr 1.4) to avoid deleting the other documents.
--
Regards,
Shalin Shekhar Mangar.
a searcher just-in-time for SOLR-1293. We
have use-cases where a SolrCore is loaded only for indexing and then
unloaded.
--
Regards,
Shalin Shekhar Mangar.
though if there's any config setting to pass to
DataImportHandler to specify the desired date/time format to use.
There is no configuration to change this. However, you can write your own
Evaluator to output ${dih.last_index_time} in whatever format you prefer.
--
Regards,
Shalin Shekhar Mangar.
701 - 800 of 1747 matches
Mail list logo