Re: Iterative graph/nodes query

2019-01-22 Thread Dan M
Perhaps you're looking for the traversalFilter parameter of the graph query? https://lucene.apache.org/solr/guide/7_6/other-parsers.html#graph-query-parser Dan Meehl Meehl Technology Solutions Inc On Tue, Jan 22, 2019 at 7:13 AM Magnus Karlsson wrote: > Hi, > > > anyone

Tokenizer violates TokenStream.reset() contract?

2019-01-17 Thread Dan M
the input (Reader) to prevent memory leak. However, unsetting it means we can never read from the Tokenizer a 2nd time (unless you set the Reader again) and thus it violates the contract. Should there be a way to reuse Tokenizers? Thanks, Dan

different query for different dictionaries

2018-10-25 Thread Dan Rosher
Hi, If I have 2 fields e.g. location and products then I might have 2 dictionaries spell_location spell_products I cannot do for example spellcheck..q e.g. spellcheck.spell_location.q=... only spellcheck.q=... Does anyone have a workaround for this limitation? Cheers Dan

Re: Command Line Indexer

2018-09-18 Thread Dan Brown
Yup, thanks for the clarification. I see now that some of the items I list in 2 are moot. On Tue, Sep 18, 2018 at 4:16 PM Alexandre Rafalovitch wrote: > Uhm, inline: > > On 18 September 2018 at 17:05, Dan Brown wrote: > > 1. Thank you. > > > > 2. I think this

Re: Command Line Indexer

2018-09-18 Thread Dan Brown
answers all your questions. Please ask questions if anyone has more. Dan On Tue, Sep 18, 2018 at 3:21 PM Christopher Schultz < ch...@christopherschultz.net> wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA256 > > Dan, > > On 9/18/18 2:51 PM, Dan Brown wrote: >

Command Line Indexer

2018-09-18 Thread Dan Brown
. One thing to note is that it will update individual fields. That is, unlike the Data Import Handler, it does not replace entire documents. Please check it out and let me know what you think. Dan

Re: Solr LTS and EOL

2018-08-28 Thread Dan Untenzu
Hey Shawn, thanks a lot for your clarification, all questions answered. Your message should indeed find it's way onto the community page. Thanks. Dan Am 28.08.2018 um 13:18 schrieb Shawn Heisey: > On 8/28/2018 2:59 AM, Dan Untenzu wrote: >> I would like to get some feedback a

Solr LTS and EOL

2018-08-28 Thread Dan Untenzu
S over the mayor version, because maybe EOL of version 6 is just next week. Dan -- Dan Untenzu Certified TYPO3 Integrator webit! Gesellschaft für neue Medien mbH Schandauer Straße 34 | 01309 Dresden | Germany Telefon +49 351 46766-24 | Telefax +49 351 46766-66 unte...@webit.de | www.webit.de Am

spelling and collation with correctly spelt words

2018-05-10 Thread Dan .
t correctly, just overall. Cheers, Dan

StandardTokenizer and splitting on mixedcase strings

2018-02-23 Thread Dan .
Hi, The StandardTokenizerFactory splits strings like 'JavaScript' into 'Java' and 'Script', but then searches with 'javascript' do not match the document. Is there a solr way to prevent StandardTokenizer from splitting mixedcase strings? Cheers, Dan

Re: Failed to create collection SOLR 6.3 HDP 2.6.2

2017-10-30 Thread Dan Caulfield
Shawn Heisey-2 wrote > On 10/26/2017 10:29 AM, Dan Caulfield wrote: >> I'm creating a collection on a new cluster. There are six new Solr nodes >> using a HDP 2.6.2 cluster for storage. Has anyone seen similar errors? > >> ERROR: Failed to create collecti

Failed to create collection SOLR 6.3 HDP 2.6.2

2017-10-26 Thread Dan Caulfield
I'm creating a collection on a new cluster. There are six new Solr nodes using a HDP 2.6.2 cluster for storage. Has anyone seen similar errors? /usr/iopsolr/current/iop-solr/server/scripts/cloud-scripts/zkcli.sh -cmd upconfig -zkhost d2mitphmn1001.edc.nam.gm.com:2181,d2mitphmn1003.edc.nam.gm

Re: knowing which fields were successfully hit

2017-05-18 Thread Dan .
v=$qq}) Hit is x_score or y_score > 0 Note that you might get a score for both depending on data. Dan On 17 May 2017 at 12:06, John Blythe wrote: > hey erik, totally unaware of those two. we're able to retrieve metadata > about the query itself that way? > > -- > *John Bl

Re: Solr Atomic Document update Conditional

2017-05-18 Thread Dan .
Hi, Why not write a custom UpdateRequestProcessor if it's a special case from the norm then place it in it's own chain and do update like http:// .../update/json?&update.chain= Cheers, Dan On 18 May 2017 at 09:05, Aman Deep Singh wrote: > Hi , > Is their any way t

Slow Bulk InPlace DocValues updates

2017-05-18 Thread Dan .
, to complete the update quicker. Has anyone a workaround or have any suggestions? Many thanks, Dan

Re: in-place atomic updates for numeric docValue field

2017-05-04 Thread Dan .
Hi Emir, Yes I though of representing -1 as null, but this makes the index unnecessarily larger, particularly if we have to default all docs to this value. Cheers, Dan On 4 May 2017 at 15:16, Emir Arnautovic wrote: > Hi Dan, > > Remove does not make sense when it comes to in-place u

Re: in-place atomic updates for numeric docValue field

2017-05-04 Thread Dan .
Hi Shawn, Thanks for the suggestion. I gave that a try but unfortunately it didn't work. Delete somehow would be really useful, seems wasteful to have e.g. -1 representing null. Cheers, Dan On 4 May 2017 at 15:30, Shawn Heisey wrote: > On 5/4/2017 7:40 AM, Dan . wrote: > >

in-place atomic updates for numeric docValue field

2017-05-04 Thread Dan .
":{"set":null} }]' then I'd expect the popularity field to be removed, however it's not. I this a bug? or is there a know workaround for this for in-place atomic updates? Cheers, Dan

Data Import Handlers not working after upgrade from 6.3.0 to 6.4.0

2017-01-25 Thread Dan Scarf
o downgrade to Solr 6.3.0, which I'd obviously not do. Thanks in advance for your time, Dan

Re-label terms from a shard?

2015-09-29 Thread Dan Bolser
lter on the list of all synonyms defined in the mapping. I should have defined this field as an enumeration, but I think the cat's out of the bag now! Many thanks, Dan.

Re: Cloud Deployment Strategy... In the Cloud

2015-09-24 Thread Dan Davis
ant is very good at this sort of thing, and easier for Java devs to learn than Make. Python has a module called fabric that is also very fine, but for my dev. ops. it is another thing to learn. I tend to divide things into three categories: - Things that have to do with system setup, and need to

Re: Solr authentication - Error 401 Unauthorized

2015-09-12 Thread Dan Davis
, Sep 12, 2015 at 9:40 AM, Dan Davis wrote: > It seems that you have secured Solr so thoroughly that you cannot now run > bin/solr status! > > bin/solr has no arguments as yet for providing a username/password - as a > mostly user like you I'm not sure of the roadmap. > >

Re: Solr authentication - Error 401 Unauthorized

2015-09-12 Thread Dan Davis
It seems that you have secured Solr so thoroughly that you cannot now run bin/solr status! bin/solr has no arguments as yet for providing a username/password - as a mostly user like you I'm not sure of the roadmap. I think you should relax those restrictions a bit and try again. On Fri, Sep 11,

Re: Issue Using Solr 5.3 Authentication and Authorization Plugins

2015-09-10 Thread Dan Davis
ons?action=CREATE and so on... On Thu, Sep 10, 2015 at 11:10 PM, Dan Davis wrote: > Kevin & Noble, > > I've manually verified the fix for SOLR-8000, but not yet for SOLR-8004. > > I reproduced the initial problem with reloading security.json after > restarting both Solr and Z

Re: Issue Using Solr 5.3 Authentication and Authorization Plugins

2015-09-10 Thread Dan Davis
available? On Tue, Sep 8, 2015 at 1:07 PM, Kevin Lee wrote: > Thanks Dan! Please let us know what you find. I’m interested to know if > this is an issue with anyone else’s setup or if I have an issue in my local > configuration that is still preventing it to work on start/restart. >

Re: Issue Using Solr 5.3 Authentication and Authorization Plugins

2015-09-05 Thread Dan Davis
Kevin & Noble, I'll take it on to test this. I've built from source before, and I've wanted this authorization capability for awhile. On Fri, Sep 4, 2015 at 9:59 AM, Kevin Lee wrote: > Noble, > > Does SOLR-8000 need to be re-opened? Has anyone else been able to test > the restart fix? > > At

Re: analyzer, indexAnalyzer and queryAnalyzer

2015-04-30 Thread Dan Davis
Hi Doug, nice write-up and 2 questions: - You write your own QParser plugins - can one keep the features of edismax for field boosting/phrase-match boosting by subclassing edismax? Assuming yes... - What do pf2 and pf3 do in the edismax query parser? hon-lucene-synonyms plugin links correction

Re: Odp.: solr issue with pdf forms

2015-04-23 Thread Dan Davis
ndexed words contain any words from the form fields's values? If you have a form where the data is not sensitive, I can investigate. If you are interested in this contact me offline - to dansm...@gmail.com or d...@danizen.net. Thanks, Dan On Thu, Apr 23, 2015 at 11:59 AM, Erick Erickso

Re: solr issue with pdf forms

2015-04-22 Thread Dan Davis
Steve, Are you using ExtractingRequestHandler / DataImportHandler or extracting the text content from the PDF outside of Solr? On Wed, Apr 22, 2015 at 6:40 AM, wrote: > Hi guys, > > hopefully you can help me with my issue. We are using a solr setup and > have the following issue: > - usual pdf

Re: Odp.: solr issue with pdf forms

2015-04-22 Thread Dan Davis
+1 - I like Erick's answer. Let me know if that turns out to be the problem - I'm interested in this problem and would be happy to help. On Wed, Apr 22, 2015 at 11:11 AM, Erick Erickson wrote: > Are they not _indexed_ correctly or not being displayed correctly? > Take a look at admin UI>>schema

Re: Securing solr index

2015-04-13 Thread Dan Davis
Where you want true Role-Based Access Control (RBAC) on each index (core or collection), one solution is to buy Solr Enterprise from LucidWorks. My personal practice is mostly dictated by financial decisions: - Each core/index has its configuration directory in a Git repository/branch where

variable length ngramfilter highlights

2015-04-09 Thread Dan Sullivan
. I have tried numerous configurations to no avail, so it might be moot to post my configuration, however here it is. schema.xml - https://gist.github.com/dsulli99/c1d8f3536ade65e8eb35 solrconfig.xml https://gist.github.com/dsulli99/10e2af507cde4373adba Thank you, Dan

Re: What is the best way of Indexing different formats of documents?

2015-04-07 Thread Dan Davis
Sangeetha, You can also run Tika directly from data import handler, and Data Import Handler can be made to run several threads if you can partition the input documents by directory or database id. I've done 4 "threads" by having a base configuration that does an Oracle query like this: SE

Re: Customzing Solr Dedupe

2015-04-01 Thread Dan Davis
is just an internal signature. In both cases, a non-deterministic up-front algorithm can work, but if the groupId needs to be preserved, you need to work harder to make sure it all hangs together. Hope this helps, -Dan On Wed, Apr 1, 2015 at 7:05 AM, Jack Krupansky wrote: > Solr dedupe is ba

Re: Solr on Tomcat

2015-02-10 Thread Dan Davis
As an application developer, I have to agree with this direction. I ran ManifoldCF and Solr together in the same Tomcat, and the sl4j configurations of the two conflicted with strange results. From a systems administrator/operations perspective, a separate install allows better packaging, e.g.

Re: Delta import query not working

2015-02-05 Thread Dan Davis
/solr/DataImportHandlerDeltaQueryViaFullImport Hope this helps, Dan On Thu, Feb 5, 2015 at 9:30 PM, Dan Davis wrote: > It looks like you are returning the transformed ID, along with some other > fields, in the deltaQuery command.deltaQuery should only return the ID, > without the "stk

Re: Delta import query not working

2015-02-05 Thread Dan Davis
It looks like you are returning the transformed ID, along with some other fields, in the deltaQuery command.deltaQuery should only return the ID, without the "stk_" prefix, and then deltaImportQuery should retrieve the transformed ID. I'd suggest: I'm not sure which RDBMS you are using, bu

Re: clarification regarding shard splitting and composite IDs

2015-02-05 Thread Dan Davis
> docs on different shards. > > On Wed, Feb 4, 2015 at 9:06 PM, Dan Davis wrote: > >> Doesn't relevancy for that assume that the IDF and TF for user1 and user2 >> are not too different?SolrCloud still doesn't use a distributed IDF, >> corre

Re: clarification regarding shard splitting and composite IDs

2015-02-04 Thread Dan Davis
Doesn't relevancy for that assume that the IDF and TF for user1 and user2 are not too different?SolrCloud still doesn't use a distributed IDF, correct? On Wed, Feb 4, 2015 at 7:05 PM, Gili Nachum wrote: > Alright. So shard splitting and composite routing plays nicely together. > Thank you An

Re: Solr 4.9 Calling DIH concurrently

2015-02-04 Thread Dan Davis
ython have good handling of both databases and Solr, as does Java with JDBC and SolrJ. Pushing to Solr probably has more legs than Data Import Handler going forward. On Wed, Feb 4, 2015 at 11:13 AM, Dan Davis wrote: > Suresh and Meena, > > I have solved this problem by taking a row coun

Re: Solr 4.9 Calling DIH concurrently

2015-02-04 Thread Dan Davis
o do "all" our data imports from RDBMS, and Data Import Handler is the only good solution that involves writing configuration, not code. The distinction is slight, I think. Hope this helps, Dan Davis On Wed, Feb 4, 2015 at 3:02 AM, Mikhail Khludnev wrote: > Suresh, > > There

Re: role of the wiki and cwiki

2015-02-02 Thread Dan Davis
it continues to be better organized if you are reading from scratch rather than starting with Solr In Action or something like that. Thanks again, Dan On Mon, Feb 2, 2015 at 11:57 AM, Chris Hostetter wrote: > > : Because they have different potential authors, the two systems n

role of the wiki and cwiki

2015-01-30 Thread Dan Davis
I've been thinking of https://wiki.apache.org/solr/ as the "Old Wiki" and https://cwiki.apache.org/confluence/display/solr as the "New Wiki". I guess that's the wrong way to think about it - Confluence is being used for the "Solr Reference Guide", and MoinMoin is being used as a wiki. Is this the

Re: Calling custom request handler with data import

2015-01-30 Thread Dan Davis
, Dan Davis wrote: > The Data Import Handler isn't pushing data into the /update request > handler. However, Data Import Handler can be extended with transformers. > Two such transformers are the TemplateTransformer and the > ScriptTransformer. It may be possible to get a s

Re: Calling custom request handler with data import

2015-01-30 Thread Dan Davis
code. You could also just write a StandfordNerTransformer. Hope this helps, Dan On Fri, Jan 30, 2015 at 9:07 AM, vineet yadav wrote: > Hi, > I am using data import handler to import data from mysql, and I want to > identify name entities from it. So I am using following e

Re: Cannot reindex to add a new field

2015-01-29 Thread Dan Davis
For this I prefer TemplateTransformer to RegexTransformer - its not a regex, just a pattern, and so should be more efficient to use TemplateTransformer. A script will also work, of course. On Tue, Jan 27, 2015 at 5:54 PM, Alexandre Rafalovitch wrote: > On 27 January 2015 at 17:47, Carl Roberts

Re: Solr admin Url issues

2015-01-26 Thread Dan Davis
Is Jetty actually running on port 80?Do you have Apache2 reverse proxy in front? On Mon, Jan 26, 2015 at 11:02 PM, Summer Shire wrote: > Hi All, > > Running solr (4.7.2) locally and hitting the admin page like this works > just fine http://localhost:8983/solr/ #

Re: How to implement Auto complete, suggestion client side

2015-01-26 Thread Dan Davis
Cannot get any easier than jquery-ui's autocomplete widget - http://jqueryui.com/autocomplete/ Basically, you set some classes and implement a javascript that calls the server to get the autocomplete data. I never would expose Solr to browsers, so I would have the AJAX call go to a php script (o

Re: Indexed epoch time in Solr

2015-01-26 Thread Dan Davis
I think copying to a new Solr date field is your best bet, because then you have the flexibility to do date range facets in the future. If you can re-index, and are using Data Import Handler, Jim Musil's suggestion is just right. If you can re-index, and are not using Data Import Handler: - T

Re: Need help importing data

2015-01-26 Thread Dan Davis
Glad it worked out. On Fri, Jan 23, 2015 at 9:50 PM, Carl Roberts wrote: > NVM > > I figured this out. The problem was this: pk="link" in > rss-dat.config.xml but unique id not link in schema.xml - it is id. > > From rss-data-config.xml: > > *pk="link"* > url="https://nvd.nist.gov/feeds/xml/c

Re: Need Help with custom ZIPURLDataSource class

2015-01-26 Thread Dan Davis
I have seen such errors by looking under Logging in the Solr Admin UI. There is also the LogTransformer for Data Import Handler. However, it is a design choice in Data Import Handler to skip fields not in the schema. I would suggest you always use Debug and Verbose to do the first couple of docu

Re: [MASSMAIL]Weighting of prominent text in HTML

2015-01-26 Thread Dan Davis
n Mon, Jan 26, 2015 at 12:47 AM, Jorge Luis Betancourt González < jlbetanco...@uci.cu> wrote: > Hi Dan: > > Agreed, this question is more Nutch related than Solr ;) > > Nutch doesn't send any data into /update/extract request handler, all the > text and metadata extra

Weighting of prominent text in HTML

2015-01-25 Thread Dan Davis
many, Dan Davis, Systems/Applications Architect National Library of Medicine

Re: solr replication vs. rsync

2015-01-25 Thread Dan Davis
@Erick, Problem space is not constant indexing. I thought SolrCloud replicas were replication, and you imply parallel indexing. Good to know. On Sunday, January 25, 2015, Erick Erickson wrote: > @Shawn: Cool table, thanks! > > @Dan: > Just to throw a different spin on it, if yo

Re: solr replication vs. rsync

2015-01-25 Thread Dan Davis
Thanks! On Sunday, January 25, 2015, Erick Erickson wrote: > @Shawn: Cool table, thanks! > > @Dan: > Just to throw a different spin on it, if you migrate to SolrCloud, then > this question becomes moot as the raw documents are sent to each of the > replicas so you very rarel

solr replication vs. rsync

2015-01-24 Thread Dan Davis
When I polled the various projects already using Solr at my organization, I was greatly surprised that none of them were using Solr replication, because they had talked about "replicating" the data. But we are not Pinterest, and do not expect to be taking in changes one post at a time (at least th

Re: OutOfMemoryError for PDF document upload into Solr

2015-01-15 Thread Dan Davis
Why re-write all the document conversion in Java ;) Tika is very slow. 5 GB PDF is very big. If you have a lot of PDF like that try pdftotext in HTML and UTF-8 output mode. The HTML mode captures some meta-data that would otherwise be lost. If you need to go faster still, you can also writ

Re: Occasionally getting error in solr suggester component.

2015-01-13 Thread Dan Davis
Related question - I see mention of needing to rebuild the spellcheck/suggest dictionary after solr core reload. I see spellcheckIndexDir in both the old wiki entry and the solr reference guide . If this parameter is provided, it

Re: Best way to implement Spotlight of certain results

2015-01-13 Thread Dan Davis
rickson : > > > Hmm, I wonder if the RerankingQueryParser might help here? > > See: https://cwiki.apache.org/confluence/display/solr/Query+Re-Ranking > > > > Best, > > Erick > > > > On Fri, Jan 9, 2015 at 10:35 AM, Dan Davis wrote: > > > I

Re: Logging in Solr's DataImportHandler

2015-01-13 Thread Dan Davis
Mikhail, Thanks - it works now.The script transformer was really not needed, a template transformer is clearer, and the log transformer is now working. On Mon, Dec 8, 2014 at 1:56 AM, Mikhail Khludnev wrote: > Hello Dan, > > Usually it works well. Can you describe how y

Improved suggester question

2015-01-13 Thread Dan Davis
The suggester is not working for me with Solr 4.10.2 Can anyone shed light over why I might be getting the exception below when I build the dictionary? 500 26 len must be <= 32767; got 35680 java.lang.IllegalArgumentException: len must be <= 32767; got 35680 at org.apache.lucene.util.Offline

Suggester questions

2015-01-13 Thread Dan Davis
I am having some trouble getting the suggester to work. The spell requestHandler is working, but I didn't like the results I was getting from the word breaking dictionary and turned them off. So some basic questions: - How can I check on the status of a dictionary? - How can I see what is

Best way to implement Spotlight of certain results

2015-01-09 Thread Dan Davis
demo and a brief evaluation period, but that either approach 2 or 3 will be the winner. Here's a snippet from my current qf/pf configuration: title^100 alttitle_t^100 ... text title^1000 alttitle_t^1000 ... text^10 Thanks, Dan Davis

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Dan Davis
What about the frequency comparison - I haven't used the spellchecker heavily, but it seems that if "bnak" is in the database, but "bank" is much more frequent, then "bank" should be a suggestion anyway... On Wed, Dec 17, 2014 at 10:41 AM, Erick Erickson wrote: > > First, I'd look in your corpus

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
Yes, that worked quite well. I still need the "//tagname" but that is the only DIH incantation I need. This will substantially accelerate things. On Mon, Dec 8, 2014 at 5:37 PM, Dan Davis wrote: > The problem is that XPathEntityProcessor implements Xpath on its own, and &

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
urces and newsletter: http://www.solr-start.com/ and @solrstart > Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 > > > On 8 December 2014 at 17:01, Dan Davis wrote: > > In experimentation with a much simpler and smaller XML file, it doesn't > >

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
is beginning to same dangerous. Is there any short-hand for the current node or the match? On Mon, Dec 8, 2014 at 4:42 PM, Dan Davis wrote: > When I have a forEach attribute like the following: > > > forEach="/medical-topics/medical-topic/health-topic[@language='English

DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
When I have a forEach attribute like the following: forEach="/medical-topics/medical-topic/health-topic[@language='English']" And then need to match an attribute of that, is there any alternative to spelling it all out: I suppose I could do "//health-topic/@url" since the document should

Re: Tika HTTP 400 Errors with DIH

2014-12-08 Thread Dan Davis
I would say that you could determine a row that gives a bad URL, and then run it in DIH admin interface (or the command-line) with "debug" enabled The url parameter going into tika should be present in its transformed form before the next entity gets going. This works in a similar scenario for me

Logging in Solr's DataImportHandler

2014-12-05 Thread Dan Davis
I have a script transformer and a log transformer, and I'm not seeing the log messages, at least not where I expect. Is there anyway I can simply log a custom message from within my script? Can the script easily interact with its containers logger?

Fwd: Best Practices for open source pipeline/connectors

2014-11-10 Thread Dan Davis
onfiguration software. So, my organizations' needs and mine may be different... -- Forwarded message -- From: "Jürgen Wagner (DVT)" Date: Tue, Nov 4, 2014 at 4:48 PM Subject: Re: Best Practices for open source pipeline/connectors To: solr-user@lucene.apache.

Re: Best Practices for open source pipeline/connectors

2014-11-04 Thread Dan Davis
lr resources and newsletter: http://www.solr-start.com/ and @solrstart > Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 > > > On 4 November 2014 16:01, Dan Davis wrote: > > I'm trying to do research for my organization on the best practices for > > ope

Best Practices for open source pipeline/connectors

2014-11-04 Thread Dan Davis
ideas? Thanks, Dan

Re: Tika Integration problem with DIH and JDBC

2014-11-04 Thread Dan Davis
All, The problem here was that I gave driver="BinURLDataSource" rather than type="BinURLDataSource". Of course, saying driver="BinURLDataSource" caused it not to be able to find it.

Re: facet on field aliases of same field

2014-10-30 Thread Dan Field
t; > -Michael > > -Original Message- > From: Dan Field [mailto:d...@llgc.org.uk] > Sent: Wednesday, October 29, 2014 5:54 AM > To: solr-user@lucene.apache.org > Subject: facet on field aliases of same field > > Hi, we have a use case where we are trying to c

facet on field aliases of same field

2014-10-29 Thread Dan Field
hing the problem in the wrong way? Server is Solr 4.1 Any help appreciated -- Dan Field mailto:d...@llgc.org.uk>> Ffôn/Tel. +44 1970 632 582 Pennaeth Uned DatblyguHead of Development Unit Llyfrgell Genedlaethol Cymru National Library of Wales

Re: javascript form data save to XML in server side

2014-10-22 Thread Dan Davis
I always, always have a web application running that accepts the JavaScript AJAX call and then forwards it on to the Apache Solr request handler. Even if you don't control the web application, and can only add JavaScript, you can put up a API oriented webapp somewhere that only protects Solr for a

Re: import solr source to eclipse

2014-10-16 Thread Dan Davis
I had a problem with the "ant eclipse" answer - it was unable to resolve "javax.activation" for the Javadoc. Updating solr/contrib/dataimporthandler-extras/ivy.xml as follows did the trick for me: - + What I'm trying to do is to construct a failing Unit test for something that I think is a b

Re: Problem with DIH

2014-10-16 Thread Dan Davis
This seems a little abstract. What I'd do is double check that the SQL is working correctly by running the stored procedure outside of Solr and see what you get. You should also be able to look at the corresponding .properties file and see the inputs used for the delta import. If the data impo

Re: Tika Integration problem with DIH and JDBC

2014-10-10 Thread Dan Davis
it out, mention the Solr version on > the next email. Sometimes it makes difference, though DIH has been > largely unchanged for a while. > > -- Forwarded message -- > From: Dan Davis > Date: 10 October 2014 15:00 > Subject: Re: Tika Integration problem wit

Tika Integration problem with DIH and JDBC

2014-10-10 Thread Dan Davis
What I want to do is to pull an URL out of an Oracle database, and then use TikaEntityProcessor and BinURLDataSource to go fetch and process that URL. I'm having a problem with this that seems general to JDBC with Tika - I get an exception as follows: Exception in entity : extract:org.apache.sol

RE: SolrCloud Issues

2014-07-17 Thread Dan Murphy
Have you deployed ZooKeeper on servers other than the Solr nodes? If you have them on the Solr nodes, then you may be getting elections when under pressure. -Original Message- From: Shawn Heisey [mailto:s...@elyograg.org] Sent: 17 July 2014 14:25 To: solr-user@lucene.apache.org Subject:

Re: Field for 'species' data?

2014-07-05 Thread Dan Bolser
; the rest > of the hierarchy path for that species. > > -- Jack Krupansky > > -Original Message- From: Dan Bolser > Sent: Saturday, July 5, 2014 10:36 AM > To: solr-user@lucene.apache.org > Subject: Re: Field for 'species' data? > > One requirement

Re: Field for 'species' data?

2014-07-05 Thread Dan Bolser
hat fields you need to > populate, the analysis and processing details can be worked out. > > -- Jack Krupansky > > -Original Message- From: Dan Bolser > Sent: Saturday, July 5, 2014 4:49 AM > To: solr-user > Subject: Re: Field for 'species' data? > > I&

Re: Field for 'species' data?

2014-07-05 Thread Dan Bolser
I'm super noob... Why choose to write it add a custom update request processor rather than an analysis pipeline? Cheers, Dan. On 5 Jul 2014 03:45, "Alexandre Rafalovitch" wrote: > Do that with a custom update request processor. > > Just remember Solr is there to fin

Re: Field for 'species' data?

2014-07-04 Thread Dan Bolser
/lucene/analysis/path/PathHierarchyTokenizerFactory.html > > Or maybe a combination of the two approaches. > > I think I have some examples of it in my e-book. > > -- Jack Krupansky > > -Original Message- From: Dan Bolser > Sent: Friday, July 4, 2014 11:57 AM > To: s

Re: Field for 'species' data?

2014-07-04 Thread Dan Bolser
Hordeum / Hordeum vulgare / Hordeum vulgare var. hybernum" but rather SPECIES: "Hordeum vulgare". How can I add in that data at analysis time? Cheers, Dan. On 4 July 2014 04:19, Gora Mohanty wrote: > On 3 July 2014 21:40, Dan Bolser wrote: >> >> Hi, >> >&g

Field for 'species' data?

2014-07-03 Thread Dan Bolser
ar. vulgare var. hybernum Viborg, a specific variety of Hordeum vulgare... I figure I can't be the first person to look at this? Thanks for any tips, Dan.

Re: Configure Solr to use http proxy for querying shards?

2013-10-02 Thread Dan Bolser
t=8080 Cheers, Dan. On 2 October 2013 12:32, Dan Bolser wrote: > Hi, > > I'm running my Solr server on a box which can only see 'the web' via > an http proxy. For most applications it's as simple as setting > http_proxy in the shell environment to get them t

Configure Solr to use http proxy for querying shards?

2013-10-02 Thread Dan Bolser
y server running on this box, I get the following errors: "error":"org.apache.http.conn.HttpHostConnectException: Connection to http://sites.out.there refused", "trace":"... Can I get Solr to use the http proxy, or am I stuck? Cheers, Dan.

Re: Nagle's Algorithm

2013-09-29 Thread Dan Davis
I don't keep up with this list well enough to know whether anyone else answered. I don't know how to do it in jetty.xml, but you can certainly tweak the code. java.net.Socket has a method setTcpNoDelay() that corresponds with the standard Unix system calls. Long-time past, my suggestion of this

Excluding a facet's constraint to exclude a facet

2013-09-24 Thread Dan Davis
Summary - when constraining a search using filter query, how can I exclude the constraint for a particular facet? Detail - Suppose I have the following facet results for a query "q=* mainquery*": 491 111 103 ... ... I understand from http://people.apache.org/~hossman/apachecon2010/facets/an

Re: More on topic of Meta-search/Federated Search with Solr

2013-08-28 Thread Dan Davis
On Mon, Aug 26, 2013 at 9:06 PM, Amit Jha wrote: > Would you like to create something like > http://knimbus.com > I work at the National Library of Medicine. We are moving our library catalog to a newer platform, and we will probably include articles. The article's content and meta-data are

Re: More on topic of Meta-search/Federated Search with Solr

2013-08-28 Thread Dan Davis
On Tue, Aug 27, 2013 at 3:33 AM, Bernd Fehling < bernd.fehl...@uni-bielefeld.de> wrote: > Years ago when "Federated Search" was a buzzword we did some development > and > testing with Lucene, FAST Search, Google and several other Search Engines > according Federated Search in Library context. > Th

Re: More on topic of Meta-search/Federated Search with Solr

2013-08-28 Thread Dan Davis
On Tue, Aug 27, 2013 at 2:03 AM, Paul Libbrecht wrote: > Dan, > > if you're bound to federated search then I would say that you need to work > on the service guarantees of each of the nodes and, maybe, create > strategies to cope with bad nodes. > > paul > +1 I'll think on that.

Re: How to Manage RAM Usage at Heavy Indexing

2013-08-28 Thread Dan Davis
This could be an operating systems problem rather than a Solr problem. CentOS 6.4 (linux kernel 2.6.32) may have some issues with page flushing and I would read-up up on that. The VM parameters can be tuned in /etc/sysctl.conf On Sun, Aug 25, 2013 at 4:23 PM, Furkan KAMACI wrote: > Hi Erick; > >

Re: Storing query results

2013-08-28 Thread Dan Davis
You could copy the existing core to a new core every once in awhile, and then do your delta indexing into a new core once the copy is complete. If a Persistent URL for the search results included the name of the original core, the results you would get from a bookmark would be stable. However, if

Re: More on topic of Meta-search/Federated Search with Solr

2013-08-26 Thread Dan Davis
One more question here - is this topic more appropriate to a different list? On Mon, Aug 26, 2013 at 4:38 PM, Dan Davis wrote: > I have now come to the task of estimating man-days to add "Blended Search > Results" to Apache Solr. The argument has been made that this is not

Re: More on topic of Meta-search/Federated Search with Solr

2013-08-26 Thread Dan Davis
node; which can turn to be quite bad if you > do not exercise guarantees of remote sources. > > Or are the "remote cores" below actually things that you manage on your > side? If yes guarantees are easy to manage.. > > Paul > > > Le 26 août 2013 à 22:38, Da

Re: More on topic of Meta-search/Federated Search with Solr

2013-08-26 Thread Dan Davis
in a way that "makes sense". But I don't know > how you'd just get "the right thing" to happen with some kind > of scoring magic. > > Best > Erick > > > On Fri, Aug 16, 2013 at 4:07 PM, Dan Davis wrote: > >> I've thought abou

  1   2   3   >