Re: DIH - Export to XML

2009-10-30 Thread Noble Paul നോബിള്‍ नोब्ळ्
nope...

On Sat, Oct 31, 2009 at 4:47 AM, Yao Ge  wrote:
>
> For Data Import Handler, there is a way to dump data to a SOLR feed format
> XML file?
> --
> View this message in context: 
> http://old.nabble.com/DIH---Export-to-XML-tp26138213p26138213.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>



-- 
-
Noble Paul | Principal Engineer| AOL | http://aol.com


Re: solr search

2009-10-30 Thread manishkbawne

Hi,
I have made following changes in solrconfig.xml

   

C:/Apache-Tomcat/apache-tomcat-6.0.20/solr/conf/db-data-config.xml

   


in db-dataconfig.xml 
 
   
 


   
 


in schema.xml files


Please suggest me the possible cause of error??




Lance Norskog-2 wrote:
> 
> Please post your dataimporthandler configuration file.
> 
> On Fri, Oct 30, 2009 at 4:17 AM, manishkbawne 
> wrote:
>>
>> Thanks for your reply .. I am trying to use the database for solr search
>> but
>> getting this error..
>>
>> false in null
>> -
>> java.lang.NullPointerException at
>> org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95)
>> at
>> org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106)
>> at org.apache.solr.core.SolrResourceLoader
>>
>> Can you please suggest me some possible solution?
>>
>>
>>
>>
>>
>>
>>
>>
>> Karsten F. wrote:
>>>
>>> hi manishkbawne,
>>>
>>> unspecific ideas of search improvements are her:
>>> http://wiki.apache.org/solr/SolrPerformanceFactors
>>>
>>> I really like the last idea in
>>> http://wiki.apache.org/lucene-java/ImproveSearchingSpeed
>>> :
>>> Use a profiler and ask a more specific question in this forum.
>>>
>>> Best regards
>>>   Karsten
>>>
>>>
>>>
>>> manishkbawne wrote:

 I am using solr search to search through xml files. As I am working on
 millions of data, the result output is slower. Can anyone please
 suggest
 me some way, by which I can increase the search result output?

>>>
>>>
>>
>> --
>> View this message in context:
>> http://old.nabble.com/solr-search-tp26125183p26128341.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
> 
> 
> 
> -- 
> Lance Norskog
> goks...@gmail.com
> 
> 

-- 
View this message in context: 
http://old.nabble.com/solr-search-tp26125183p26139946.html
Sent from the Solr - User mailing list archive at Nabble.com.



DIH - Export to XML

2009-10-30 Thread Yao Ge

For Data Import Handler, there is a way to dump data to a SOLR feed format
XML file?
-- 
View this message in context: 
http://old.nabble.com/DIH---Export-to-XML-tp26138213p26138213.html
Sent from the Solr - User mailing list archive at Nabble.com.



SpellCheckComponent suggestions and case

2009-10-30 Thread Acadaca

I am having great difficulty getting SpellCheckComponent to ignore case.

Given a search of Glod, the suggestion is wood
Given a search of glod, the suggestion is gold

I am using LowerCaseTokenizerFactory for both query and index, so as I
understand it Glod and glod should be treated the same. If not, how can I
truly ignore case?

Here is my fieldType:


  


 

  
  


 

  



-- 
View this message in context: 
http://old.nabble.com/SpellCheckComponent-suggestions-and-case-tp26137035p26137035.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: solr web ui

2009-10-30 Thread Grant Ingersoll
There is also a GWT contribution in JIRA that is pretty handy and will  
likely be added in 1.5.  See http://issues.apache.org/jira/browse/SOLR-1163


-Grant
On Oct 29, 2009, at 9:17 PM, scabbage wrote:



Hi,

I'm a new solr user. I would like to know if there are any easy to  
setup web
UIs for solr. It can be as simple as a search box, term highlighting  
and
basic faceting. Basically I'm using solr to store all our automation  
testing
logs and would like to have a simple searchable UI. I don't wanna  
spent too

much time writing my own.

Thanks.
--
View this message in context: 
http://www.nabble.com/solr-web-ui-tp26123604p26123604.html
Sent from the Solr - User mailing list archive at Nabble.com.






Lucene FieldCache memory requirements

2009-10-30 Thread Fuad Efendi
Hi,


Can anyone confirm Lucene FieldCache memory requirements? I have 100
millions docs with non-tokenized field "country" (10 different countries); I
expect it requires array of ("int", "long"), size of array 100,000,000,
without any impact of "country" field length; 

it requires 600,000,000 bytes: "int" is pointer to document (Lucene document
ID),  and "long" is pointer to String value...

Am I right, is it 600Mb just for this "country" (indexed, non-tokenized,
non-boolean) field and 1 million docs? I need to calculate exact minimum RAM
requirements... 

I believe it shouldn't depend on cardinality (distribution) of field...

Thanks,
Fuad







RE: can't find solr.xml

2009-10-30 Thread Ankit Bhatnagar

It should be in couple of places like - apache-solr-1.3\example\exampledocs etc
 
-Ankit
 


-Original Message-
From: javaxmlsoapdev [mailto:vika...@yahoo.com] 
Sent: Friday, October 30, 2009 4:50 PM
To: solr-user@lucene.apache.org
Subject: can't find solr.xml


I have downloaded apache-solr-1.3.0.tgz for Linux and don't see solr.xml. can
someone assist.
-- 
View this message in context: 
http://old.nabble.com/can%27t-find-solr.xml-tp26136630p26136630.html
Sent from the Solr - User mailing list archive at Nabble.com.



can't find solr.xml

2009-10-30 Thread javaxmlsoapdev

I have downloaded apache-solr-1.3.0.tgz for Linux and don't see solr.xml. can
someone assist.
-- 
View this message in context: 
http://old.nabble.com/can%27t-find-solr.xml-tp26136630p26136630.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: solr web ui

2009-10-30 Thread Lance Norskog
This is a cool out-of-the-box index browser. I'm not sure it is meant
to be customizeable.

On Fri, Oct 30, 2009 at 10:24 AM, scabbage  wrote:
>
> Jay,
>
> This looks awesome. I just tried it and it's what I need.
>
> Thanks.
>
> David
>
>
> Jay Hill wrote:
>>
>> Have a look at the VelocityResponseWriter (
>> http://wiki.apache.org/solr/VelocityResponseWriter). It's in the contrib
>> area, but the wiki has instructions on how to move it into your core Solr.
>> Solr uses response writers to return results. The default is XML but
>> responses can be returned in JSON, Ruby and other formats. The
>> VelocityResponseWriter enables responses returned using Velocity
>> templates.
>> It sounds like exactly what you need.
>>
>> -Jay
>> http://www.lucidimagination.com
>>
>>
>> On Thu, Oct 29, 2009 at 6:17 PM, scabbage  wrote:
>>
>>>
>>> Hi,
>>>
>>> I'm a new solr user. I would like to know if there are any easy to setup
>>> web
>>> UIs for solr. It can be as simple as a search box, term highlighting and
>>> basic faceting. Basically I'm using solr to store all our automation
>>> testing
>>> logs and would like to have a simple searchable UI. I don't wanna spent
>>> too
>>> much time writing my own.
>>>
>>> Thanks.
>>> --
>>> View this message in context:
>>> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html
>>> Sent from the Solr - User mailing list archive at Nabble.com.
>>>
>>>
>>
>>
>
> --
> View this message in context: 
> http://old.nabble.com/solr-web-ui-tp26123604p26133935.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>



-- 
Lance Norskog
goks...@gmail.com


Re: solr search

2009-10-30 Thread Lance Norskog
Please post your dataimporthandler configuration file.

On Fri, Oct 30, 2009 at 4:17 AM, manishkbawne  wrote:
>
> Thanks for your reply .. I am trying to use the database for solr search but
> getting this error..
>
> false in null
> -
> java.lang.NullPointerException at
> org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95)
> at
> org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106)
> at org.apache.solr.core.SolrResourceLoader
>
> Can you please suggest me some possible solution?
>
>
>
>
>
>
>
>
> Karsten F. wrote:
>>
>> hi manishkbawne,
>>
>> unspecific ideas of search improvements are her:
>> http://wiki.apache.org/solr/SolrPerformanceFactors
>>
>> I really like the last idea in
>> http://wiki.apache.org/lucene-java/ImproveSearchingSpeed
>> :
>> Use a profiler and ask a more specific question in this forum.
>>
>> Best regards
>>   Karsten
>>
>>
>>
>> manishkbawne wrote:
>>>
>>> I am using solr search to search through xml files. As I am working on
>>> millions of data, the result output is slower. Can anyone please suggest
>>> me some way, by which I can increase the search result output?
>>>
>>
>>
>
> --
> View this message in context: 
> http://old.nabble.com/solr-search-tp26125183p26128341.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>



-- 
Lance Norskog
goks...@gmail.com


Re: best way to model 1-N

2009-10-30 Thread Joel Nylund

Im using apache-solr-1.3.0

I got it to work using javascript function instead.

thanks
Joel

On Oct 30, 2009, at 12:44 PM, Chantal Ackermann wrote:


This looks all right to me, but I might be missing something.
Which version/build of SOLR are you using?

Chantal

Joel Nylund schrieb:

Thanks Chantal, I will keep that in mind for tuning,
for sql I figured  way to combine them into one row using concat, but
I still seem to be having an issue splitting them:
Db now returns as one column categoryType:
TOPIC,LANGUAGE
but my solr result, if you note the item in categoryType  all seem to
be within one str, I would expect it to be in multiple strings within
the array, is this assumption wrong?

−

TOPIC,LANGUAGE

40
feed title

Here is my import:
  


   
group_concat(cfcr.categoryType) as categoryType
   from CFR cfcr
   where
   cfcr.feedId = '$ 
{item.id}' AND

   group by cfcr.feedId">
   name="categoryType"

splityBy="," />
   
 
In schema:
   stored="true"

required="false" multiValued="true"/>
   stored="true"

required="false" multiValued="true"/>
what am I missing?
thanks
Joel
On Oct 30, 2009, at 10:00 AM, Chantal Ackermann wrote:

That depends a bit on your database, but it is tricky and might not
be performant.

If you are more of a Java developer, you might prefer retrieving
mutliple rows per SOLR document from your dataSource (join on your
category and main table), and aggregate them in your custom
EntityProcessor. I got a far(!) better performance retrieving
everything in one query and doing the aggregation in Java. But this
is, of course, depending on your table structure and data.

Noble Paul helped me with the custom EntityProcessor, and it turned
out quite easy. Have a look at the thread with the heading from this
mailing list (SOLR-USER):
DataImportHandler / Import from DB : one data set comes in multiple
rows

Cheers,
Chantal


Joel Nylund schrieb:
thanks, but im confused how I can aggregate across rows, I dont  
know
of any easy way to get my db to return one row for all the  
categories

(given the hint from your other email), I have split the category
query into a separate entity, but its returning multiple rows,  
how do

I combine multiple rows into 1 index entity?
thanks
Joel
On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote:

In the database this is modeled a a 1-N where category table has
the
mapping of feed to category
I need to be able to query , give me all the feeds in any given
category.
How can I best model this in solr?
Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.


Yes you are right. A multivalued field for "categories" is the
answer.

For populating in the index -

1. If you use DIH to populate your indexes and your datasource  
is a

database then you can use DIH's RegexTransformer on an aggregated
list of
categories. e.g. if your database query retruns "a,b,c,d" in a
column called
"db_categories", this is how you would put it in DIH's data-config
file -
.
2. If you "add" documents to Solr yourself  multiple values for
the field
can be specified as an array or list of values in the
SolrInputDocument.

A multivalued field provides the same faceting and searching
capabilites
like regular fields. There is no special syntax.

Cheers
Avlesh

On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund 
wrote:


Hi,

I have one index so far which contains feeds.  I have been able  
to
de-normalize several tables and map this data onto the feed  
entity.

There is
one tricky problem that I need help on.

Feeds have 1 - many categories.

So Lets say we have Category1, Category2 and Category3

Feed 1 - is in Category 1
Feed 2 is in category2 and category3
Feed 3 is in category2
Feed 4 has no category

In the database this is modeled a a 1-N where category table has
the
mapping of feed to category

I need to be able to query , give me all the feeds in any given
category.

How can I best model this in solr?

Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.

thanks
Joel






autocomplete

2009-10-30 Thread Ankit Bhatnagar

Hi guys,

Enterprise 1.4 Solr Book (AutoComplete) says this works - 

My query looks like - 

"q=*:*&fq=ac:*all*&wt=json&rows=15&start=0&indent=on&omitHeader=true&json.wrf=?";


And it returns three results


{
 "responseHeader":{
  "status":0,
  "QTime":38,
  "params":{
"indent":"on",
"start":"0",
"q":"*:*",
"wt":"json",
"fq":"ac:*all*",
"rows":"15"}},
 "response":{"numFound":3,"start":0,"docs":[
{
 "id":"1",
 "ac":"Can you show me all the results"},
{
 "id":"2",
 "ac":"Can you show all companies "},
{
 "id":"3",
 "ac":"Can you list all companies"}]
 }}



But browser says syntax error -- 


Ankit




 
  





 
 


Re: slave java replication connection issue in nightly (wrong url?)

2009-10-30 Thread gabriele renzi
ah, problem solved, it was due (apparently) caching an old dns entry
sorry for the noise :)


Re: Is it possible to a Transform handle data from 2 rows?

2009-10-30 Thread Lucas F. A. Teixeira
Hello Chantal!

Thanks for your reply!

Actually, in my case, faceting will not help me because I really need all
"rows" from the same group at index time.
That's because I'll use all rows from the same group, merge them all into a
Velocity (or any other template engine) to generate an HTML that will have
all items information and willbe queries with the group id param.

For example, the select will return all stores (group) and departments
(item) in my system.
I'll generate an index with N docs, each one with the store id (group) and
the final HTML for all its departments!

Got it?

What you are doing, is any similar to this?

Thanks!


Lucas Frare Teixeira .·.
- lucas...@gmail.com
- lucastex.com.br
- blog.lucastex.com
- twitter.com/lucastex


On Fri, Oct 30, 2009 at 7:32 AM, Chantal Ackermann <
chantal.ackerm...@btelligent.de> wrote:

> Hi Lucas,
>
> check out the thread:
> DataImportHandler / Import from DB : one data set comes in multiple rows
>
>
> I am doing this successfully with my custom EntityProcessor as described in
> that thread.
> If you have any more questions or need some more code examples, just ask.
> I'd be glad to help!
>
>
> Just wondering: In your SQL statement the ID that you want to use as unique
> ID in SOLR is called "GROUP". You are sure you don't want to create one SOLR
> document per ITEM_ID and add the GROUP ID as a field so that you can sort
> and facet based on it? (Of course, you can facet based on ITEM_ID in the
> other case, as well.)
>
> Cheers,
> Chantal
>
> Noble Paul നോബിള്‍ नोब्ळ् schrieb:
>
>  how do you know that all the rows with the same group id has already come?
>> transformer can store the row and return null till you collect all the
>> rows and then you may return one consolidated row. But the problem is
>> it does not know when the row ends.
>>
>> you can override the SqlEntityProcessor and just override the
>> nextRow() this should do the trick
>>
>> On Fri, Oct 30, 2009 at 1:52 AM, Lucas F. A. Teixeira
>>  wrote:
>>
>>> Hello all,
>>>
>>> Is it possible to have my own implemented transformer that generates some
>>> output data from data of 2 different rows?
>>> My intention is to generate pre-processed HTML from a set of rows.
>>>
>>> Eg.
>>>
>>> SQL RESULT:
>>> GROUP | ITEM_ID | DESCRIPTION
>>> 1 | 8   | desc1
>>> 1 | 23  | desc2
>>> 1 | 563 | desc3
>>> 2 | 43  | desc4
>>> 2 | 3222| desc5
>>>
>>> And with this result, build a Transformer to group all entries in the
>>> same
>>> group and build a html using them. My idea in the final index is to have
>>> 2
>>> docs with
>>>
>>> 
>>>  * id=1
>>>  * content = "desc1>> class='23'>desc2desc3"
>>>
>>>
>>> 
>>>  * id=2
>>>  * content = "desc4>> class='3222'>desc5"
>>>
>>>
>>> I've builted some custom transformers before, but never using data from
>>> different rows.
>>>
>>> If this is not possible with transformers, how can I achieve this?
>>>
>>> Thanks!
>>>
>>>
>>>
>>>
>>> Lucas Frare Teixeira .·.
>>> - lucas...@gmail.com
>>> - lucastex.com.br
>>> - blog.lucastex.com
>>> - twitter.com/lucastex
>>>
>>>
>>
>>
>> --
>> -
>> Noble Paul | Principal Engineer| AOL | http://aol.com
>>
>


Re: solr web ui

2009-10-30 Thread scabbage

Jay,

This looks awesome. I just tried it and it's what I need.

Thanks.

David


Jay Hill wrote:
> 
> Have a look at the VelocityResponseWriter (
> http://wiki.apache.org/solr/VelocityResponseWriter). It's in the contrib
> area, but the wiki has instructions on how to move it into your core Solr.
> Solr uses response writers to return results. The default is XML but
> responses can be returned in JSON, Ruby and other formats. The
> VelocityResponseWriter enables responses returned using Velocity
> templates.
> It sounds like exactly what you need.
> 
> -Jay
> http://www.lucidimagination.com
> 
> 
> On Thu, Oct 29, 2009 at 6:17 PM, scabbage  wrote:
> 
>>
>> Hi,
>>
>> I'm a new solr user. I would like to know if there are any easy to setup
>> web
>> UIs for solr. It can be as simple as a search box, term highlighting and
>> basic faceting. Basically I'm using solr to store all our automation
>> testing
>> logs and would like to have a simple searchable UI. I don't wanna spent
>> too
>> much time writing my own.
>>
>> Thanks.
>> --
>> View this message in context:
>> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
> 
> 

-- 
View this message in context: 
http://old.nabble.com/solr-web-ui-tp26123604p26133935.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: solr web ui

2009-10-30 Thread scabbage

Chantal,

Yes, I have been using the admin ui, but only for development. I need a ui
that displays the query result XML in HTML. Thanks for the suggestion
though.

David



Chantal Ackermann wrote:
> 
> Hi!
> 
> Just to make sure - you did see the search form in the SOLR admin ui?
> 
> Chantal
> 
> scabbage schrieb:
>> Hi,
>> 
>> I'm a new solr user. I would like to know if there are any easy to setup
>> web
>> UIs for solr. It can be as simple as a search box, term highlighting and
>> basic faceting. Basically I'm using solr to store all our automation
>> testing
>> logs and would like to have a simple searchable UI. I don't wanna spent
>> too
>> much time writing my own.
>> 
>> Thanks.
>> --
>> View this message in context:
>> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>> 
> 
> 

-- 
View this message in context: 
http://old.nabble.com/solr-web-ui-tp26123604p26133745.html
Sent from the Solr - User mailing list archive at Nabble.com.



slave java replication connection issue in nightly (wrong url?)

2009-10-30 Thread gabriele renzi
hi everyone and sorry if this question is dumb, but I seem to be
unable to connect from a slave, with both are in a multicore setup

I configured the replication like



  ${enable.master:false}
  commit
  startup
  optimize
  schema.xml


  ${enable.slave:false}
  http://master.solr.lan:8080/solr/core/replication
  00:00:60



And it seems that both master and slave loaded the configuration
correctly e.g. in the admin interface I see
* /replication added
* my config values correctly loaded
* replicationEnabled = true in master
* isSlave and isMaster correctly set on both
* the http://master.solr.lan:8080/solr/core/replication gives me
"status ok" when accessed as a GET (curl) from  the slave machine

yet, when I check from the slave's admin interface it says that
http://master.solr.lan:8080/solr/core/replication is unreachable, and
from the statistics I see
isReplicating : false

and in the logs
SEVERE: Master at: http://master.solr.lan:8080/solr/core/replication
is not available. Index fetch failed. Exception: Request failed
 for the url org.apache.commons.httpclient.methods.postmet...@48f3d27

I'm thinking maybe I should have configured the url differently, but
frankly I don't have a clue.

Could someone point out what is the obvious error I'm doing ? :)

Additional informations:
* slave nightly is from today, master from a few days ago
* both machines are 64 bit and running on linux
* java -version is
Java version "1.6.0_16"
Java(TM) SE Runtime Environment (build 1.6.0_16-b01)
Java HotSpot(TM) 64-Bit Server VM (build 14.2-b01, mixed mode)
* solr is deployed in a tomcat6 container


Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-10-30 Thread Israel Ekpo
On Fri, Oct 30, 2009 at 11:23 AM, Eugene Dzhurinsky wrote:

> Hi there!
>
> We are trying to evaluate Apache Solr for our custom search implementation,
> which
> includes the following requirements:
>
> - ability to add/update/delete a lot of documents at once
>
> - ability to iterate over all documents, returned in search, as Lucene does
>  provide within a HitCollector instance. We would need to extract and
>  aggregate various fields, stored in index, to group results and aggregate
> them
>  in some way.
>
> After reading the tutorial I've realized that adding and removal of
> documents
> is performed through passing an XML file to controller in POST request.
> However our XML files may be very, very large - so I hope there is some
> another option to avoid interaction through HTTP protocol.
>
> Also I did not find any way in the tutorial to access the search results
> with
> all fields to be processed by our application.
>
> I think I simply did not read the documentation well or missed some point,
> so
> can somebody please point me to the articles, which may explain basics of
> how
> to achieve my goals?
>
> Thank you very much in advance!
>
> --
> Eugene N Dzhurinsky
>

Hi Eugene

Solr has an embedded version but you are encouraged to use the standard web
service interfaces.

Also, the Solr 1.4 white paper just recently released talks about the the
Streaming Updates Solr Server which according to the white paper can index
documents at an incredibly lightening speed of up to 25K documents per
second.

The white paper can be downloaded here

http://www.lucidimagination.com/whitepaper/whats-new-in-solr-1-4

Info about Streaming Update Solr Server is available here

http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer.html

If you are still interested in the Embedded version to avoid the HTTP
version you can check out the following links

http://wiki.apache.org/solr/EmbeddedSolr

http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.html

I hope this helps.

-- 
"Good Enough" is not good enough.
To give anything less than your best is to sacrifice the gift.
Quality First. Measure Twice. Cut Once.


Re: best way to model 1-N

2009-10-30 Thread Chantal Ackermann

This looks all right to me, but I might be missing something.
Which version/build of SOLR are you using?

Chantal

Joel Nylund schrieb:

Thanks Chantal, I will keep that in mind for tuning,

for sql I figured  way to combine them into one row using concat, but
I still seem to be having an issue splitting them:

Db now returns as one column categoryType:
TOPIC,LANGUAGE

but my solr result, if you note the item in categoryType  all seem to
be within one str, I would expect it to be in multiple strings within
the array, is this assumption wrong?


−

TOPIC,LANGUAGE

40
feed title



Here is my import:
   
 
 




  

In schema:




what am I missing?

thanks
Joel


On Oct 30, 2009, at 10:00 AM, Chantal Ackermann wrote:


That depends a bit on your database, but it is tricky and might not
be performant.

If you are more of a Java developer, you might prefer retrieving
mutliple rows per SOLR document from your dataSource (join on your
category and main table), and aggregate them in your custom
EntityProcessor. I got a far(!) better performance retrieving
everything in one query and doing the aggregation in Java. But this
is, of course, depending on your table structure and data.

Noble Paul helped me with the custom EntityProcessor, and it turned
out quite easy. Have a look at the thread with the heading from this
mailing list (SOLR-USER):
DataImportHandler / Import from DB : one data set comes in multiple
rows

Cheers,
Chantal


Joel Nylund schrieb:

thanks, but im confused how I can aggregate across rows, I dont know
of any easy way to get my db to return one row for all the categories
(given the hint from your other email), I have split the category
query into a separate entity, but its returning multiple rows, how do
I combine multiple rows into 1 index entity?
thanks
Joel
On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote:

In the database this is modeled a a 1-N where category table has
the
mapping of feed to category
I need to be able to query , give me all the feeds in any given
category.
How can I best model this in solr?
Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.


Yes you are right. A multivalued field for "categories" is the
answer.

For populating in the index -

 1. If you use DIH to populate your indexes and your datasource is a
 database then you can use DIH's RegexTransformer on an aggregated
list of
 categories. e.g. if your database query retruns "a,b,c,d" in a
column called
 "db_categories", this is how you would put it in DIH's data-config
file -
 .
 2. If you "add" documents to Solr yourself  multiple values for
the field
 can be specified as an array or list of values in the
SolrInputDocument.

A multivalued field provides the same faceting and searching
capabilites
like regular fields. There is no special syntax.

Cheers
Avlesh

On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund 
wrote:


Hi,

I have one index so far which contains feeds.  I have been able to
de-normalize several tables and map this data onto the feed entity.
There is
one tricky problem that I need help on.

Feeds have 1 - many categories.

So Lets say we have Category1, Category2 and Category3

Feed 1 - is in Category 1
Feed 2 is in category2 and category3
Feed 3 is in category2
Feed 4 has no category

In the database this is modeled a a 1-N where category table has
the
mapping of feed to category

I need to be able to query , give me all the feeds in any given
category.

How can I best model this in solr?

Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.

thanks
Joel






Re: Iso accents and wildcards

2009-10-30 Thread jfmelian
if the request contains any wilcard then filters are not called :
no ISOLatin1AccentFilterFactory and no SnowballPorterFilterFactory  !

"économie" is indexed to "econom"

solr don't found :
 - term starts with "éco" (éco*)
 - term starts with "economi" (economi*)

if you index manger, mangé and mangue, the indexed terms will be mang and mangu

requests  ->  results

manger   ->   mange, mangé
mangé->   mange, mangé
mang ->   mange, manger
mangu->   mangue
mang*->   manger, mangé, mangue
mang?->   mangue  (and not mangé)
mangé*   ->   nothing

Jean-François


- "Nicolas Leconte"  a écrit :

| Hi all,
| 
| I have a field that contains accentuated char in it, what I whant is
| to 
| be able to search with ignore accents.
| I have set up that field with :
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| In the index the word "économie" is translated to  "econom", the 
| accent 
| is removed thanks to the ISOLatin1AccentFilterFactory and the end of
| the 
| word removent thanks to the SnowballPorterFilterFactory.
| 
| When I request with title:econ* I can have the correct  answers, but
| if  
| I request  with  title:écon*  I  have no  answers.
| If I request with title:économ (the exact word of the index) it works,
| 
| so there might be something wrong with the wildcard.
| As far as I can understand the analyser should be use exactly the same
| 
| in both index and query time.
| 
| I have tested with changing the order of the filters (putting the 
| ISOLatin1AccentFilterFactory on top) without any result.
| 
| Could anybody help me with that and point me what may be wrong with my
| 
| shema ?


Re: best way to model 1-N

2009-10-30 Thread Joel Nylund

Thanks Chantal, I will keep that in mind for tuning,

for sql I figured  way to combine them into one row using concat, but  
I still seem to be having an issue splitting them:


Db now returns as one column categoryType:
TOPIC,LANGUAGE

but my solr result, if you note the item in categoryType  all seem to  
be within one str, I would expect it to be in multiple strings within  
the array, is this assumption wrong?



−

TOPIC,LANGUAGE

40
feed title



Here is my import:
  


		
from CFR cfcr
where
cfcr.feedId = '${item.id}' AND
group by cfcr.feedId">
				splityBy="," />



 

In schema:
	required="false" multiValued="true"/>
	required="false" multiValued="true"/>



what am I missing?

thanks
Joel


On Oct 30, 2009, at 10:00 AM, Chantal Ackermann wrote:

That depends a bit on your database, but it is tricky and might not  
be performant.


If you are more of a Java developer, you might prefer retrieving  
mutliple rows per SOLR document from your dataSource (join on your  
category and main table), and aggregate them in your custom  
EntityProcessor. I got a far(!) better performance retrieving  
everything in one query and doing the aggregation in Java. But this  
is, of course, depending on your table structure and data.


Noble Paul helped me with the custom EntityProcessor, and it turned  
out quite easy. Have a look at the thread with the heading from this  
mailing list (SOLR-USER):
DataImportHandler / Import from DB : one data set comes in multiple  
rows


Cheers,
Chantal


Joel Nylund schrieb:

thanks, but im confused how I can aggregate across rows, I dont know
of any easy way to get my db to return one row for all the categories
(given the hint from your other email), I have split the category
query into a separate entity, but its returning multiple rows, how do
I combine multiple rows into 1 index entity?
thanks
Joel
On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote:
In the database this is modeled a a 1-N where category table has  
the

mapping of feed to category
I need to be able to query , give me all the feeds in any given
category.
How can I best model this in solr?
Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.

Yes you are right. A multivalued field for "categories" is the  
answer.


For populating in the index -

 1. If you use DIH to populate your indexes and your datasource is a
 database then you can use DIH's RegexTransformer on an aggregated
list of
 categories. e.g. if your database query retruns "a,b,c,d" in a
column called
 "db_categories", this is how you would put it in DIH's data-config
file -
 .
 2. If you "add" documents to Solr yourself  multiple values for
the field
 can be specified as an array or list of values in the
SolrInputDocument.

A multivalued field provides the same faceting and searching
capabilites
like regular fields. There is no special syntax.

Cheers
Avlesh

On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund 
wrote:


Hi,

I have one index so far which contains feeds.  I have been able to
de-normalize several tables and map this data onto the feed entity.
There is
one tricky problem that I need help on.

Feeds have 1 - many categories.

So Lets say we have Category1, Category2 and Category3

Feed 1 - is in Category 1
Feed 2 is in category2 and category3
Feed 3 is in category2
Feed 4 has no category

In the database this is modeled a a 1-N where category table has  
the

mapping of feed to category

I need to be able to query , give me all the feeds in any given
category.

How can I best model this in solr?

Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.

thanks
Joel






Iso accents and wildcards

2009-10-30 Thread Nicolas Leconte

Hi all,

I have a field that contains accentuated char in it, what I whant is to 
be able to search with ignore accents.

I have set up that field with :



generateNumberParts="1" catenateWords="1" catenateNumbers="1" 
catenateAll="0" splitOnCaseChange="1" />


words="stopwords.txt" />







In the index the word "économie" is translated to  "econom", the  accent 
is removed thanks to the ISOLatin1AccentFilterFactory and the end of the 
word removent thanks to the SnowballPorterFilterFactory.


When I request with title:econ* I can have the correct  answers, but if  
I request  with  title:écon*  I  have no  answers.
If I request with title:économ (the exact word of the index) it works, 
so there might be something wrong with the wildcard.
As far as I can understand the analyser should be use exactly the same 
in both index and query time.


I have tested with changing the order of the filters (putting the 
ISOLatin1AccentFilterFactory on top) without any result.


Could anybody help me with that and point me what may be wrong with my 
shema ?


adding and updating a lot of document to Solr, metadata extraction etc

2009-10-30 Thread Eugene Dzhurinsky
Hi there!

We are trying to evaluate Apache Solr for our custom search implementation, 
which
includes the following requirements:

- ability to add/update/delete a lot of documents at once

- ability to iterate over all documents, returned in search, as Lucene does
  provide within a HitCollector instance. We would need to extract and
  aggregate various fields, stored in index, to group results and aggregate them
  in some way.

After reading the tutorial I've realized that adding and removal of documents
is performed through passing an XML file to controller in POST request.
However our XML files may be very, very large - so I hope there is some
another option to avoid interaction through HTTP protocol.

Also I did not find any way in the tutorial to access the search results with
all fields to be processed by our application.

I think I simply did not read the documentation well or missed some point, so
can somebody please point me to the articles, which may explain basics of how
to achieve my goals?

Thank you very much in advance!

-- 
Eugene N Dzhurinsky


pgpJ2ZR6rFHSF.pgp
Description: PGP signature


Re: solr web ui

2009-10-30 Thread Jay Hill
Have a look at the VelocityResponseWriter (
http://wiki.apache.org/solr/VelocityResponseWriter). It's in the contrib
area, but the wiki has instructions on how to move it into your core Solr.
Solr uses response writers to return results. The default is XML but
responses can be returned in JSON, Ruby and other formats. The
VelocityResponseWriter enables responses returned using Velocity templates.
It sounds like exactly what you need.

-Jay
http://www.lucidimagination.com


On Thu, Oct 29, 2009 at 6:17 PM, scabbage  wrote:

>
> Hi,
>
> I'm a new solr user. I would like to know if there are any easy to setup
> web
> UIs for solr. It can be as simple as a search box, term highlighting and
> basic faceting. Basically I'm using solr to store all our automation
> testing
> logs and would like to have a simple searchable UI. I don't wanna spent too
> much time writing my own.
>
> Thanks.
> --
> View this message in context:
> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


Solr YUI autocomplete

2009-10-30 Thread Ankit Bhatnagar

Does Solr supports JSONP (JSON with Padding) in the response?

-Ankit
 


-Original Message-
From: Ankit Bhatnagar [mailto:abhatna...@vantage.com] 
Sent: Friday, October 30, 2009 10:27 AM
To: 'solr-user@lucene.apache.org'
Subject: Solr YUI autocomplete

Hi Guys,

I have question regarding - how to specify the 

I am using YUI autocomplete widget and it expects the JSONP response.

http://localhost:8983/solr/select/?q=monitor&version=2.2&start=0&rows=10&indent=on&wt=json&json.wrf=

I am not sure how should I specify the json.wrf=function

Thanks
Ankit


Solr YUI autocomplete

2009-10-30 Thread Ankit Bhatnagar
Hi Guys,

I have question regarding - how to specify the 

I am using YUI autocomplete widget and it expects the JSONP response.

http://localhost:8983/solr/select/?q=monitor&version=2.2&start=0&rows=10&indent=on&wt=json&json.wrf=

I am not sure how should I specify the json.wrf=function

Thanks
Ankit


Re: Solr and LSA

2009-10-30 Thread Glen Newton
I am using Semantic Vectors[1] implementation of LSA in a large scale
digital library project called Project Torngat[2]. I presented some of
the work at the European Conference on Digital Libraries (ECDL)[3], at
the 'Very Large Digital Libraries (VLDL) workshop[4] in September. A
pre-print of the paper is here[5]. Badsically I used it to generate
similarities of journals using the full-text of all their articles,
and used this to create a 2-D map of the journal space.

The Semantic Vectors works very very well, but I believe it is not yet
a stable (API or functionality) codebase, more of an
experimental-as-yet effort, so it may be a moving target for anyone
wanting to integrate it.

Gllen Newton
http://zzzoot.blogspot.com/

[1]http://code.google.com/p/semanticvectors/
[2]http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/Torngat1
[3]http://www.ecdl2009.eu/
[4]http://www.delos.info/vldl2009/
[5]http://cuvier.cisti.nrc.ca/~gnewton/events/2009/ecdl2009Newton_20090723.pdf

2009/10/30 György Frivolt :
> That's what I found so far too. Seems to be usable. It uses a variant of
> LSA, probabilistic LSA.
> Writing a module for it.. have to rephrase my question them. Where to start
> with writing a module for solr.. Should be useful for diving deeper in solr.
>
>
>
> On Fri, Oct 30, 2009 at 9:56 AM, Paul Libbrecht  wrote:
>
>> Not with solr but with Lucene, there is the project called semanticvectors.
>> It would be cute to make it a solr module.
>>
>> paul
>>
>>
>> Le 30-oct.-09 à 09:17, György Frivolt a écrit :
>>
>>
>>  Hi,
>>>   Does anyone of you have experiences with using LSA, Latent Semantic
>>> Analysis with Solr? I would like to search for expressions, but also find
>>> records, which have context relevant for the given query. Would appreciate
>>> any clue where to start, what to consider.
>>>
>>> Bw, Georg
>>>
>>
>>
>



-- 

-


AW: AW: JQuery and autosuggest

2009-10-30 Thread Markus.Rietzler
as long as you are working on your own pc than this might be ok.
but when someone else is accessing the site via internet, than there is "no 
localhost",
at least not your. so at this point the real (internet-) server name is needed.
btw: ajax calls are only allowed from the origin server, so a page loaded from

www.a-server.com

only will allow ajax-calls that also go to www.a-server.com, and not 
www.b-server.com
or any other

--
mit freundlichen Grüßen

Markus Rietzler - 
Rechenzentrum der Finanzverwaltung NRW
0211/4572-2130
 

> -Ursprüngliche Nachricht-
> Von: sophSophie [mailto:sop...@beezik.com] 
> Gesendet: Freitag, 30. Oktober 2009 14:57
> An: solr-user@lucene.apache.org
> Betreff: Re: AW: JQuery and autosuggest
> 
> 
> Hallo Markus !
> 
> I think that the servername is correct as I'm testing this 
> autocomplete box
> on my computer (my site is built in localhost, solr is in localhost).
> 
> Am I wrong ?
> 
> Thanks
> 
> 
> Markus.Rietzler wrote:
> > 
> > very simple, in the javascript you have
> > 
> > $("#artist").autocomplete(
> > 'http://localhost:8983/solr/completion/select/?wt=json', {
> > 
> > you have to change localhost to the correct servername
> > 
> > 
> > --
> > mit freundlichen Grüßen
> > 
> > Markus Rietzler - 
> > Rechenzentrum der Finanzverwaltung NRW
> > 0211/4572-2130
> >  
> > 
> >> -Ursprüngliche Nachricht-
> >> Von: sophSophie [mailto:sop...@beezik.com] 
> >> Gesendet: Freitag, 30. Oktober 2009 14:35
> >> An: solr-user@lucene.apache.org
> >> Betreff: JQuery and autosuggest
> >> 
> >> 
> >> Hello,
> >> 
> >> I'm trying to build a autocomplete text box. I think that 
> >> everything is ok
> >> in the schema.xml and the solrconfig.xml side as I have 
> >> correct responses
> >> through the solr admin page (I got something like that
> >> :http://brainmaths.fr/IMG/response.txt ).
> >> 
> >> I don't know really javascript and I tried the code given in 
> >> Solr 1.4 book
> >> page 241. You can see what I did here :
> >> http://brainmaths.fr/IMG/completion.html and have a look to 
> >> the source code.
> >> 
> >> The query is correcty sent to solr (I checked the log) but 
> >> nothing returns
> >> and I have an error on this line :
> >> 
> >> return $.map(data.response.docs, function(doc) {
> >> 
> >> The error is : response.docs is null or isn't an object, I 
> >> don't know how to
> >> fix it. I guess that docs aren't correctly returned.
> >> 
> >> I need any help any ideas
> >> 
> >> Thanks in advance
> >> 
> >> -
> >> English is not my mother tongue :)
> >> -- 
> >> View this message in context: 
> >> 
> http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html
> >> Sent from the Solr - User mailing list archive at Nabble.com.
> >> 
> >> 
> > 
> > 
> 
> 
> -
> English is not my mother tongue :)
> -- 
> View this message in context: 
> http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130511.html
> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 


Re: best way to model 1-N

2009-10-30 Thread Chantal Ackermann
That depends a bit on your database, but it is tricky and might not be 
performant.


If you are more of a Java developer, you might prefer retrieving 
mutliple rows per SOLR document from your dataSource (join on your 
category and main table), and aggregate them in your custom 
EntityProcessor. I got a far(!) better performance retrieving everything 
in one query and doing the aggregation in Java. But this is, of course, 
depending on your table structure and data.


Noble Paul helped me with the custom EntityProcessor, and it turned out 
quite easy. Have a look at the thread with the heading from this mailing 
list (SOLR-USER):

DataImportHandler / Import from DB : one data set comes in multiple rows

Cheers,
Chantal


Joel Nylund schrieb:

thanks, but im confused how I can aggregate across rows, I dont know
of any easy way to get my db to return one row for all the categories
(given the hint from your other email), I have split the category
query into a separate entity, but its returning multiple rows, how do
I combine multiple rows into 1 index entity?

thanks
Joel

On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote:


In the database this is modeled a a 1-N where category table has the
mapping of feed to category
I need to be able to query , give me all the feeds in any given
category.
How can I best model this in solr?
Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.


Yes you are right. A multivalued field for "categories" is the answer.

For populating in the index -

  1. If you use DIH to populate your indexes and your datasource is a
  database then you can use DIH's RegexTransformer on an aggregated
list of
  categories. e.g. if your database query retruns "a,b,c,d" in a
column called
  "db_categories", this is how you would put it in DIH's data-config
file -
  .
  2. If you "add" documents to Solr yourself  multiple values for
the field
  can be specified as an array or list of values in the
SolrInputDocument.

A multivalued field provides the same faceting and searching
capabilites
like regular fields. There is no special syntax.

Cheers
Avlesh

On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund 
wrote:


Hi,

I have one index so far which contains feeds.  I have been able to
de-normalize several tables and map this data onto the feed entity.
There is
one tricky problem that I need help on.

Feeds have 1 - many categories.

So Lets say we have Category1, Category2 and Category3

Feed 1 - is in Category 1
Feed 2 is in category2 and category3
Feed 3 is in category2
Feed 4 has no category

In the database this is modeled a a 1-N where category table has the
mapping of feed to category

I need to be able to query , give me all the feeds in any given
category.

How can I best model this in solr?

Seems like multiValued field might help, but how would I populate
it, and
would the query above work?.

thanks
Joel






Re: AW: JQuery and autosuggest

2009-10-30 Thread sophSophie

Hallo Markus !

I think that the servername is correct as I'm testing this autocomplete box
on my computer (my site is built in localhost, solr is in localhost).

Am I wrong ?

Thanks


Markus.Rietzler wrote:
> 
> very simple, in the javascript you have
> 
> $("#artist").autocomplete(
>   'http://localhost:8983/solr/completion/select/?wt=json', {
> 
> you have to change localhost to the correct servername
> 
> 
> --
> mit freundlichen Grüßen
> 
> Markus Rietzler - 
> Rechenzentrum der Finanzverwaltung NRW
> 0211/4572-2130
>  
> 
>> -Ursprüngliche Nachricht-
>> Von: sophSophie [mailto:sop...@beezik.com] 
>> Gesendet: Freitag, 30. Oktober 2009 14:35
>> An: solr-user@lucene.apache.org
>> Betreff: JQuery and autosuggest
>> 
>> 
>> Hello,
>> 
>> I'm trying to build a autocomplete text box. I think that 
>> everything is ok
>> in the schema.xml and the solrconfig.xml side as I have 
>> correct responses
>> through the solr admin page (I got something like that
>> :http://brainmaths.fr/IMG/response.txt ).
>> 
>> I don't know really javascript and I tried the code given in 
>> Solr 1.4 book
>> page 241. You can see what I did here :
>> http://brainmaths.fr/IMG/completion.html and have a look to 
>> the source code.
>> 
>> The query is correcty sent to solr (I checked the log) but 
>> nothing returns
>> and I have an error on this line :
>> 
>> return $.map(data.response.docs, function(doc) {
>> 
>> The error is : response.docs is null or isn't an object, I 
>> don't know how to
>> fix it. I guess that docs aren't correctly returned.
>> 
>> I need any help any ideas
>> 
>> Thanks in advance
>> 
>> -
>> English is not my mother tongue :)
>> -- 
>> View this message in context: 
>> http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>> 
>> 
> 
> 


-
English is not my mother tongue :)
-- 
View this message in context: 
http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130511.html
Sent from the Solr - User mailing list archive at Nabble.com.



AW: JQuery and autosuggest

2009-10-30 Thread Markus.Rietzler
very simple, in the javascript you have

$("#artist").autocomplete(
'http://localhost:8983/solr/completion/select/?wt=json', {

you have to change localhost to the correct servername


--
mit freundlichen Grüßen

Markus Rietzler - 
Rechenzentrum der Finanzverwaltung NRW
0211/4572-2130
 

> -Ursprüngliche Nachricht-
> Von: sophSophie [mailto:sop...@beezik.com] 
> Gesendet: Freitag, 30. Oktober 2009 14:35
> An: solr-user@lucene.apache.org
> Betreff: JQuery and autosuggest
> 
> 
> Hello,
> 
> I'm trying to build a autocomplete text box. I think that 
> everything is ok
> in the schema.xml and the solrconfig.xml side as I have 
> correct responses
> through the solr admin page (I got something like that
> :http://brainmaths.fr/IMG/response.txt ).
> 
> I don't know really javascript and I tried the code given in 
> Solr 1.4 book
> page 241. You can see what I did here :
> http://brainmaths.fr/IMG/completion.html and have a look to 
> the source code.
> 
> The query is correcty sent to solr (I checked the log) but 
> nothing returns
> and I have an error on this line :
> 
> return $.map(data.response.docs, function(doc) {
> 
> The error is : response.docs is null or isn't an object, I 
> don't know how to
> fix it. I guess that docs aren't correctly returned.
> 
> I need any help any ideas
> 
> Thanks in advance
> 
> -
> English is not my mother tongue :)
> -- 
> View this message in context: 
> http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html
> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 


Re: solr search

2009-10-30 Thread Karsten F.

Hi

I never used DBMS/SQL Import in solr by myself.
Possible
http://old.nabble.com/Using-DataImportHandler-with-mysql-database-to20425791.html#a20435463
can help you.

regards
  Karsten


manishkbawne wrote:
> 
> Thanks for your reply .. I am trying to use the database for solr search
> but getting this error.. 
> 
> false in null
> -
> java.lang.NullPointerException at
> org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95)
> at
> org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106)
> at org.apache.solr.core.SolrResourceLoader
> 
> Can you please suggest me some possible solution?
> 

-- 
View this message in context: 
http://old.nabble.com/solr-search-tp26125183p26130344.html
Sent from the Solr - User mailing list archive at Nabble.com.



JQuery and autosuggest

2009-10-30 Thread sophSophie

Hello,

I'm trying to build a autocomplete text box. I think that everything is ok
in the schema.xml and the solrconfig.xml side as I have correct responses
through the solr admin page (I got something like that
:http://brainmaths.fr/IMG/response.txt ).

I don't know really javascript and I tried the code given in Solr 1.4 book
page 241. You can see what I did here :
http://brainmaths.fr/IMG/completion.html and have a look to the source code.

The query is correcty sent to solr (I checked the log) but nothing returns
and I have an error on this line :

return $.map(data.response.docs, function(doc) {

The error is : response.docs is null or isn't an object, I don't know how to
fix it. I guess that docs aren't correctly returned.

I need any help any ideas

Thanks in advance

-
English is not my mother tongue :)
-- 
View this message in context: 
http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: best way to model 1-N

2009-10-30 Thread Joel Nylund
thanks, but im confused how I can aggregate across rows, I dont know  
of any easy way to get my db to return one row for all the categories  
(given the hint from your other email), I have split the category  
query into a separate entity, but its returning multiple rows, how do  
I combine multiple rows into 1 index entity?


thanks
Joel

On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote:



In the database this is modeled a a 1-N where category table has the
mapping of feed to category
I need to be able to query , give me all the feeds in any given  
category.

How can I best model this in solr?
Seems like multiValued field might help, but how would I populate  
it, and

would the query above work?.


Yes you are right. A multivalued field for "categories" is the answer.

For populating in the index -

  1. If you use DIH to populate your indexes and your datasource is a
  database then you can use DIH's RegexTransformer on an aggregated  
list of
  categories. e.g. if your database query retruns "a,b,c,d" in a  
column called
  "db_categories", this is how you would put it in DIH's data-config  
file -

  .
  2. If you "add" documents to Solr yourself  multiple values for  
the field
  can be specified as an array or list of values in the  
SolrInputDocument.


A multivalued field provides the same faceting and searching  
capabilites

like regular fields. There is no special syntax.

Cheers
Avlesh

On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund   
wrote:



Hi,

I have one index so far which contains feeds.  I have been able to
de-normalize several tables and map this data onto the feed entity.  
There is

one tricky problem that I need help on.

Feeds have 1 - many categories.

So Lets say we have Category1, Category2 and Category3

Feed 1 - is in Category 1
Feed 2 is in category2 and category3
Feed 3 is in category2
Feed 4 has no category

In the database this is modeled a a 1-N where category table has the
mapping of feed to category

I need to be able to query , give me all the feeds in any given  
category.


How can I best model this in solr?

Seems like multiValued field might help, but how would I populate  
it, and

would the query above work?.

thanks
Joel






Re: solr search

2009-10-30 Thread manishkbawne

Thanks for your reply .. I am trying to use the database for solr search but
getting this error.. 

false in null
-
java.lang.NullPointerException at
org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95)
at
org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106)
at org.apache.solr.core.SolrResourceLoader

Can you please suggest me some possible solution?








Karsten F. wrote:
> 
> hi manishkbawne,
> 
> unspecific ideas of search improvements are her:
> http://wiki.apache.org/solr/SolrPerformanceFactors
> 
> I really like the last idea in
> http://wiki.apache.org/lucene-java/ImproveSearchingSpeed
> :
> Use a profiler and ask a more specific question in this forum.
> 
> Best regards
>   Karsten
> 
> 
> 
> manishkbawne wrote:
>> 
>> I am using solr search to search through xml files. As I am working on
>> millions of data, the result output is slower. Can anyone please suggest
>> me some way, by which I can increase the search result output? 
>> 
> 
> 

-- 
View this message in context: 
http://old.nabble.com/solr-search-tp26125183p26128341.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Is it possible to a Transform handle data from 2 rows?

2009-10-30 Thread Chantal Ackermann

Hi Lucas,

check out the thread:
DataImportHandler / Import from DB : one data set comes in multiple rows


I am doing this successfully with my custom EntityProcessor as described 
in that thread.
If you have any more questions or need some more code examples, just 
ask. I'd be glad to help!



Just wondering: In your SQL statement the ID that you want to use as 
unique ID in SOLR is called "GROUP". You are sure you don't want to 
create one SOLR document per ITEM_ID and add the GROUP ID as a field so 
that you can sort and facet based on it? (Of course, you can facet based 
on ITEM_ID in the other case, as well.)


Cheers,
Chantal

Noble Paul നോബിള്‍ नोब्ळ् schrieb:

how do you know that all the rows with the same group id has already come?
transformer can store the row and return null till you collect all the
rows and then you may return one consolidated row. But the problem is
it does not know when the row ends.

you can override the SqlEntityProcessor and just override the
nextRow() this should do the trick

On Fri, Oct 30, 2009 at 1:52 AM, Lucas F. A. Teixeira
 wrote:

Hello all,

Is it possible to have my own implemented transformer that generates some
output data from data of 2 different rows?
My intention is to generate pre-processed HTML from a set of rows.

Eg.

SQL RESULT:
GROUP | ITEM_ID | DESCRIPTION
1 | 8   | desc1
1 | 23  | desc2
1 | 563 | desc3
2 | 43  | desc4
2 | 3222| desc5

And with this result, build a Transformer to group all entries in the same
group and build a html using them. My idea in the final index is to have 2
docs with


 * id=1
 * content = "desc1desc2desc3"



 * id=2
 * content = "desc4desc5"


I've builted some custom transformers before, but never using data from
different rows.

If this is not possible with transformers, how can I achieve this?

Thanks!




Lucas Frare Teixeira .·.
- lucas...@gmail.com
- lucastex.com.br
- blog.lucastex.com
- twitter.com/lucastex





--
-
Noble Paul | Principal Engineer| AOL | http://aol.com


Re: solr web ui

2009-10-30 Thread Chantal Ackermann

Hi!

Just to make sure - you did see the search form in the SOLR admin ui?

Chantal

scabbage schrieb:

Hi,

I'm a new solr user. I would like to know if there are any easy to setup web
UIs for solr. It can be as simple as a search box, term highlighting and
basic faceting. Basically I'm using solr to store all our automation testing
logs and would like to have a simple searchable UI. I don't wanna spent too
much time writing my own.

Thanks.
--
View this message in context: 
http://www.nabble.com/solr-web-ui-tp26123604p26123604.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Solr and LSA

2009-10-30 Thread György Frivolt
That's what I found so far too. Seems to be usable. It uses a variant of
LSA, probabilistic LSA.
Writing a module for it.. have to rephrase my question them. Where to start
with writing a module for solr.. Should be useful for diving deeper in solr.



On Fri, Oct 30, 2009 at 9:56 AM, Paul Libbrecht  wrote:

> Not with solr but with Lucene, there is the project called semanticvectors.
> It would be cute to make it a solr module.
>
> paul
>
>
> Le 30-oct.-09 à 09:17, György Frivolt a écrit :
>
>
>  Hi,
>>   Does anyone of you have experiences with using LSA, Latent Semantic
>> Analysis with Solr? I would like to search for expressions, but also find
>> records, which have context relevant for the given query. Would appreciate
>> any clue where to start, what to consider.
>>
>> Bw, Georg
>>
>
>


Re: Solr and LSA

2009-10-30 Thread Paul Libbrecht
Not with solr but with Lucene, there is the project called  
semanticvectors.

It would be cute to make it a solr module.

paul


Le 30-oct.-09 à 09:17, György Frivolt a écrit :


Hi,
   Does anyone of you have experiences with using LSA, Latent Semantic
Analysis with Solr? I would like to search for expressions, but also  
find
records, which have context relevant for the given query. Would  
appreciate

any clue where to start, what to consider.

Bw, Georg




smime.p7s
Description: S/MIME cryptographic signature


Solr and LSA

2009-10-30 Thread György Frivolt
Hi,
Does anyone of you have experiences with using LSA, Latent Semantic
Analysis with Solr? I would like to search for expressions, but also find
records, which have context relevant for the given query. Would appreciate
any clue where to start, what to consider.

Bw, Georg


Re: Indexing multiple entities

2009-10-30 Thread Christian López Espínola
On Fri, Oct 30, 2009 at 2:04 AM, Avlesh Singh  wrote:
>>
>> One thing I thought about is if I can define my own
>> DocumentObjectBinder, so I can concatenate my entity names with the
>> IDs in the XML creation.
>>
>> Anyone knows if something like this can be done without modifying
>> Solrj sources? Is there any injection or plugin mecanism for this?
>>
> More details on the use-case please.

If I index a Book with ID=3, and then a Magazine with ID=3, I'll be
really removing my Book3 and indexing Magazine3. I want both entities
to be in the index.

The use case on DocumentObjectBinder is that I could override
toSolrInputDocument, and if field = ID, I could do: setField("id",
obj.getClass().getName() + obj.getId()) or something like that.

The goal is avoiding creating all the XMLs to be sent to Solr but
having the possibility of modifying them in some way.

Do you know how can I do that, or a better way of achieving the same results?


> Cheers
> Avlesh
>
> On Fri, Oct 30, 2009 at 2:16 AM, Christian López Espínola <
> penyask...@gmail.com> wrote:
>
>> Hi Israel,
>>
>> Thanks for your suggestion,
>>
>> On Thu, Oct 29, 2009 at 9:37 PM, Israel Ekpo  wrote:
>> > On Thu, Oct 29, 2009 at 3:31 PM, Christian López Espínola <
>> > penyask...@gmail.com> wrote:
>> >
>> >> Hi, my name is Christian and I'm a newbie introducing to solr (and
>> solrj).
>> >>
>> >> I'm working on a website where I want to index multiple entities, like
>> >> Book or Magazine.
>> >> The issue I'm facing is both of them have an attribute ID, which I
>> >> want to use as the uniqueKey on my schema, so I cannot identify
>> >> uniquely a document (because ID is saved in a database too, and it's
>> >> autonumeric).
>> >>
>> >> I'm sure that this is a common pattern, but I don't find the way of
>> solving
>> >> it.
>> >>
>> >> How do you usually solve this? Thanks in advance.
>> >>
>> >>
>> >> --
>> >> Cheers,
>> >>
>> >> Christian López Espínola 
>> >>
>> >
>> > Hi Christian,
>> >
>> > It looks like you are bringing in data to Solr from a database where
>> there
>> > are two separate tables.
>> >
>> > One for *Books* and another one for *Magazines*.
>> >
>> > If this is the case, you could define your uniqueKey element in Solr
>> schema
>> > to be a "string" instead of an integer then you can still load documents
>> > from both the books and magazines database tables but your could prefix
>> the
>> > uniqueKey field with "B" for books and "M" for magazines
>> >
>> > Like so :
>> >
>> > > > required="true"/>
>> >
>> > id
>> >
>> > Then when loading the books or magazines into Solr you can create the
>> > documents with id fields like this
>> >
>> > 
>> >  
>> >    B14000
>> >  
>> >  
>> >    M14000
>> >  
>> >  
>> >    B14001
>> >  
>> >  
>> >    M14001
>> >  
>> > 
>> >
>> > I hope this helps
>>
>> This was my first thought, but in practice there isn't Book and
>> Magazine, but about 50 different entities, so I'm using the Field
>> annotation of solrj for simplifying my code (it manages for me the XML
>> creation, etc).
>> One thing I thought about is if I can define my own
>> DocumentObjectBinder, so I can concatenate my entity names with the
>> IDs in the XML creation.
>>
>> Anyone knows if something like this can be done without modifying
>> Solrj sources? Is there any injection or plugin mecanism for this?
>>
>> Thanks in advance.
>>
>>
>> > --
>> > "Good Enough" is not good enough.
>> > To give anything less than your best is to sacrifice the gift.
>> > Quality First. Measure Twice. Cut Once.
>> >
>>
>>
>>
>> --
>> Cheers,
>>
>> Christian López Espínola 
>>
>



-- 
Cheers,

Christian López Espínola 


Re: solr search

2009-10-30 Thread Karsten F.

hi manishkbawne,

unspecific ideas of search improvements are her:
http://wiki.apache.org/solr/SolrPerformanceFactors

I really like the last idea in
http://wiki.apache.org/lucene-java/ImproveSearchingSpeed
:
Use a profiler and ask a more specific question in this forum.

Best regards
  Karsten



manishkbawne wrote:
> 
> I am using solr search to search through xml files. As I am working on
> millions of data, the result output is slower. Can anyone please suggest
> me some way, by which I can increase the search result output? 
> 

-- 
View this message in context: 
http://old.nabble.com/solr-search-tp26125183p26126041.html
Sent from the Solr - User mailing list archive at Nabble.com.