Re: DIH - Export to XML
nope... On Sat, Oct 31, 2009 at 4:47 AM, Yao Ge wrote: > > For Data Import Handler, there is a way to dump data to a SOLR feed format > XML file? > -- > View this message in context: > http://old.nabble.com/DIH---Export-to-XML-tp26138213p26138213.html > Sent from the Solr - User mailing list archive at Nabble.com. > > -- - Noble Paul | Principal Engineer| AOL | http://aol.com
Re: solr search
Hi, I have made following changes in solrconfig.xml C:/Apache-Tomcat/apache-tomcat-6.0.20/solr/conf/db-data-config.xml in db-dataconfig.xml in schema.xml files Please suggest me the possible cause of error?? Lance Norskog-2 wrote: > > Please post your dataimporthandler configuration file. > > On Fri, Oct 30, 2009 at 4:17 AM, manishkbawne > wrote: >> >> Thanks for your reply .. I am trying to use the database for solr search >> but >> getting this error.. >> >> false in null >> - >> java.lang.NullPointerException at >> org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95) >> at >> org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106) >> at org.apache.solr.core.SolrResourceLoader >> >> Can you please suggest me some possible solution? >> >> >> >> >> >> >> >> >> Karsten F. wrote: >>> >>> hi manishkbawne, >>> >>> unspecific ideas of search improvements are her: >>> http://wiki.apache.org/solr/SolrPerformanceFactors >>> >>> I really like the last idea in >>> http://wiki.apache.org/lucene-java/ImproveSearchingSpeed >>> : >>> Use a profiler and ask a more specific question in this forum. >>> >>> Best regards >>> Karsten >>> >>> >>> >>> manishkbawne wrote: I am using solr search to search through xml files. As I am working on millions of data, the result output is slower. Can anyone please suggest me some way, by which I can increase the search result output? >>> >>> >> >> -- >> View this message in context: >> http://old.nabble.com/solr-search-tp26125183p26128341.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> >> > > > > -- > Lance Norskog > goks...@gmail.com > > -- View this message in context: http://old.nabble.com/solr-search-tp26125183p26139946.html Sent from the Solr - User mailing list archive at Nabble.com.
DIH - Export to XML
For Data Import Handler, there is a way to dump data to a SOLR feed format XML file? -- View this message in context: http://old.nabble.com/DIH---Export-to-XML-tp26138213p26138213.html Sent from the Solr - User mailing list archive at Nabble.com.
SpellCheckComponent suggestions and case
I am having great difficulty getting SpellCheckComponent to ignore case. Given a search of Glod, the suggestion is wood Given a search of glod, the suggestion is gold I am using LowerCaseTokenizerFactory for both query and index, so as I understand it Glod and glod should be treated the same. If not, how can I truly ignore case? Here is my fieldType: -- View this message in context: http://old.nabble.com/SpellCheckComponent-suggestions-and-case-tp26137035p26137035.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: solr web ui
There is also a GWT contribution in JIRA that is pretty handy and will likely be added in 1.5. See http://issues.apache.org/jira/browse/SOLR-1163 -Grant On Oct 29, 2009, at 9:17 PM, scabbage wrote: Hi, I'm a new solr user. I would like to know if there are any easy to setup web UIs for solr. It can be as simple as a search box, term highlighting and basic faceting. Basically I'm using solr to store all our automation testing logs and would like to have a simple searchable UI. I don't wanna spent too much time writing my own. Thanks. -- View this message in context: http://www.nabble.com/solr-web-ui-tp26123604p26123604.html Sent from the Solr - User mailing list archive at Nabble.com.
Lucene FieldCache memory requirements
Hi, Can anyone confirm Lucene FieldCache memory requirements? I have 100 millions docs with non-tokenized field "country" (10 different countries); I expect it requires array of ("int", "long"), size of array 100,000,000, without any impact of "country" field length; it requires 600,000,000 bytes: "int" is pointer to document (Lucene document ID), and "long" is pointer to String value... Am I right, is it 600Mb just for this "country" (indexed, non-tokenized, non-boolean) field and 1 million docs? I need to calculate exact minimum RAM requirements... I believe it shouldn't depend on cardinality (distribution) of field... Thanks, Fuad
RE: can't find solr.xml
It should be in couple of places like - apache-solr-1.3\example\exampledocs etc -Ankit -Original Message- From: javaxmlsoapdev [mailto:vika...@yahoo.com] Sent: Friday, October 30, 2009 4:50 PM To: solr-user@lucene.apache.org Subject: can't find solr.xml I have downloaded apache-solr-1.3.0.tgz for Linux and don't see solr.xml. can someone assist. -- View this message in context: http://old.nabble.com/can%27t-find-solr.xml-tp26136630p26136630.html Sent from the Solr - User mailing list archive at Nabble.com.
can't find solr.xml
I have downloaded apache-solr-1.3.0.tgz for Linux and don't see solr.xml. can someone assist. -- View this message in context: http://old.nabble.com/can%27t-find-solr.xml-tp26136630p26136630.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: solr web ui
This is a cool out-of-the-box index browser. I'm not sure it is meant to be customizeable. On Fri, Oct 30, 2009 at 10:24 AM, scabbage wrote: > > Jay, > > This looks awesome. I just tried it and it's what I need. > > Thanks. > > David > > > Jay Hill wrote: >> >> Have a look at the VelocityResponseWriter ( >> http://wiki.apache.org/solr/VelocityResponseWriter). It's in the contrib >> area, but the wiki has instructions on how to move it into your core Solr. >> Solr uses response writers to return results. The default is XML but >> responses can be returned in JSON, Ruby and other formats. The >> VelocityResponseWriter enables responses returned using Velocity >> templates. >> It sounds like exactly what you need. >> >> -Jay >> http://www.lucidimagination.com >> >> >> On Thu, Oct 29, 2009 at 6:17 PM, scabbage wrote: >> >>> >>> Hi, >>> >>> I'm a new solr user. I would like to know if there are any easy to setup >>> web >>> UIs for solr. It can be as simple as a search box, term highlighting and >>> basic faceting. Basically I'm using solr to store all our automation >>> testing >>> logs and would like to have a simple searchable UI. I don't wanna spent >>> too >>> much time writing my own. >>> >>> Thanks. >>> -- >>> View this message in context: >>> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html >>> Sent from the Solr - User mailing list archive at Nabble.com. >>> >>> >> >> > > -- > View this message in context: > http://old.nabble.com/solr-web-ui-tp26123604p26133935.html > Sent from the Solr - User mailing list archive at Nabble.com. > > -- Lance Norskog goks...@gmail.com
Re: solr search
Please post your dataimporthandler configuration file. On Fri, Oct 30, 2009 at 4:17 AM, manishkbawne wrote: > > Thanks for your reply .. I am trying to use the database for solr search but > getting this error.. > > false in null > - > java.lang.NullPointerException at > org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95) > at > org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106) > at org.apache.solr.core.SolrResourceLoader > > Can you please suggest me some possible solution? > > > > > > > > > Karsten F. wrote: >> >> hi manishkbawne, >> >> unspecific ideas of search improvements are her: >> http://wiki.apache.org/solr/SolrPerformanceFactors >> >> I really like the last idea in >> http://wiki.apache.org/lucene-java/ImproveSearchingSpeed >> : >> Use a profiler and ask a more specific question in this forum. >> >> Best regards >> Karsten >> >> >> >> manishkbawne wrote: >>> >>> I am using solr search to search through xml files. As I am working on >>> millions of data, the result output is slower. Can anyone please suggest >>> me some way, by which I can increase the search result output? >>> >> >> > > -- > View this message in context: > http://old.nabble.com/solr-search-tp26125183p26128341.html > Sent from the Solr - User mailing list archive at Nabble.com. > > -- Lance Norskog goks...@gmail.com
Re: best way to model 1-N
Im using apache-solr-1.3.0 I got it to work using javascript function instead. thanks Joel On Oct 30, 2009, at 12:44 PM, Chantal Ackermann wrote: This looks all right to me, but I might be missing something. Which version/build of SOLR are you using? Chantal Joel Nylund schrieb: Thanks Chantal, I will keep that in mind for tuning, for sql I figured way to combine them into one row using concat, but I still seem to be having an issue splitting them: Db now returns as one column categoryType: TOPIC,LANGUAGE but my solr result, if you note the item in categoryType all seem to be within one str, I would expect it to be in multiple strings within the array, is this assumption wrong? − TOPIC,LANGUAGE 40 feed title Here is my import: group_concat(cfcr.categoryType) as categoryType from CFR cfcr where cfcr.feedId = '$ {item.id}' AND group by cfcr.feedId"> name="categoryType" splityBy="," /> In schema: stored="true" required="false" multiValued="true"/> stored="true" required="false" multiValued="true"/> what am I missing? thanks Joel On Oct 30, 2009, at 10:00 AM, Chantal Ackermann wrote: That depends a bit on your database, but it is tricky and might not be performant. If you are more of a Java developer, you might prefer retrieving mutliple rows per SOLR document from your dataSource (join on your category and main table), and aggregate them in your custom EntityProcessor. I got a far(!) better performance retrieving everything in one query and doing the aggregation in Java. But this is, of course, depending on your table structure and data. Noble Paul helped me with the custom EntityProcessor, and it turned out quite easy. Have a look at the thread with the heading from this mailing list (SOLR-USER): DataImportHandler / Import from DB : one data set comes in multiple rows Cheers, Chantal Joel Nylund schrieb: thanks, but im confused how I can aggregate across rows, I dont know of any easy way to get my db to return one row for all the categories (given the hint from your other email), I have split the category query into a separate entity, but its returning multiple rows, how do I combine multiple rows into 1 index entity? thanks Joel On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote: In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. Yes you are right. A multivalued field for "categories" is the answer. For populating in the index - 1. If you use DIH to populate your indexes and your datasource is a database then you can use DIH's RegexTransformer on an aggregated list of categories. e.g. if your database query retruns "a,b,c,d" in a column called "db_categories", this is how you would put it in DIH's data-config file - . 2. If you "add" documents to Solr yourself multiple values for the field can be specified as an array or list of values in the SolrInputDocument. A multivalued field provides the same faceting and searching capabilites like regular fields. There is no special syntax. Cheers Avlesh On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund wrote: Hi, I have one index so far which contains feeds. I have been able to de-normalize several tables and map this data onto the feed entity. There is one tricky problem that I need help on. Feeds have 1 - many categories. So Lets say we have Category1, Category2 and Category3 Feed 1 - is in Category 1 Feed 2 is in category2 and category3 Feed 3 is in category2 Feed 4 has no category In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. thanks Joel
autocomplete
Hi guys, Enterprise 1.4 Solr Book (AutoComplete) says this works - My query looks like - "q=*:*&fq=ac:*all*&wt=json&rows=15&start=0&indent=on&omitHeader=true&json.wrf=?"; And it returns three results { "responseHeader":{ "status":0, "QTime":38, "params":{ "indent":"on", "start":"0", "q":"*:*", "wt":"json", "fq":"ac:*all*", "rows":"15"}}, "response":{"numFound":3,"start":0,"docs":[ { "id":"1", "ac":"Can you show me all the results"}, { "id":"2", "ac":"Can you show all companies "}, { "id":"3", "ac":"Can you list all companies"}] }} But browser says syntax error -- Ankit
Re: slave java replication connection issue in nightly (wrong url?)
ah, problem solved, it was due (apparently) caching an old dns entry sorry for the noise :)
Re: Is it possible to a Transform handle data from 2 rows?
Hello Chantal! Thanks for your reply! Actually, in my case, faceting will not help me because I really need all "rows" from the same group at index time. That's because I'll use all rows from the same group, merge them all into a Velocity (or any other template engine) to generate an HTML that will have all items information and willbe queries with the group id param. For example, the select will return all stores (group) and departments (item) in my system. I'll generate an index with N docs, each one with the store id (group) and the final HTML for all its departments! Got it? What you are doing, is any similar to this? Thanks! Lucas Frare Teixeira .·. - lucas...@gmail.com - lucastex.com.br - blog.lucastex.com - twitter.com/lucastex On Fri, Oct 30, 2009 at 7:32 AM, Chantal Ackermann < chantal.ackerm...@btelligent.de> wrote: > Hi Lucas, > > check out the thread: > DataImportHandler / Import from DB : one data set comes in multiple rows > > > I am doing this successfully with my custom EntityProcessor as described in > that thread. > If you have any more questions or need some more code examples, just ask. > I'd be glad to help! > > > Just wondering: In your SQL statement the ID that you want to use as unique > ID in SOLR is called "GROUP". You are sure you don't want to create one SOLR > document per ITEM_ID and add the GROUP ID as a field so that you can sort > and facet based on it? (Of course, you can facet based on ITEM_ID in the > other case, as well.) > > Cheers, > Chantal > > Noble Paul നോബിള് नोब्ळ् schrieb: > > how do you know that all the rows with the same group id has already come? >> transformer can store the row and return null till you collect all the >> rows and then you may return one consolidated row. But the problem is >> it does not know when the row ends. >> >> you can override the SqlEntityProcessor and just override the >> nextRow() this should do the trick >> >> On Fri, Oct 30, 2009 at 1:52 AM, Lucas F. A. Teixeira >> wrote: >> >>> Hello all, >>> >>> Is it possible to have my own implemented transformer that generates some >>> output data from data of 2 different rows? >>> My intention is to generate pre-processed HTML from a set of rows. >>> >>> Eg. >>> >>> SQL RESULT: >>> GROUP | ITEM_ID | DESCRIPTION >>> 1 | 8 | desc1 >>> 1 | 23 | desc2 >>> 1 | 563 | desc3 >>> 2 | 43 | desc4 >>> 2 | 3222| desc5 >>> >>> And with this result, build a Transformer to group all entries in the >>> same >>> group and build a html using them. My idea in the final index is to have >>> 2 >>> docs with >>> >>> >>> * id=1 >>> * content = "desc1>> class='23'>desc2desc3" >>> >>> >>> >>> * id=2 >>> * content = "desc4>> class='3222'>desc5" >>> >>> >>> I've builted some custom transformers before, but never using data from >>> different rows. >>> >>> If this is not possible with transformers, how can I achieve this? >>> >>> Thanks! >>> >>> >>> >>> >>> Lucas Frare Teixeira .·. >>> - lucas...@gmail.com >>> - lucastex.com.br >>> - blog.lucastex.com >>> - twitter.com/lucastex >>> >>> >> >> >> -- >> - >> Noble Paul | Principal Engineer| AOL | http://aol.com >> >
Re: solr web ui
Jay, This looks awesome. I just tried it and it's what I need. Thanks. David Jay Hill wrote: > > Have a look at the VelocityResponseWriter ( > http://wiki.apache.org/solr/VelocityResponseWriter). It's in the contrib > area, but the wiki has instructions on how to move it into your core Solr. > Solr uses response writers to return results. The default is XML but > responses can be returned in JSON, Ruby and other formats. The > VelocityResponseWriter enables responses returned using Velocity > templates. > It sounds like exactly what you need. > > -Jay > http://www.lucidimagination.com > > > On Thu, Oct 29, 2009 at 6:17 PM, scabbage wrote: > >> >> Hi, >> >> I'm a new solr user. I would like to know if there are any easy to setup >> web >> UIs for solr. It can be as simple as a search box, term highlighting and >> basic faceting. Basically I'm using solr to store all our automation >> testing >> logs and would like to have a simple searchable UI. I don't wanna spent >> too >> much time writing my own. >> >> Thanks. >> -- >> View this message in context: >> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> >> > > -- View this message in context: http://old.nabble.com/solr-web-ui-tp26123604p26133935.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: solr web ui
Chantal, Yes, I have been using the admin ui, but only for development. I need a ui that displays the query result XML in HTML. Thanks for the suggestion though. David Chantal Ackermann wrote: > > Hi! > > Just to make sure - you did see the search form in the SOLR admin ui? > > Chantal > > scabbage schrieb: >> Hi, >> >> I'm a new solr user. I would like to know if there are any easy to setup >> web >> UIs for solr. It can be as simple as a search box, term highlighting and >> basic faceting. Basically I'm using solr to store all our automation >> testing >> logs and would like to have a simple searchable UI. I don't wanna spent >> too >> much time writing my own. >> >> Thanks. >> -- >> View this message in context: >> http://www.nabble.com/solr-web-ui-tp26123604p26123604.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> > > -- View this message in context: http://old.nabble.com/solr-web-ui-tp26123604p26133745.html Sent from the Solr - User mailing list archive at Nabble.com.
slave java replication connection issue in nightly (wrong url?)
hi everyone and sorry if this question is dumb, but I seem to be unable to connect from a slave, with both are in a multicore setup I configured the replication like ${enable.master:false} commit startup optimize schema.xml ${enable.slave:false} http://master.solr.lan:8080/solr/core/replication 00:00:60 And it seems that both master and slave loaded the configuration correctly e.g. in the admin interface I see * /replication added * my config values correctly loaded * replicationEnabled = true in master * isSlave and isMaster correctly set on both * the http://master.solr.lan:8080/solr/core/replication gives me "status ok" when accessed as a GET (curl) from the slave machine yet, when I check from the slave's admin interface it says that http://master.solr.lan:8080/solr/core/replication is unreachable, and from the statistics I see isReplicating : false and in the logs SEVERE: Master at: http://master.solr.lan:8080/solr/core/replication is not available. Index fetch failed. Exception: Request failed for the url org.apache.commons.httpclient.methods.postmet...@48f3d27 I'm thinking maybe I should have configured the url differently, but frankly I don't have a clue. Could someone point out what is the obvious error I'm doing ? :) Additional informations: * slave nightly is from today, master from a few days ago * both machines are 64 bit and running on linux * java -version is Java version "1.6.0_16" Java(TM) SE Runtime Environment (build 1.6.0_16-b01) Java HotSpot(TM) 64-Bit Server VM (build 14.2-b01, mixed mode) * solr is deployed in a tomcat6 container
Re: adding and updating a lot of document to Solr, metadata extraction etc
On Fri, Oct 30, 2009 at 11:23 AM, Eugene Dzhurinsky wrote: > Hi there! > > We are trying to evaluate Apache Solr for our custom search implementation, > which > includes the following requirements: > > - ability to add/update/delete a lot of documents at once > > - ability to iterate over all documents, returned in search, as Lucene does > provide within a HitCollector instance. We would need to extract and > aggregate various fields, stored in index, to group results and aggregate > them > in some way. > > After reading the tutorial I've realized that adding and removal of > documents > is performed through passing an XML file to controller in POST request. > However our XML files may be very, very large - so I hope there is some > another option to avoid interaction through HTTP protocol. > > Also I did not find any way in the tutorial to access the search results > with > all fields to be processed by our application. > > I think I simply did not read the documentation well or missed some point, > so > can somebody please point me to the articles, which may explain basics of > how > to achieve my goals? > > Thank you very much in advance! > > -- > Eugene N Dzhurinsky > Hi Eugene Solr has an embedded version but you are encouraged to use the standard web service interfaces. Also, the Solr 1.4 white paper just recently released talks about the the Streaming Updates Solr Server which according to the white paper can index documents at an incredibly lightening speed of up to 25K documents per second. The white paper can be downloaded here http://www.lucidimagination.com/whitepaper/whats-new-in-solr-1-4 Info about Streaming Update Solr Server is available here http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer.html If you are still interested in the Embedded version to avoid the HTTP version you can check out the following links http://wiki.apache.org/solr/EmbeddedSolr http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.html I hope this helps. -- "Good Enough" is not good enough. To give anything less than your best is to sacrifice the gift. Quality First. Measure Twice. Cut Once.
Re: best way to model 1-N
This looks all right to me, but I might be missing something. Which version/build of SOLR are you using? Chantal Joel Nylund schrieb: Thanks Chantal, I will keep that in mind for tuning, for sql I figured way to combine them into one row using concat, but I still seem to be having an issue splitting them: Db now returns as one column categoryType: TOPIC,LANGUAGE but my solr result, if you note the item in categoryType all seem to be within one str, I would expect it to be in multiple strings within the array, is this assumption wrong? − TOPIC,LANGUAGE 40 feed title Here is my import: In schema: what am I missing? thanks Joel On Oct 30, 2009, at 10:00 AM, Chantal Ackermann wrote: That depends a bit on your database, but it is tricky and might not be performant. If you are more of a Java developer, you might prefer retrieving mutliple rows per SOLR document from your dataSource (join on your category and main table), and aggregate them in your custom EntityProcessor. I got a far(!) better performance retrieving everything in one query and doing the aggregation in Java. But this is, of course, depending on your table structure and data. Noble Paul helped me with the custom EntityProcessor, and it turned out quite easy. Have a look at the thread with the heading from this mailing list (SOLR-USER): DataImportHandler / Import from DB : one data set comes in multiple rows Cheers, Chantal Joel Nylund schrieb: thanks, but im confused how I can aggregate across rows, I dont know of any easy way to get my db to return one row for all the categories (given the hint from your other email), I have split the category query into a separate entity, but its returning multiple rows, how do I combine multiple rows into 1 index entity? thanks Joel On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote: In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. Yes you are right. A multivalued field for "categories" is the answer. For populating in the index - 1. If you use DIH to populate your indexes and your datasource is a database then you can use DIH's RegexTransformer on an aggregated list of categories. e.g. if your database query retruns "a,b,c,d" in a column called "db_categories", this is how you would put it in DIH's data-config file - . 2. If you "add" documents to Solr yourself multiple values for the field can be specified as an array or list of values in the SolrInputDocument. A multivalued field provides the same faceting and searching capabilites like regular fields. There is no special syntax. Cheers Avlesh On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund wrote: Hi, I have one index so far which contains feeds. I have been able to de-normalize several tables and map this data onto the feed entity. There is one tricky problem that I need help on. Feeds have 1 - many categories. So Lets say we have Category1, Category2 and Category3 Feed 1 - is in Category 1 Feed 2 is in category2 and category3 Feed 3 is in category2 Feed 4 has no category In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. thanks Joel
Re: Iso accents and wildcards
if the request contains any wilcard then filters are not called : no ISOLatin1AccentFilterFactory and no SnowballPorterFilterFactory ! "économie" is indexed to "econom" solr don't found : - term starts with "éco" (éco*) - term starts with "economi" (economi*) if you index manger, mangé and mangue, the indexed terms will be mang and mangu requests -> results manger -> mange, mangé mangé-> mange, mangé mang -> mange, manger mangu-> mangue mang*-> manger, mangé, mangue mang?-> mangue (and not mangé) mangé* -> nothing Jean-François - "Nicolas Leconte" a écrit : | Hi all, | | I have a field that contains accentuated char in it, what I whant is | to | be able to search with ignore accents. | I have set up that field with : | | | | | | | | | | | | | In the index the word "économie" is translated to "econom", the | accent | is removed thanks to the ISOLatin1AccentFilterFactory and the end of | the | word removent thanks to the SnowballPorterFilterFactory. | | When I request with title:econ* I can have the correct answers, but | if | I request with title:écon* I have no answers. | If I request with title:économ (the exact word of the index) it works, | | so there might be something wrong with the wildcard. | As far as I can understand the analyser should be use exactly the same | | in both index and query time. | | I have tested with changing the order of the filters (putting the | ISOLatin1AccentFilterFactory on top) without any result. | | Could anybody help me with that and point me what may be wrong with my | | shema ?
Re: best way to model 1-N
Thanks Chantal, I will keep that in mind for tuning, for sql I figured way to combine them into one row using concat, but I still seem to be having an issue splitting them: Db now returns as one column categoryType: TOPIC,LANGUAGE but my solr result, if you note the item in categoryType all seem to be within one str, I would expect it to be in multiple strings within the array, is this assumption wrong? − TOPIC,LANGUAGE 40 feed title Here is my import: from CFR cfcr where cfcr.feedId = '${item.id}' AND group by cfcr.feedId"> splityBy="," /> In schema: required="false" multiValued="true"/> required="false" multiValued="true"/> what am I missing? thanks Joel On Oct 30, 2009, at 10:00 AM, Chantal Ackermann wrote: That depends a bit on your database, but it is tricky and might not be performant. If you are more of a Java developer, you might prefer retrieving mutliple rows per SOLR document from your dataSource (join on your category and main table), and aggregate them in your custom EntityProcessor. I got a far(!) better performance retrieving everything in one query and doing the aggregation in Java. But this is, of course, depending on your table structure and data. Noble Paul helped me with the custom EntityProcessor, and it turned out quite easy. Have a look at the thread with the heading from this mailing list (SOLR-USER): DataImportHandler / Import from DB : one data set comes in multiple rows Cheers, Chantal Joel Nylund schrieb: thanks, but im confused how I can aggregate across rows, I dont know of any easy way to get my db to return one row for all the categories (given the hint from your other email), I have split the category query into a separate entity, but its returning multiple rows, how do I combine multiple rows into 1 index entity? thanks Joel On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote: In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. Yes you are right. A multivalued field for "categories" is the answer. For populating in the index - 1. If you use DIH to populate your indexes and your datasource is a database then you can use DIH's RegexTransformer on an aggregated list of categories. e.g. if your database query retruns "a,b,c,d" in a column called "db_categories", this is how you would put it in DIH's data-config file - . 2. If you "add" documents to Solr yourself multiple values for the field can be specified as an array or list of values in the SolrInputDocument. A multivalued field provides the same faceting and searching capabilites like regular fields. There is no special syntax. Cheers Avlesh On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund wrote: Hi, I have one index so far which contains feeds. I have been able to de-normalize several tables and map this data onto the feed entity. There is one tricky problem that I need help on. Feeds have 1 - many categories. So Lets say we have Category1, Category2 and Category3 Feed 1 - is in Category 1 Feed 2 is in category2 and category3 Feed 3 is in category2 Feed 4 has no category In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. thanks Joel
Iso accents and wildcards
Hi all, I have a field that contains accentuated char in it, what I whant is to be able to search with ignore accents. I have set up that field with : generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1" /> words="stopwords.txt" /> In the index the word "économie" is translated to "econom", the accent is removed thanks to the ISOLatin1AccentFilterFactory and the end of the word removent thanks to the SnowballPorterFilterFactory. When I request with title:econ* I can have the correct answers, but if I request with title:écon* I have no answers. If I request with title:économ (the exact word of the index) it works, so there might be something wrong with the wildcard. As far as I can understand the analyser should be use exactly the same in both index and query time. I have tested with changing the order of the filters (putting the ISOLatin1AccentFilterFactory on top) without any result. Could anybody help me with that and point me what may be wrong with my shema ?
adding and updating a lot of document to Solr, metadata extraction etc
Hi there! We are trying to evaluate Apache Solr for our custom search implementation, which includes the following requirements: - ability to add/update/delete a lot of documents at once - ability to iterate over all documents, returned in search, as Lucene does provide within a HitCollector instance. We would need to extract and aggregate various fields, stored in index, to group results and aggregate them in some way. After reading the tutorial I've realized that adding and removal of documents is performed through passing an XML file to controller in POST request. However our XML files may be very, very large - so I hope there is some another option to avoid interaction through HTTP protocol. Also I did not find any way in the tutorial to access the search results with all fields to be processed by our application. I think I simply did not read the documentation well or missed some point, so can somebody please point me to the articles, which may explain basics of how to achieve my goals? Thank you very much in advance! -- Eugene N Dzhurinsky pgpJ2ZR6rFHSF.pgp Description: PGP signature
Re: solr web ui
Have a look at the VelocityResponseWriter ( http://wiki.apache.org/solr/VelocityResponseWriter). It's in the contrib area, but the wiki has instructions on how to move it into your core Solr. Solr uses response writers to return results. The default is XML but responses can be returned in JSON, Ruby and other formats. The VelocityResponseWriter enables responses returned using Velocity templates. It sounds like exactly what you need. -Jay http://www.lucidimagination.com On Thu, Oct 29, 2009 at 6:17 PM, scabbage wrote: > > Hi, > > I'm a new solr user. I would like to know if there are any easy to setup > web > UIs for solr. It can be as simple as a search box, term highlighting and > basic faceting. Basically I'm using solr to store all our automation > testing > logs and would like to have a simple searchable UI. I don't wanna spent too > much time writing my own. > > Thanks. > -- > View this message in context: > http://www.nabble.com/solr-web-ui-tp26123604p26123604.html > Sent from the Solr - User mailing list archive at Nabble.com. > >
Solr YUI autocomplete
Does Solr supports JSONP (JSON with Padding) in the response? -Ankit -Original Message- From: Ankit Bhatnagar [mailto:abhatna...@vantage.com] Sent: Friday, October 30, 2009 10:27 AM To: 'solr-user@lucene.apache.org' Subject: Solr YUI autocomplete Hi Guys, I have question regarding - how to specify the I am using YUI autocomplete widget and it expects the JSONP response. http://localhost:8983/solr/select/?q=monitor&version=2.2&start=0&rows=10&indent=on&wt=json&json.wrf= I am not sure how should I specify the json.wrf=function Thanks Ankit
Solr YUI autocomplete
Hi Guys, I have question regarding - how to specify the I am using YUI autocomplete widget and it expects the JSONP response. http://localhost:8983/solr/select/?q=monitor&version=2.2&start=0&rows=10&indent=on&wt=json&json.wrf= I am not sure how should I specify the json.wrf=function Thanks Ankit
Re: Solr and LSA
I am using Semantic Vectors[1] implementation of LSA in a large scale digital library project called Project Torngat[2]. I presented some of the work at the European Conference on Digital Libraries (ECDL)[3], at the 'Very Large Digital Libraries (VLDL) workshop[4] in September. A pre-print of the paper is here[5]. Badsically I used it to generate similarities of journals using the full-text of all their articles, and used this to create a 2-D map of the journal space. The Semantic Vectors works very very well, but I believe it is not yet a stable (API or functionality) codebase, more of an experimental-as-yet effort, so it may be a moving target for anyone wanting to integrate it. Gllen Newton http://zzzoot.blogspot.com/ [1]http://code.google.com/p/semanticvectors/ [2]http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/Torngat1 [3]http://www.ecdl2009.eu/ [4]http://www.delos.info/vldl2009/ [5]http://cuvier.cisti.nrc.ca/~gnewton/events/2009/ecdl2009Newton_20090723.pdf 2009/10/30 György Frivolt : > That's what I found so far too. Seems to be usable. It uses a variant of > LSA, probabilistic LSA. > Writing a module for it.. have to rephrase my question them. Where to start > with writing a module for solr.. Should be useful for diving deeper in solr. > > > > On Fri, Oct 30, 2009 at 9:56 AM, Paul Libbrecht wrote: > >> Not with solr but with Lucene, there is the project called semanticvectors. >> It would be cute to make it a solr module. >> >> paul >> >> >> Le 30-oct.-09 à 09:17, György Frivolt a écrit : >> >> >> Hi, >>> Does anyone of you have experiences with using LSA, Latent Semantic >>> Analysis with Solr? I would like to search for expressions, but also find >>> records, which have context relevant for the given query. Would appreciate >>> any clue where to start, what to consider. >>> >>> Bw, Georg >>> >> >> > -- -
AW: AW: JQuery and autosuggest
as long as you are working on your own pc than this might be ok. but when someone else is accessing the site via internet, than there is "no localhost", at least not your. so at this point the real (internet-) server name is needed. btw: ajax calls are only allowed from the origin server, so a page loaded from www.a-server.com only will allow ajax-calls that also go to www.a-server.com, and not www.b-server.com or any other -- mit freundlichen Grüßen Markus Rietzler - Rechenzentrum der Finanzverwaltung NRW 0211/4572-2130 > -Ursprüngliche Nachricht- > Von: sophSophie [mailto:sop...@beezik.com] > Gesendet: Freitag, 30. Oktober 2009 14:57 > An: solr-user@lucene.apache.org > Betreff: Re: AW: JQuery and autosuggest > > > Hallo Markus ! > > I think that the servername is correct as I'm testing this > autocomplete box > on my computer (my site is built in localhost, solr is in localhost). > > Am I wrong ? > > Thanks > > > Markus.Rietzler wrote: > > > > very simple, in the javascript you have > > > > $("#artist").autocomplete( > > 'http://localhost:8983/solr/completion/select/?wt=json', { > > > > you have to change localhost to the correct servername > > > > > > -- > > mit freundlichen Grüßen > > > > Markus Rietzler - > > Rechenzentrum der Finanzverwaltung NRW > > 0211/4572-2130 > > > > > >> -Ursprüngliche Nachricht- > >> Von: sophSophie [mailto:sop...@beezik.com] > >> Gesendet: Freitag, 30. Oktober 2009 14:35 > >> An: solr-user@lucene.apache.org > >> Betreff: JQuery and autosuggest > >> > >> > >> Hello, > >> > >> I'm trying to build a autocomplete text box. I think that > >> everything is ok > >> in the schema.xml and the solrconfig.xml side as I have > >> correct responses > >> through the solr admin page (I got something like that > >> :http://brainmaths.fr/IMG/response.txt ). > >> > >> I don't know really javascript and I tried the code given in > >> Solr 1.4 book > >> page 241. You can see what I did here : > >> http://brainmaths.fr/IMG/completion.html and have a look to > >> the source code. > >> > >> The query is correcty sent to solr (I checked the log) but > >> nothing returns > >> and I have an error on this line : > >> > >> return $.map(data.response.docs, function(doc) { > >> > >> The error is : response.docs is null or isn't an object, I > >> don't know how to > >> fix it. I guess that docs aren't correctly returned. > >> > >> I need any help any ideas > >> > >> Thanks in advance > >> > >> - > >> English is not my mother tongue :) > >> -- > >> View this message in context: > >> > http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html > >> Sent from the Solr - User mailing list archive at Nabble.com. > >> > >> > > > > > > > - > English is not my mother tongue :) > -- > View this message in context: > http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130511.html > Sent from the Solr - User mailing list archive at Nabble.com. > >
Re: best way to model 1-N
That depends a bit on your database, but it is tricky and might not be performant. If you are more of a Java developer, you might prefer retrieving mutliple rows per SOLR document from your dataSource (join on your category and main table), and aggregate them in your custom EntityProcessor. I got a far(!) better performance retrieving everything in one query and doing the aggregation in Java. But this is, of course, depending on your table structure and data. Noble Paul helped me with the custom EntityProcessor, and it turned out quite easy. Have a look at the thread with the heading from this mailing list (SOLR-USER): DataImportHandler / Import from DB : one data set comes in multiple rows Cheers, Chantal Joel Nylund schrieb: thanks, but im confused how I can aggregate across rows, I dont know of any easy way to get my db to return one row for all the categories (given the hint from your other email), I have split the category query into a separate entity, but its returning multiple rows, how do I combine multiple rows into 1 index entity? thanks Joel On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote: In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. Yes you are right. A multivalued field for "categories" is the answer. For populating in the index - 1. If you use DIH to populate your indexes and your datasource is a database then you can use DIH's RegexTransformer on an aggregated list of categories. e.g. if your database query retruns "a,b,c,d" in a column called "db_categories", this is how you would put it in DIH's data-config file - . 2. If you "add" documents to Solr yourself multiple values for the field can be specified as an array or list of values in the SolrInputDocument. A multivalued field provides the same faceting and searching capabilites like regular fields. There is no special syntax. Cheers Avlesh On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund wrote: Hi, I have one index so far which contains feeds. I have been able to de-normalize several tables and map this data onto the feed entity. There is one tricky problem that I need help on. Feeds have 1 - many categories. So Lets say we have Category1, Category2 and Category3 Feed 1 - is in Category 1 Feed 2 is in category2 and category3 Feed 3 is in category2 Feed 4 has no category In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. thanks Joel
Re: AW: JQuery and autosuggest
Hallo Markus ! I think that the servername is correct as I'm testing this autocomplete box on my computer (my site is built in localhost, solr is in localhost). Am I wrong ? Thanks Markus.Rietzler wrote: > > very simple, in the javascript you have > > $("#artist").autocomplete( > 'http://localhost:8983/solr/completion/select/?wt=json', { > > you have to change localhost to the correct servername > > > -- > mit freundlichen Grüßen > > Markus Rietzler - > Rechenzentrum der Finanzverwaltung NRW > 0211/4572-2130 > > >> -Ursprüngliche Nachricht- >> Von: sophSophie [mailto:sop...@beezik.com] >> Gesendet: Freitag, 30. Oktober 2009 14:35 >> An: solr-user@lucene.apache.org >> Betreff: JQuery and autosuggest >> >> >> Hello, >> >> I'm trying to build a autocomplete text box. I think that >> everything is ok >> in the schema.xml and the solrconfig.xml side as I have >> correct responses >> through the solr admin page (I got something like that >> :http://brainmaths.fr/IMG/response.txt ). >> >> I don't know really javascript and I tried the code given in >> Solr 1.4 book >> page 241. You can see what I did here : >> http://brainmaths.fr/IMG/completion.html and have a look to >> the source code. >> >> The query is correcty sent to solr (I checked the log) but >> nothing returns >> and I have an error on this line : >> >> return $.map(data.response.docs, function(doc) { >> >> The error is : response.docs is null or isn't an object, I >> don't know how to >> fix it. I guess that docs aren't correctly returned. >> >> I need any help any ideas >> >> Thanks in advance >> >> - >> English is not my mother tongue :) >> -- >> View this message in context: >> http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> >> > > - English is not my mother tongue :) -- View this message in context: http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130511.html Sent from the Solr - User mailing list archive at Nabble.com.
AW: JQuery and autosuggest
very simple, in the javascript you have $("#artist").autocomplete( 'http://localhost:8983/solr/completion/select/?wt=json', { you have to change localhost to the correct servername -- mit freundlichen Grüßen Markus Rietzler - Rechenzentrum der Finanzverwaltung NRW 0211/4572-2130 > -Ursprüngliche Nachricht- > Von: sophSophie [mailto:sop...@beezik.com] > Gesendet: Freitag, 30. Oktober 2009 14:35 > An: solr-user@lucene.apache.org > Betreff: JQuery and autosuggest > > > Hello, > > I'm trying to build a autocomplete text box. I think that > everything is ok > in the schema.xml and the solrconfig.xml side as I have > correct responses > through the solr admin page (I got something like that > :http://brainmaths.fr/IMG/response.txt ). > > I don't know really javascript and I tried the code given in > Solr 1.4 book > page 241. You can see what I did here : > http://brainmaths.fr/IMG/completion.html and have a look to > the source code. > > The query is correcty sent to solr (I checked the log) but > nothing returns > and I have an error on this line : > > return $.map(data.response.docs, function(doc) { > > The error is : response.docs is null or isn't an object, I > don't know how to > fix it. I guess that docs aren't correctly returned. > > I need any help any ideas > > Thanks in advance > > - > English is not my mother tongue :) > -- > View this message in context: > http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html > Sent from the Solr - User mailing list archive at Nabble.com. > >
Re: solr search
Hi I never used DBMS/SQL Import in solr by myself. Possible http://old.nabble.com/Using-DataImportHandler-with-mysql-database-to20425791.html#a20435463 can help you. regards Karsten manishkbawne wrote: > > Thanks for your reply .. I am trying to use the database for solr search > but getting this error.. > > false in null > - > java.lang.NullPointerException at > org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95) > at > org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106) > at org.apache.solr.core.SolrResourceLoader > > Can you please suggest me some possible solution? > -- View this message in context: http://old.nabble.com/solr-search-tp26125183p26130344.html Sent from the Solr - User mailing list archive at Nabble.com.
JQuery and autosuggest
Hello, I'm trying to build a autocomplete text box. I think that everything is ok in the schema.xml and the solrconfig.xml side as I have correct responses through the solr admin page (I got something like that :http://brainmaths.fr/IMG/response.txt ). I don't know really javascript and I tried the code given in Solr 1.4 book page 241. You can see what I did here : http://brainmaths.fr/IMG/completion.html and have a look to the source code. The query is correcty sent to solr (I checked the log) but nothing returns and I have an error on this line : return $.map(data.response.docs, function(doc) { The error is : response.docs is null or isn't an object, I don't know how to fix it. I guess that docs aren't correctly returned. I need any help any ideas Thanks in advance - English is not my mother tongue :) -- View this message in context: http://old.nabble.com/JQuery-and-autosuggest-tp26130209p26130209.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: best way to model 1-N
thanks, but im confused how I can aggregate across rows, I dont know of any easy way to get my db to return one row for all the categories (given the hint from your other email), I have split the category query into a separate entity, but its returning multiple rows, how do I combine multiple rows into 1 index entity? thanks Joel On Oct 29, 2009, at 8:58 PM, Avlesh Singh wrote: In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. Yes you are right. A multivalued field for "categories" is the answer. For populating in the index - 1. If you use DIH to populate your indexes and your datasource is a database then you can use DIH's RegexTransformer on an aggregated list of categories. e.g. if your database query retruns "a,b,c,d" in a column called "db_categories", this is how you would put it in DIH's data-config file - . 2. If you "add" documents to Solr yourself multiple values for the field can be specified as an array or list of values in the SolrInputDocument. A multivalued field provides the same faceting and searching capabilites like regular fields. There is no special syntax. Cheers Avlesh On Fri, Oct 30, 2009 at 4:55 AM, Joel Nylund wrote: Hi, I have one index so far which contains feeds. I have been able to de-normalize several tables and map this data onto the feed entity. There is one tricky problem that I need help on. Feeds have 1 - many categories. So Lets say we have Category1, Category2 and Category3 Feed 1 - is in Category 1 Feed 2 is in category2 and category3 Feed 3 is in category2 Feed 4 has no category In the database this is modeled a a 1-N where category table has the mapping of feed to category I need to be able to query , give me all the feeds in any given category. How can I best model this in solr? Seems like multiValued field might help, but how would I populate it, and would the query above work?. thanks Joel
Re: solr search
Thanks for your reply .. I am trying to use the database for solr search but getting this error.. false in null - java.lang.NullPointerException at org.apache.solr.handler.dataimport.DataImporter.(DataImporter.java:95) at org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:106) at org.apache.solr.core.SolrResourceLoader Can you please suggest me some possible solution? Karsten F. wrote: > > hi manishkbawne, > > unspecific ideas of search improvements are her: > http://wiki.apache.org/solr/SolrPerformanceFactors > > I really like the last idea in > http://wiki.apache.org/lucene-java/ImproveSearchingSpeed > : > Use a profiler and ask a more specific question in this forum. > > Best regards > Karsten > > > > manishkbawne wrote: >> >> I am using solr search to search through xml files. As I am working on >> millions of data, the result output is slower. Can anyone please suggest >> me some way, by which I can increase the search result output? >> > > -- View this message in context: http://old.nabble.com/solr-search-tp26125183p26128341.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Is it possible to a Transform handle data from 2 rows?
Hi Lucas, check out the thread: DataImportHandler / Import from DB : one data set comes in multiple rows I am doing this successfully with my custom EntityProcessor as described in that thread. If you have any more questions or need some more code examples, just ask. I'd be glad to help! Just wondering: In your SQL statement the ID that you want to use as unique ID in SOLR is called "GROUP". You are sure you don't want to create one SOLR document per ITEM_ID and add the GROUP ID as a field so that you can sort and facet based on it? (Of course, you can facet based on ITEM_ID in the other case, as well.) Cheers, Chantal Noble Paul നോബിള് नोब्ळ् schrieb: how do you know that all the rows with the same group id has already come? transformer can store the row and return null till you collect all the rows and then you may return one consolidated row. But the problem is it does not know when the row ends. you can override the SqlEntityProcessor and just override the nextRow() this should do the trick On Fri, Oct 30, 2009 at 1:52 AM, Lucas F. A. Teixeira wrote: Hello all, Is it possible to have my own implemented transformer that generates some output data from data of 2 different rows? My intention is to generate pre-processed HTML from a set of rows. Eg. SQL RESULT: GROUP | ITEM_ID | DESCRIPTION 1 | 8 | desc1 1 | 23 | desc2 1 | 563 | desc3 2 | 43 | desc4 2 | 3222| desc5 And with this result, build a Transformer to group all entries in the same group and build a html using them. My idea in the final index is to have 2 docs with * id=1 * content = "desc1desc2desc3" * id=2 * content = "desc4desc5" I've builted some custom transformers before, but never using data from different rows. If this is not possible with transformers, how can I achieve this? Thanks! Lucas Frare Teixeira .·. - lucas...@gmail.com - lucastex.com.br - blog.lucastex.com - twitter.com/lucastex -- - Noble Paul | Principal Engineer| AOL | http://aol.com
Re: solr web ui
Hi! Just to make sure - you did see the search form in the SOLR admin ui? Chantal scabbage schrieb: Hi, I'm a new solr user. I would like to know if there are any easy to setup web UIs for solr. It can be as simple as a search box, term highlighting and basic faceting. Basically I'm using solr to store all our automation testing logs and would like to have a simple searchable UI. I don't wanna spent too much time writing my own. Thanks. -- View this message in context: http://www.nabble.com/solr-web-ui-tp26123604p26123604.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Solr and LSA
That's what I found so far too. Seems to be usable. It uses a variant of LSA, probabilistic LSA. Writing a module for it.. have to rephrase my question them. Where to start with writing a module for solr.. Should be useful for diving deeper in solr. On Fri, Oct 30, 2009 at 9:56 AM, Paul Libbrecht wrote: > Not with solr but with Lucene, there is the project called semanticvectors. > It would be cute to make it a solr module. > > paul > > > Le 30-oct.-09 à 09:17, György Frivolt a écrit : > > > Hi, >> Does anyone of you have experiences with using LSA, Latent Semantic >> Analysis with Solr? I would like to search for expressions, but also find >> records, which have context relevant for the given query. Would appreciate >> any clue where to start, what to consider. >> >> Bw, Georg >> > >
Re: Solr and LSA
Not with solr but with Lucene, there is the project called semanticvectors. It would be cute to make it a solr module. paul Le 30-oct.-09 à 09:17, György Frivolt a écrit : Hi, Does anyone of you have experiences with using LSA, Latent Semantic Analysis with Solr? I would like to search for expressions, but also find records, which have context relevant for the given query. Would appreciate any clue where to start, what to consider. Bw, Georg smime.p7s Description: S/MIME cryptographic signature
Solr and LSA
Hi, Does anyone of you have experiences with using LSA, Latent Semantic Analysis with Solr? I would like to search for expressions, but also find records, which have context relevant for the given query. Would appreciate any clue where to start, what to consider. Bw, Georg
Re: Indexing multiple entities
On Fri, Oct 30, 2009 at 2:04 AM, Avlesh Singh wrote: >> >> One thing I thought about is if I can define my own >> DocumentObjectBinder, so I can concatenate my entity names with the >> IDs in the XML creation. >> >> Anyone knows if something like this can be done without modifying >> Solrj sources? Is there any injection or plugin mecanism for this? >> > More details on the use-case please. If I index a Book with ID=3, and then a Magazine with ID=3, I'll be really removing my Book3 and indexing Magazine3. I want both entities to be in the index. The use case on DocumentObjectBinder is that I could override toSolrInputDocument, and if field = ID, I could do: setField("id", obj.getClass().getName() + obj.getId()) or something like that. The goal is avoiding creating all the XMLs to be sent to Solr but having the possibility of modifying them in some way. Do you know how can I do that, or a better way of achieving the same results? > Cheers > Avlesh > > On Fri, Oct 30, 2009 at 2:16 AM, Christian López Espínola < > penyask...@gmail.com> wrote: > >> Hi Israel, >> >> Thanks for your suggestion, >> >> On Thu, Oct 29, 2009 at 9:37 PM, Israel Ekpo wrote: >> > On Thu, Oct 29, 2009 at 3:31 PM, Christian López Espínola < >> > penyask...@gmail.com> wrote: >> > >> >> Hi, my name is Christian and I'm a newbie introducing to solr (and >> solrj). >> >> >> >> I'm working on a website where I want to index multiple entities, like >> >> Book or Magazine. >> >> The issue I'm facing is both of them have an attribute ID, which I >> >> want to use as the uniqueKey on my schema, so I cannot identify >> >> uniquely a document (because ID is saved in a database too, and it's >> >> autonumeric). >> >> >> >> I'm sure that this is a common pattern, but I don't find the way of >> solving >> >> it. >> >> >> >> How do you usually solve this? Thanks in advance. >> >> >> >> >> >> -- >> >> Cheers, >> >> >> >> Christian López Espínola >> >> >> > >> > Hi Christian, >> > >> > It looks like you are bringing in data to Solr from a database where >> there >> > are two separate tables. >> > >> > One for *Books* and another one for *Magazines*. >> > >> > If this is the case, you could define your uniqueKey element in Solr >> schema >> > to be a "string" instead of an integer then you can still load documents >> > from both the books and magazines database tables but your could prefix >> the >> > uniqueKey field with "B" for books and "M" for magazines >> > >> > Like so : >> > >> > > > required="true"/> >> > >> > id >> > >> > Then when loading the books or magazines into Solr you can create the >> > documents with id fields like this >> > >> > >> > >> > B14000 >> > >> > >> > M14000 >> > >> > >> > B14001 >> > >> > >> > M14001 >> > >> > >> > >> > I hope this helps >> >> This was my first thought, but in practice there isn't Book and >> Magazine, but about 50 different entities, so I'm using the Field >> annotation of solrj for simplifying my code (it manages for me the XML >> creation, etc). >> One thing I thought about is if I can define my own >> DocumentObjectBinder, so I can concatenate my entity names with the >> IDs in the XML creation. >> >> Anyone knows if something like this can be done without modifying >> Solrj sources? Is there any injection or plugin mecanism for this? >> >> Thanks in advance. >> >> >> > -- >> > "Good Enough" is not good enough. >> > To give anything less than your best is to sacrifice the gift. >> > Quality First. Measure Twice. Cut Once. >> > >> >> >> >> -- >> Cheers, >> >> Christian López Espínola >> > -- Cheers, Christian López Espínola
Re: solr search
hi manishkbawne, unspecific ideas of search improvements are her: http://wiki.apache.org/solr/SolrPerformanceFactors I really like the last idea in http://wiki.apache.org/lucene-java/ImproveSearchingSpeed : Use a profiler and ask a more specific question in this forum. Best regards Karsten manishkbawne wrote: > > I am using solr search to search through xml files. As I am working on > millions of data, the result output is slower. Can anyone please suggest > me some way, by which I can increase the search result output? > -- View this message in context: http://old.nabble.com/solr-search-tp26125183p26126041.html Sent from the Solr - User mailing list archive at Nabble.com.