Re: about partial index update
Hi, thanks for the reply!

In my case the vote keeps being updated in the database, but ExternalFileField seems to take its values from a file. So if I use ExternalFileField, is there any way to read the value from the index instead?

regards,
kiwi

On Fri, Dec 23, 2011 at 10:41 PM, Ahmet Arslan iori...@yahoo.com wrote:

>> If I use ExternalFileField, I cannot sort at query time.
>
> Sort by function query is now doable.
> http://wiki.apache.org/solr/FunctionQuery#Sort_By_Function
Re: about partial index update
Hi,

In that case I think reindexing the whole document is probably easier, right? Writing to a text file looks like more work, and the performance is probably the same.

Actually I was looking for a feature like IndexTank's, which can update dynamic values without sacrificing performance.
http://www.quora.com/Indextank-Vs-Websolr

Reddit is one of the users of IndexTank. So I am wondering whether Solr has any feature like this? (Based on my research, it seems only IndexTank has it.)

regards,
kiwi

On Fri, Dec 23, 2011 at 11:10 PM, Ahmet Arslan iori...@yahoo.com wrote:

>> In my case the vote keeps being updated in the database, but ExternalFileField seems to take its values from a file. So if I use ExternalFileField, is there any way to read the value from the index instead?
>
> You could read the values from the DB and dump them into a text file periodically. I have not used it myself, but you may find this post relevant.
> http://sujitpal.blogspot.com/2011/05/custom-sorting-in-solr-using-external.html
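
The periodic dump suggested in the quoted reply could look roughly like the sketch below. It assumes the usual ExternalFileField layout (a file named external_<fieldname> in the index data directory, one key=value line per document); the function name and the votes dictionary are made up for this example, and reading the votes from the database is left out.

```python
import os
import tempfile


def write_external_file(data_dir, field_name, votes):
    """Write votes ({document key: number}) as key=value lines.

    The file is written to a temporary name first and then swapped in,
    so a reader never sees a half-written file.
    """
    target = os.path.join(data_dir, "external_" + field_name)
    # Hypothetical temp prefix, chosen so it cannot be mistaken for the real file.
    fd, tmp_path = tempfile.mkstemp(dir=data_dir, prefix=".tmp_")
    with os.fdopen(fd, "w", encoding="utf-8") as out:
        for key in sorted(votes):
            out.write(f"{key}={float(votes[key])}\n")
    os.replace(tmp_path, target)
    return target
```

Calling write_external_file(data_dir, "vote", {"1": 23, "2": 5}) from a scheduled job would leave a file external_vote containing 1=23.0 and 2=5.0. As far as I know the new values are only picked up when a new searcher is opened, so a commit would still be needed afterwards.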
about partial index update
Hi all,

I have an object like this:

public class Link {
    private long id;
    private String url;
    // 20 other properties
    private int vote; // vote total, used for sorting
}

When I index a document, my Lucene document contains all the fields from my Link object, e.g.

doc_id = 1
url = solr.org
vote = 23

Because the vote changes more frequently than the other properties, every time the vote increases I need to reindex the whole document so that I can use the vote field for sorting.

Is there any way to index just part of a Lucene document (in this case, only the vote field) instead of indexing the whole document again? Is there any performance concern if I index the whole doc? (I don't think it looks like a good solution.)

Any ideas?

kiwi
solr DIH for mongodb
Hi,

Is there any DIH plugin for MongoDB?

regards,
kiwi
Re: solr DIH for mongodb
Wow, this search engine is powerful! Too bad that after looking through it I still have no solution. It seems I need to get my hands dirty and make one :)

kiwi

On Tue, Sep 27, 2011 at 12:08 PM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote:

> Hi,
>
> Here is a 1 month old thread I found on search-lucene. I didn't even have to do a search, I got it as a suggestion from AutoComplete when I started typing the word mongodb :)
>
> http://search-lucene.com/m/8AEE31AaTd32
>
> Otis
> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> Lucene ecosystem search :: http://search-lucene.com/
>
>> From: Kiwi de coder kiwio...@gmail.com
>> To: solr-user@lucene.apache.org
>> Sent: Monday, September 26, 2011 11:58 AM
>> Subject: solr DIH for mongodb
>>
>> Hi,
>>
>> Is there any DIH plugin for MongoDB?
>>
>> regards,
>> kiwi
Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client
Maybe set it on the Tomcat connector?

<Connector ... URIEncoding="UTF-8"/>

kiwi

On Mon, Sep 19, 2011 at 12:56 AM, Xue-Feng Yang just4l...@yahoo.com wrote:

> The first returned link for Firefox 6 is the same as the returned link for IE 9; I misread it. So the encoding in question is this one. Does anyone know what it is?
>
>> From: Xue-Feng Yang just4l...@yahoo.com
>> To: solr-user@lucene.apache.org
>> Sent: Sunday, September 18, 2011 12:48:31 PM
>> Subject: Tomcat unicode problems with IE 9, Firefox 6 and solrj client
>>
>> Now I have moved to Tomcat 6 for another round of testing. All English cores are OK for IE 9, Firefox 6 and the solrj client. When I tested with Chinese cores, IE 9 was OK whatever I tried. Firefox 6 is a little different:
>>
>> 1) If I set Query String to content:耶和华 without quotation marks on the page http://localhost:18080/solr/cuv/admin/, the response was
>>
>> <response>
>>   <lst name="responseHeader">
>>     <int name="status">0</int>
>>     <int name="QTime">1</int>
>>     <lst name="params">
>>       <str name="indent">on</str>
>>       <str name="start">0</str>
>>       <str name="q">content:耶和华</str>
>>       <str name="rows">10</str>
>>       <str name="version">2.2</str>
>>     </lst>
>>   </lst>
>>   <result name="response" numFound="5908" start="0">
>>     <doc><str name="id">申命记 2:2</str></doc>
>>     <doc><str name="id">出埃及记 13:1</str></doc>
>>     <doc><str name="id">出埃及记 14:1</str></doc>
>>     <doc><str name="id">出埃及记 16:11</str></doc>
>>     <doc><str name="id">出埃及记 25:1</str></doc>
>>     <doc><str name="id">出埃及记 30:11</str></doc>
>>     <doc><str name="id">出埃及记 30:17</str></doc>
>>     <doc><str name="id">出埃及记 30:22</str></doc>
>>     <doc><str name="id">出埃及记 31:1</str></doc>
>>     <doc><str name="id">出埃及记 31:12</str></doc>
>>   </result>
>> </response>
>>
>> The link went to
>> http://localhost:18080/solr/cuv/select/?q=content%3A%E8%80%B6%E5%92%8C%E5%8D%8E&version=2.2&start=0&rows=10&indent=on
>> and %E8%80%B6%E5%92%8C%E5%8D%8E is displayed as 耶和华.
>>
>> 2) If I refreshed, it stayed the same.
>>
>> 3) If I highlighted the link and pressed the Enter key, the returned link was
>> http://localhost:18080/solr/cuv/select/?q=content%3A%D2%AE%BA%CD%BB%AA&version=2.2&start=0&rows=10&indent=on
>> which was different from the link above. The returned page was
>>
>> <response>
>>   <lst name="responseHeader">
>>     <int name="status">0</int>
>>     <int name="QTime">1</int>
>>     <lst name="params">
>>       <str name="indent">on</str>
>>       <str name="start">0</str>
>>       <str name="q">content:Ү�ͻ�</str>
>>       <str name="rows">10</str>
>>       <str name="version">2.2</str>
>>     </lst>
>>   </lst>
>>   <result name="response" numFound="0" start="0"/>
>> </response>
>>
>> Actually, in IE 9, the returned link is always
>> http://localhost:18080/solr/cuv/select/?q=content%3A%E8%80%B6%E5%92%8C%E5%8D%8E&version=2.2&start=0&rows=10&indent=on
>> which is different from both of the Firefox 6 links.
>>
>> With the solrj client I tried many ways of encoding, but all of them returned no results. What is the right way to encode?
>>
>> Any help is welcome.
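
The two links in the quoted message differ only in the bytes used for the query term, and the difference can be reproduced outside the browser. The sketch below is plain Python with made-up helper names. It assumes the second link is the GBK encoding of the same term, and it shows that reading those bytes as UTF-8 gives the garbled q value from the zero-hit response.

```python
from urllib.parse import quote, unquote_to_bytes

QUERY_TERM = "耶和华"


def percent_encode(text, charset):
    """Percent-encode text after converting it to bytes in the given charset."""
    return quote(text, safe="", encoding=charset)


def decode_as_utf8(percent_encoded):
    """Decode a percent-encoded string the way a UTF-8 server would read it.

    Bytes that are not valid UTF-8 become U+FFFD replacement characters.
    """
    return unquote_to_bytes(percent_encoded).decode("utf-8", errors="replace")


UTF8_FORM = percent_encode(QUERY_TERM, "utf-8")  # matches the working link
GBK_FORM = percent_encode(QUERY_TERM, "gbk")     # matches the failing link
```

Here UTF8_FORM is %E8%80%B6%E5%92%8C%E5%8D%8E and GBK_FORM is %D2%AE%BA%CD%BB%AA, the two strings seen in the thread, and decode_as_utf8(GBK_FORM) gives the same broken text as the q parameter in the second response. If this reading is right, a client should percent-encode the query as UTF-8 and the container should decode the URI as UTF-8, which is what the URIEncoding setting is for.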
location of solr folder when deploy to servlet container
Hi,

How do I point the solr folder to a specific directory when deploying to a servlet container?

regards,
kiwi
about standard analyzer
Hi,

I am using Solr 3.3, and schema.xml contains this:

<fieldType name="text_standard" class="solr.TextField">
  <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
</fieldType>

I use this sentence as an example:

XYZ Corporation - x...@example.com

However, when I try it on /analysis.jsp, it shows a different result compared to using Lucene. Using Solr I got the result below with both text_standard and text_general (are they the same?):

XYZ Corporation xyz example.com
about StandardAnalyzer in solr
Hi,

I am using Solr 3.3, and schema.xml contains this:

<fieldType name="text_standard" class="solr.TextField">
  <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
</fieldType>

I use this sentence as an example:

XYZ Corporation - x...@example.com

However, when I try it on /analysis.jsp, it shows a different result compared to using Lucene. Using Solr I got the result below with both text_standard and text_general (are they the same?):

XYZ Corporation xyz example.com
(all of type ALPHANUM)

When using Lucene, I got this:

StandardAnalyzer:
1: [xyz:0-4:COMPANY]
2: [corporation:5-16:ALPHANUM]
3: [x...@example.com:19-34:EMAIL]

So my question is, how do I make Solr analyze it the way Lucene does?

regards,
kiwi
Re: about standard analyzer
Hi,

Sorry, I accidentally sent out this incomplete mail. I have sent another one, so please ignore this one.

Thanks :)
kiwi

On Wed, Jul 13, 2011 at 10:52 PM, Erick Erickson erickerick...@gmail.com wrote:

> You're probably seeing the effects of a number of the filters that Solr is applying (see the fieldType definition). In particular, this looks like the result of WordDelimiterFilterFactory.
>
> If you click the verbose box on the analysis page, you should see the results of each step in the analysis chain.
>
> Best
> Erick
>
> On Wed, Jul 13, 2011 at 10:36 AM, Kiwi de coder kiwio...@gmail.com wrote:
>
>> Hi,
>>
>> I am using Solr 3.3, and schema.xml contains this:
>>
>> <fieldType name="text_standard" class="solr.TextField">
>>   <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
>> </fieldType>
>>
>> I use this sentence as an example:
>>
>> XYZ Corporation - x...@example.com
>>
>> However, when I try it on /analysis.jsp, it shows a different result compared to using Lucene. Using Solr I got the result below with both text_standard and text_general (are they the same?):
>>
>> XYZ Corporation xyz example.com
Re: about StandardAnalyzer in solr
OK, it works now! Thanks :)

kiwi

On Wed, Jul 13, 2011 at 11:06 PM, Edoardo Tosca e.to...@sourcesense.com wrote:

> Try changing from StandardTokenizerFactory to ClassicTokenizerFactory, or create your own fieldType:
>
> <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
>   <analyzer type="index">
>     <tokenizer class="solr.ClassicTokenizerFactory"/>
>     ...
> </fieldType>
>
> Edo
>
> On Wed, Jul 13, 2011 at 3:40 PM, Kiwi de coder kiwio...@gmail.com wrote:
>
>> Hi,
>>
>> I am using Solr 3.3, and schema.xml contains this:
>>
>> <fieldType name="text_standard" class="solr.TextField">
>>   <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
>> </fieldType>
>>
>> I use this sentence as an example:
>>
>> XYZ Corporation - x...@example.com
>>
>> However, when I try it on /analysis.jsp, it shows a different result compared to using Lucene. Using Solr I got the result below with both text_standard and text_general (are they the same?):
>>
>> XYZ Corporation xyz example.com
>> (all of type ALPHANUM)
>>
>> When using Lucene, I got this:
>>
>> StandardAnalyzer:
>> 1: [xyz:0-4:COMPANY]
>> 2: [corporation:5-16:ALPHANUM]
>> 3: [x...@example.com:19-34:EMAIL]
>>
>> So my question is, how do I make Solr analyze it the way Lucene does?
>>
>> regards,
>> kiwi
>
> --
> Edoardo Tosca
> Sourcesense - making sense of Open Source: http://www.sourcesense.com
@field for child object
Hi,

I am wondering whether the solrj @Field annotation supports embedded child objects, e.g.

class A {
    @Field
    String somefield;

    @Embedded // hypothetical annotation for the child object
    B b;
}

regards,
kiwi
Re: Using jetty's GzipFilter in the example solr.war
Try putting your filter at the top of web.xml (instead of the middle or bottom). I tried this a few days ago and it turned out to be the simple solution (I am not sure whether putting it at the top is required by the spec or whether it is a bug).

Hope this helps.

kiwi

On Sat, Nov 13, 2010 at 8:11 PM, Jay Luker lb...@reallywow.com wrote:

> Hi,
>
> I thought I'd try turning on gzip compression but I can't seem to get jetty's GzipFilter to actually compress my responses. I unpacked the example solr.war and tried adding variations of the following to the web.xml (and then rejar-ed), but as far as I can tell, jetty isn't actually compressing anything.
>
> <filter>
>   <filter-name>GZipFilter</filter-name>
>   <display-name>Jetty's GZip Filter</display-name>
>   <description>Filter that zips all the content on-the-fly</description>
>   <filter-class>org.mortbay.servlet.GzipFilter</filter-class>
>   <init-param>
>     <param-name>mimeTypes</param-name>
>     <param-value>*</param-value>
>   </init-param>
> </filter>
>
> <filter-mapping>
>   <filter-name>GZipFilter</filter-name>
>   <url-pattern>*</url-pattern>
> </filter-mapping>
>
> I've also tried explicitly listing mime-types and assigning the filter-mapping using servlet-name. I can see that the GzipFilter is being loaded when I add -DDEBUG to the jetty startup command. But as far as I can tell from looking at the response headers nothing is being gzipped. I'm expecting to see Content-Encoding: gzip in the response headers.
>
> Anyone successfully gotten this to work?
>
> Thanks,
> --jay
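
When checking for Content-Encoding: gzip as described in the quoted message, it is worth making sure the test client actually asks for it, since a gzip filter normally compresses only when the request carries Accept-Encoding: gzip. A small Python sketch of such a check follows; the function names are made up for this example and any URL passed to check is an assumption about the local setup.

```python
import urllib.request


def build_gzip_request(url):
    """Build a GET request that tells the server the client accepts gzip."""
    return urllib.request.Request(url, headers={"Accept-Encoding": "gzip"})


def is_gzipped(headers):
    """Return True if the response headers declare a gzip content encoding."""
    return "gzip" in (headers.get("Content-Encoding") or "").lower()


def check(url):
    """Fetch url and report whether the response came back gzip-encoded."""
    with urllib.request.urlopen(build_gzip_request(url)) as response:
        return is_gzipped(response.headers)
```

Running check against a select URL of the running example server should return True once the filter is active. If it stays False even with the header set, the filter order in web.xml, as suggested in the reply above, is a reasonable next thing to look at.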
Re: Design questions/Schema Help
I think the search logs will require a lot of storage, which may make the index unreasonably large if they are stored in Solr. The aggregation results may also not really fit the Lucene index structure. :)

kiwi
happy hacking !

On Tue, Jul 27, 2010 at 7:47 AM, Tommy Chheng tommy.chh...@gmail.com wrote:

> Alternatively, have you considered storing (or I should say indexing) the search logs with Solr?
>
> This lets you text search across your search queries. You can perform time range queries with Solr as well.
>
> @tommychheng
> Programmer and UC Irvine Graduate Student
> Find a great grad school based on research interests: http://gradschoolnow.com
>
> On 7/26/10 4:43 PM, Mark wrote:
>
>> We are thinking about using Cassandra to store our search logs. Can someone point me in the right direction/lend some guidance on design? I am new to Cassandra and I am having trouble wrapping my head around some of these new concepts. My brain keeps wanting to go back to an RDBMS design.
>>
>> We will be storing the user query, # of hits returned and their session id. We would like to be able to answer the following questions.
>>
>> - What are the n most popular queries and their counts within the last x (mins/hours/days/etc)? Basically the most popular searches within a given time range.
>> - What is the most popular query within the last x where hits = 0? Same as above but with an extra where clause.
>> - For session id x, give me all their other queries.
>> - What are all the session ids that searched for 'foos'?
>>
>> We accomplish the above functionality w/ MySQL using 2 tables. One for the raw search log information and the other to keep the aggregate/running counts of queries.
>>
>> Would this sort of ad-hoc querying be better implemented using Hadoop + Hive? If so, should I be storing all this information in Cassandra then using Hadoop to retrieve it?
>>
>> Thanks for your suggestions
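
Whatever store is chosen, the four questions in the quoted message are small enough to prototype in memory first. A rough Python sketch follows. It assumes each log record is a (timestamp, session id, query, hits) tuple; that layout and the function names are illustrative and not from the thread.

```python
from collections import Counter
from datetime import datetime, timedelta


def top_queries(log, now, window, n, zero_hits_only=False):
    """Return the n most frequent queries in the last `window` as (query, count) pairs."""
    cutoff = now - window
    counts = Counter(
        query
        for timestamp, _session, query, hits in log
        if timestamp >= cutoff and (not zero_hits_only or hits == 0)
    )
    return counts.most_common(n)


def queries_for_session(log, session_id):
    """Return every query issued by one session, in log order."""
    return [query for _ts, session, query, _hits in log if session == session_id]


def sessions_for_query(log, query):
    """Return the sorted, de-duplicated session ids that searched for `query`."""
    return sorted({session for _ts, session, q, _hits in log if q == query})
```

For example, top_queries(log, datetime.now(), timedelta(hours=1), 10, zero_hits_only=True) answers the second question. The same filters and group-by counts are what a raw-log table plus an aggregate table, or a Hive query, would have to express.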
Re: SOLR or Hibernate Search?
Hi,

Hibernate Search only works with Hibernate, while Solr can be used by systems other than Hibernate (loose coupling). Currently Solr still does not support indexing complex POJOs the way Hibernate Search does.

1) I think one way you can do it is to index in Solr, retrieve the unique ids, and then fetch from the database, e.g. select entity from table where id in (x, y)

2) I have not tested it yet, but I believe Lucene's performance is quite good and it keeps improving (you can add more search servers if you use Solr).

3) Maybe Solr is more suitable in your case.

Hope this helps.

kiwi

2009/12/30 Márcio Paulino mcopaul...@gmail.com

> Hey Everyone!
>
> I was making a comparison of both technologies (SOLR and Hibernate Search) and I see that many things are equal. Could anyone tell me when I should use SOLR and when I should use Hibernate Search?
>
> In my project I will have:
>
> 1. Queries on indexed fields (Strings) and on non-indexed fields (Integer, Float, Date). [In Hibernate Search or in SOLR, I must search the index and then, with the results of that query, search the database (I can't search in both places at the same time).] I will have searches like:
> Give me all Register Where Value 190 And Name Contains = 'JAVA'
>
> 2. My client needs to process a lot of email (20,000 per day) and I must index all fields (excluding sentDate), including attachments, and performance is a requirement of my system.
>
> 3. My application is multi-client, and I need to separate the index by client.
>
> In this scenario, what's the best solution? SOLR or Hibernate Search?
>
> I see SOLR is a dedicated server and has good performance tests. I don't see advantages in using hibernate-search compared with SOLR (except the fact that it integrates with my mapped objects).
>
> Thanks for the help
>
> --
> att,
> **
> Márcio Paulino
> Campo Grande - MS
> MSN / Gtalk: mcopaul...@gmail.com
> ICQ: 155897898
> **
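
Point 1 of the reply (search in Solr, then load the entities by id) could be sketched as below. The Solr call itself is left out: ids is assumed to be the ranked id list a search returned. The table and column names (entity, id, name) are hypothetical, and sqlite3 only stands in for the real database.

```python
import sqlite3


def fetch_by_ids(conn, ids):
    """Load rows for the ids a search returned, keeping the search ranking.

    An IN (...) query returns rows in no particular order, so the rows are
    re-ordered to follow `ids`; ids with no matching row are skipped.
    """
    ids = list(ids)
    if not ids:
        return []
    placeholders = ", ".join("?" for _ in ids)
    rows = conn.execute(
        f"SELECT id, name FROM entity WHERE id IN ({placeholders})", ids
    ).fetchall()
    by_id = {row[0]: row for row in rows}
    return [by_id[i] for i in ids if i in by_id]
```

The re-ordering step is the part that is easy to forget: the database does not know the relevance order, so it has to be restored from the search result. Filters on non-indexed columns (such as the Value condition in the quoted question) could be added to the same SQL statement.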
solr and hibernate integration
Hi,

I have a project that needs to index POJOs and search them from the database. However, the current POJO support is limited to simple field values and still lacks support for complex domain object models such as composite elements, collections, etc.

Hibernate Search has done a great job and is able to index complex POJOs. I am wondering whether someone has written a plug-in that can handle complex POJOs (like what Hibernate Search does for indexing)?

kiwi
--
happy hacking !
Re: solr and hibernate integration
So is there any plan to support this in a future version of Solr? Or is anyone interested in writing one :) As I see it, it could take advantage of Solr features like faceting, spell check, etc.

2009/11/9 Noble Paul നോബിള് नोब्ळ् noble.p...@corp.aol.com

> The point is that the usual complex POJO mapping does not work in Solr. For all the supported cases, SolrJ mapping works well.
>
> To answer your question, I am not aware of anybody making it work w/ hibernate.
>
> On Mon, Nov 9, 2009 at 1:54 PM, Kiwi de coder kiwio...@gmail.com wrote:
>
>> Hi,
>>
>> I have a project that needs to index POJOs and search them from the database. However, the current POJO support is limited to simple field values and still lacks support for complex domain object models such as composite elements, collections, etc.
>>
>> Hibernate Search has done a great job and is able to index complex POJOs. I am wondering whether someone has written a plug-in that can handle complex POJOs (like what Hibernate Search does for indexing)?
>>
>> kiwi
>> --
>> happy hacking !
>
> --
> Noble Paul | Principal Engineer | AOL | http://aol.com