Re: Search based on images

2010-12-11 Thread Geert-Jan Brits
Well-known algorithms for detecting 'highly descriptive features' in images that can cope with scaling and rotation (up to a certain degree, of course) are SIFT and SURF (SURF is generally considered the more mature of the two, afaik): http://en.wikipedia.org/wiki/Scale-invariant_feature_transform

Re: Multicore and Replication (scripts vs. java, spellchecker)

2010-12-11 Thread Martin Grotzke
On Sat, Dec 11, 2010 at 12:38 AM, Chris Hostetter hossman_luc...@fucit.org wrote: : #SOLR-433 MultiCore and SpellChecker replication [1]. Based on the : status of this feature request I'd assume that the normal procedure of : keeping the spellchecker index up2date would be running a cron job on

Re: Indexing documents with SOLR

2010-12-11 Thread Adam Estrada
Pankaj, Check this article out on how to get going with Nutch. http://bit.ly/dbBdK4 This is a few months old so you will have to note that there is a new parameter called something like -SolrUrl that will allow you to update your solr index with the crawled data. For crawling your local file

Re: Shards + dismax - scoring process?

2010-12-11 Thread Yonik Seeley
On Sat, Dec 11, 2010 at 2:18 AM, bbarani bbar...@gmail.com wrote: Also, if I try to sort the query result from shards.. will sorting happens on the consolidated data or on each individual core data? Both - to find the top 10 docs by any sort, the top 10 docs from each shard are collected and
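The merge step described in this reply can be sketched roughly as follows. This is a minimal illustration, not Solr's actual implementation: it assumes each shard returns its own top-N list already sorted by descending score, and the function name `merge_top_n` and the sample documents are hypothetical.

```python
import heapq
from itertools import islice


def merge_top_n(shard_results, n=10):
    """Merge per-shard result lists into one overall top-n list.

    Each element of shard_results is a list of (doc_id, score) tuples
    that the shard has already sorted by descending score.
    """
    # heapq.merge walks the pre-sorted lists lazily; reverse=True tells it
    # the inputs are ordered from largest to smallest key.
    merged = heapq.merge(*shard_results, key=lambda doc: doc[1], reverse=True)
    return list(islice(merged, n))


# Hypothetical per-shard results, each sorted by descending score.
shard_a = [("a1", 9.5), ("a2", 7.0), ("a3", 1.2)]
shard_b = [("b1", 8.1), ("b2", 7.5), ("b3", 0.4)]

top3 = merge_top_n([shard_a, shard_b], n=3)
print(top3)
```

Because every shard contributes its own top N, the overall top N is guaranteed to be among the collected candidates, so only N documents per shard need to travel over the network.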

Re: full text search in multiple fields

2010-12-11 Thread PeterKerk
Ok, I'm back ;) There's one final thing that needs to be fixed. I'm trying to apply the same logic as on cities, but now for the title of a location. There's a location with title: hortus rodondendrus This location is found using this query:

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Adam Estrada
Lance, I found that you can have a single config file that can have several entities in it. My question now is how can I add entities without restarting the Solr service? It doesn't really work otherwise but it looks like it should because we call the /dataimport handler after the entire

Re: full text search in multiple fields

2010-12-11 Thread Ahmet Arslan
There's a location with title: hortus rodondendrus This location is found using this query: http://localhost:8983/solr/db/select/?indent=on&q=hortus&defType=dismax&qf=title_search^20.0 But not when using this query:

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Ahmet Arslan
I found that you can have a single config file that can have several entities in it. My question now is how can I add entities without restarting the Solr service? You mean changing and re-loading xml config file? dataimport?command=reload-config
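As a small sketch of how the reload command above might be issued from a script: the base URL `http://localhost:8983/solr` and the helper name `dataimport_url` are assumptions for illustration; only the `/dataimport` handler path and the `reload-config` command come from the thread. The snippet only builds the URL and leaves the actual HTTP request (for example via `urllib.request.urlopen`) to the caller.

```python
from urllib.parse import urlencode


def dataimport_url(base_url, command):
    """Build a DataImportHandler request URL for the given command."""
    # urlencode takes care of escaping the parameter value if needed.
    return f"{base_url}/dataimport?{urlencode({'command': command})}"


# Assumed base URL; adjust host, port, and core path to your installation.
reload_url = dataimport_url("http://localhost:8983/solr", "reload-config")
print(reload_url)
```

After editing the XML config to add an entity, requesting this URL should make the handler pick up the new definition without a restart, as the reply suggests.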

Re: OutOfMemory GC: GC overhead limit exceeded - Why isn't WeakHashMap getting collected?

2010-12-11 Thread Erick Erickson
unfortunately I can't check the statistics page. For some reason the solr webapp itself is only returning a directory listing. This is very weird and makes me wonder if there's something really wonky with your system. I'm assuming when you say the solr webapp itself you're talking about

Re: Separate Lines Like Google

2010-12-11 Thread Erick Erickson
H, your problem isn't real clear. Solr has no built-in way of understanding paragraphs, sentences, chapters, etc., you have to provide that understanding yourself. In general you can use multivalued fields with a suitable increment gap or a special symbol to build in this kind of

Re: command line parameters for solr

2010-12-11 Thread Erick Erickson
In general, it's discouraged to send private e-mails unless invited since the whole point of open source is to make source, solutions, etc available to everyone. See: http://people.apache.org/~hossman/#private_q Best Erick On Sat, Dec 11, 2010 at 1:08 AM, Jack O jack_...@yahoo.com wrote: Tom,

Re: command line parameters for solr

2010-12-11 Thread J O
That's cool, I am just looking to hire someone to do some solr work for me. Please advise what's the best way to reach out to the solr development community for contract help? /j On Dec 11, 2010, at 2:59 PM, Erick Erickson erickerick...@gmail.com wrote: In general, it's discouraged to send private

Re: command line parameters for solr

2010-12-11 Thread Erick Erickson
Here's a good place to find for-hire resources on a project basis: http://wiki.apache.org/solr/Support Or for that matter, giving some idea of what the project is about and inviting private responses isn't necessarily bad. I thought you were asking about *free* private guidance, which is another

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Adam Estrada
You are da man! w00t! adam On Sat, Dec 11, 2010 at 4:48 PM, Ahmet Arslan iori...@yahoo.com wrote: I found that you can have a single config file that can have several entities in it. My question now is how can I add entities without restarting the Solr service? You mean changing and

[Parsing] Date Fields

2010-12-11 Thread Adam Estrada
All, I am ingesting a lot of RSS feeds as part of my application and I keep getting the same error. WARNING: Could not parse a Date field java.text.ParseException: Unparseable date: Mon, 06 Dec 2010 23:31:38 + at java.text.DateFormat.parse(Unknown Source) at

Re: [Parsing] Date Fields

2010-12-11 Thread Erick Erickson
Dates in Solr have a very specific format, see: http://lucene.apache.org/solr/api/org/apache/solr/schema/DateField.html Best Erick On Sat, Dec 11, 2010 at 6:32 PM, Adam Estrada estrada.adam.gro...@gmail.com wrote: All, I am ingesting a lot of RSS feeds as part of my application and I keep
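One way to bridge the gap between RSS-style dates and the format Solr expects is to convert them before indexing. The sketch below is an illustration under stated assumptions, not the DataImportHandler's own mechanism: it assumes the feed dates are RFC 822 style with a numeric offset (the offset in the quoted error message is truncated, so `+0000` in the sample is a guess), and the function name `rss_date_to_solr` is hypothetical. The target is the UTC form `YYYY-MM-DDThh:mm:ssZ`.

```python
from datetime import timezone
from email.utils import parsedate_to_datetime


def rss_date_to_solr(value):
    """Convert an RFC 822 style date string to Solr's UTC date format."""
    # parsedate_to_datetime returns a timezone-aware datetime when the
    # string carries a numeric offset such as +0000 or +0100.
    parsed = parsedate_to_datetime(value)
    # Normalize to UTC and render in the ISO 8601 form with a trailing Z.
    return parsed.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


# Sample value; the "+0000" offset is an assumption.
solr_date = rss_date_to_solr("Mon, 06 Dec 2010 23:31:38 +0000")
print(solr_date)
```

Note that a `-0000` offset yields a naive datetime, which `astimezone` would interpret as local time, so such inputs need separate handling.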

Re: [Parsing] Date Fields

2010-12-11 Thread Lance Norskog
Here's the problem, at the end of the DIH file: <field column="pubdate" xpath="/rss/channel/item/pubDate" dateTimeFormat="-MM-dd'T'hh:mm:ss'Z'" /> </entity> This says parse this timestamp into a Java Date object using this date-time spec. This string uses the UTC timestamp format that

Re: SOLR geospatial

2010-12-11 Thread Dennis Gearon
My partner is using a publicly available plugin for GeoSpatial. It is used both during indexing and during search. It forms some kind of gridding system and puts 10 fields per row related to that. Doing a Radius search (vs a bounding box search which is faster in almost all cases in all
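To illustrate why a bounding-box search tends to be cheaper than a radius search, here is a generic sketch. It is not the plugin's method (the thread does not describe its internals): it assumes a spherical Earth with a radius of 6371 km, ignores the poles and the antimeridian, and all names and sample coordinates are hypothetical. The box test needs only four comparisons per point, while the radius test needs trigonometry, so the box serves as a cheap pre-filter.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def bounding_box(lat, lon, radius_km):
    """Return (min_lat, max_lat, min_lon, max_lon) enclosing the circle.

    Valid away from the poles and the antimeridian only.
    """
    r = radius_km / EARTH_RADIUS_KM  # angular radius in radians
    dlat = math.degrees(r)
    dlon = math.degrees(math.asin(math.sin(r) / math.cos(math.radians(lat))))
    return lat - dlat, lat + dlat, lon - dlon, lon + dlon


def radius_search(points, lat, lon, radius_km):
    """Names of points within radius_km, using the box as a pre-filter."""
    min_lat, max_lat, min_lon, max_lon = bounding_box(lat, lon, radius_km)
    hits = []
    for name, plat, plon in points:
        # Cheap rejection: four comparisons, no trigonometry.
        if not (min_lat <= plat <= max_lat and min_lon <= plon <= max_lon):
            continue
        # Exact check only for points that survived the box test.
        if haversine_km(lat, lon, plat, plon) <= radius_km:
            hits.append(name)
    return hits


# Hypothetical sample points: (name, latitude, longitude).
points = [("Utrecht", 52.09, 5.12), ("Paris", 48.86, 2.35)]
nearby = radius_search(points, 52.37, 4.90, 50.0)
print(nearby)
```

In an index, the box test maps onto simple range queries on latitude and longitude fields, which is one reason it is usually the faster option.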

Re: SOLR geospatial

2010-12-11 Thread Markus Jelsma
That smells like: http://www.jteam.nl/news/spatialsolr.html My partner is using a publicly available plugin for GeoSpatial. It is used both during indexing and during search. It forms some kind of gridding system and puts 10 fields per row related to that. Doing a Radius search (vs a bounding