Re: SOLR geospatial

2010-12-11 Thread Markus Jelsma
That smells like: http://www.jteam.nl/news/spatialsolr.html > My partner is using a publicly available plugin for GeoSpatial. It is used > both during indexing and during search. It forms some kind of gridding > system and puts 10 fields per row related to that. Doing a Radius search > (vs a bound

Re: SOLR geospatial

2010-12-11 Thread Dennis Gearon
My partner is using a publicly available plugin for GeoSpatial. It is used both during indexing and during search. It forms some kind of gridding system and puts 10 fields per row related to that. Doing a Radius search (vs a bounding box search which is faster in almost all cases in all GeoSpat

Re: [Parsing] Date Fields

2010-12-11 Thread Lance Norskog
Here's the problem, at the end of the DIH file: This says "parse this timestamp into a Java Date object using this date-time spec". This string uses the UTC timestamp format that Solr reads. You need to change this date-format string to the format of your incoming timestamps. The JDK

Re: [Parsing] Date Fields

2010-12-11 Thread Erick Erickson
Dates in Solr have a very specific format, see: http://lucene.apache.org/solr/api/org/apache/solr/schema/DateField.html Best Erick On Sat, Dec 11, 2010 at 6:32 PM, Adam Estrada wrote: > All, > > I am ingesting a lot of RSS feeds as part of my application and I keep > getting the same error. > >

[Parsing] Date Fields

2010-12-11 Thread Adam Estrada
All, I am ingesting a lot of RSS feeds as part of my application and I keep getting the same error. WARNING: Could not parse a Date field java.text.ParseException: Unparseable date: "Mon, 06 Dec 2010 23:31:38 +" at java.text.DateFormat.parse(Unknown Source) at org.apache.solr.

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Adam Estrada
You are da man! w00t! adam On Sat, Dec 11, 2010 at 4:48 PM, Ahmet Arslan wrote: > > I found that you can have a single config file that can > > have several > > entities in it. My question now is how can I add entities > > without restarting > > the Solr service? > > You mean changing and re-lo

Re: command line parameters for solr

2010-12-11 Thread Erick Erickson
Here's a good place to find for-hire resources on a project basis: http://wiki.apache.org/solr/Support Or for that matter, giving some idea of what the project is about and inviting private responses isn't necessarily bad. I thought you were asking about *free* private guidance, which is another

Re: command line parameters for solr

2010-12-11 Thread J O
That's cool, I am just looking to hire someone to do some solr work for me. Please advise what's the best way to reach out solr development community for contract help? /j On Dec 11, 2010, at 2:59 PM, Erick Erickson wrote: In general, it's discouraged to send private e-mails unless invited s

Re: command line parameters for solr

2010-12-11 Thread Erick Erickson
In general, it's discouraged to send private e-mails unless invited since the whole point of open source is to make source, solutions, etc available to everyone. See: http://people.apache.org/~hossman/#private_q Best Erick On Sat, Dec 11, 2010 at 1:08 AM, Jack O wrote: > Tom, > > I would like t

Re: Separate Lines Like Google

2010-12-11 Thread Erick Erickson
H, your problem isn't real clear. Solr has no built-in way of understanding paragraphs, sentences, chapters, etc., you have to provide that understanding yourself. In general you can use multivalued fields with a suitable increment gap or a special symbol to build in this kind of understanding.

Re: OutOfMemory GC: GC overhead limit exceeded - Why isn't WeakHashMap getting collected?

2010-12-11 Thread Erick Erickson
"unfortunately I can't check the statistics page. For some reason the solr webapp itself is only returning a directory listing." This is very weird and makes me wonder if there's something really wonky with your system. I'm assuming when you say "the solr webapp itself" you're taking about ...loc

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Ahmet Arslan
> I found that you can have a single config file that can > have several > entities in it. My question now is how can I add entities > without restarting > the Solr service? You mean changing and re-loading xml config file? dataimport?command=reload-config http://wiki.apache.org/solr/DataImportH

Re: full text search in multiple fields

2010-12-11 Thread Ahmet Arslan
> There's a location with title: hortus rodondendrus > > This location is found using this query: > http://localhost:8983/solr/db/select/?indent=on&q=hortus&defType=dismax&qf=title_search^20.0 > But not when using this query: > http://localhost:8983/solr/db/select/?indent=on&q=hort&defType=dismax&

Re: [Multiple] RSS Feeds at a time...

2010-12-11 Thread Adam Estrada
Lance, I found that you can have a single config file that can have several entities in it. My question now is how can I add entities without restarting the Solr service? It doesn't really work otherwise but it looks like it should becasue we call the /dataimport handler after the entire applicati

Re: full text search in multiple fields

2010-12-11 Thread PeterKerk
Ok, Im back ;) There's one final thing that needs to be fixed.. Im trying to apply the same logic as on cities, but now for the title of a location. There's a location with title: hortus rodondendrus This location is found using this query: http://localhost:8983/solr/db/select/?indent=on&q=hor

Re: Shards + dismax - scoring process?

2010-12-11 Thread Yonik Seeley
On Sat, Dec 11, 2010 at 2:18 AM, bbarani wrote: > Also, if I try to sort the query result from shards.. will sorting happens > on the consolidated data or on each individual core data? Both - to find the top 10 docs by any sort, the top 10 docs from each shard are collected and then sorted to fin

Re: Indexing documents with SOLR

2010-12-11 Thread Adam Estrada
Pankaj, Check this article out on how to get going with Nutch. http://bit.ly/dbBdK4This is a few months old so you will have to note that there is a new parameter called something like -SolrUrl that will allow you to update your solr index with the crawled data. For crawling your local file syste

Re: Multicore and Replication (scripts vs. java, spellchecker)

2010-12-11 Thread Martin Grotzke
On Sat, Dec 11, 2010 at 12:38 AM, Chris Hostetter wrote: > > : #SOLR-433 "MultiCore and SpellChecker replication" [1]. Based on the > : status of this feature request I'd asume that the normal procedure of > : keeping the spellchecker index up2date would be running a cron job on > : each node/slav

Re: Search based on images

2010-12-11 Thread Geert-Jan Brits
Well-known algorithms for detecting 'highly descriptive features' in images that can cope with scaling and rotation (up to a certain degree of course) are SIFT and SURF (SURF is generally considered the more mature of the two afaik) http://en.wikipedia.org/wiki/Scale-invariant_feature_transform h