Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!
Not that I'm aware of. 2010/2/25 Tim TerlegÄrd tim.terleg...@gmail.com 2010/2/25 Bradford Stephens bradfordsteph...@gmail.com: Thanks for coming, everyone! We had around 25 people. A *huge* success, for Seattle. And a big thanks to 10gen for sending Richard. Can't wait to see you all next month. Did anyone record the event? /Tim
Re: Seattle Hadoop/Lucene/NoSQL Meetup; Wed Feb 24th, Feat. MongoDB
Reminder: this month's Seattle Hadoop Meetup is this Wednesday. Don't forget to RSVP! On Tue, Feb 16, 2010 at 6:09 PM, Bradford Stephens bradfordsteph...@gmail.com wrote: Greetings, It's time for another awesome Seattle Hadoop/Lucene/Scalability/NoSQL Meetup! As always, it's at the University of Washington, Allen Computer Science building, Room 303 at 6:45pm. You can find a map here: http://www.washington.edu/home/maps/southcentral.html?cse Last month, we had a great talk from Steve McPherson of Razorfish on their usage of Hadoop. This month, we'll have Richard Kreuter from MongoDB talking about, well, MongoDB. As well as assorted discussion on the Hadoop ecosystem. If you can, please RSVP here (not required, but very nice): http://www.meetup.com/Seattle-Hadoop-HBase-NoSQL-Meetup/ My cell # is 904-415-3009 if you have questions/get lost. Cheers, Bradford -- http://www.drawntoscalehq.com -- Big Data for all. The Big Data Platform. http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science
Re: Seattle / PNW Hadoop/Lucene/HBase Meetup, Wed Sep 30th
Hey PNW Clouders! I'd really like to chat further with the crew doing distributed Solr. Give me a ring or shoot me an email, let's do lunch! -Nick On Wed, Sep 30, 2009 at 2:10 PM, Nick Dimiduk ndimi...@gmail.com wrote: As Bradford is out of town this evening, I will take up the mantel of Person-on-Point. Contact me with questions re: tonight's gathering. See you tonight! -Nick 614.657.0267 On Mon, Sep 28, 2009 at 4:33 PM, Bradford Stephens bradfordsteph...@gmail.com wrote: Hello everyone! Don't forget that the Meetup is THIS Wednesday! I'm looking forward to hearing about Hive from the Facebook team ... and there might be a few other interesting talks as well. Here's the details in the wiki: http://wiki.apache.org/hadoop/PNW_Hadoop_%2B_Apache_Cloud_Stack_User_Group Cheers, Bradford On Mon, Sep 14, 2009 at 11:35 AM, Bradford Stephens bradfordsteph...@gmail.com wrote: Greetings, It's time for another Hadoop/Lucene/ApacheCloud Stack meetup! This month it'll be on Wednesday, the 30th, at 6:45 pm. We should have a few interesting guests this time around -- someone from Facebook may be stopping by to talk about Hive :) We've had great attendance in the past few months, let's keep it up! I'm always amazed by the things I learn from everyone. We're back at the University of Washington, Allen Computer Science Center (not Computer Engineering) Map: http://www.washington.edu/home/maps/?CSE Room: 303 -or- the Entry level. If there are changes, signs will be posted. More Info: The meetup is about 2 hours (and there's usually food): we'll have two in-depth talks of 15-20 minutes each, and then several lightning talks of 5 minutes. If no one offers, We'll then have discussion and 'social time'. we'll just have general discussion. Let net know if you're interested in speaking or attending. We'd like to focus on education, so every presentation *needs* to ask some questions at the end. We can talk about these after the presentations, and I'll record what we've learned in a wiki and share that with the rest of us. Contact: Bradford Stephens, 904-415-3009, bradfordsteph...@gmail.com Cheers, Bradford -- http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science -- http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science
Re: Seattle / PNW Hadoop/Lucene/HBase Meetup, Wed Sep 30th
As Bradford is out of town this evening, I will take up the mantel of Person-on-Point. Contact me with questions re: tonight's gathering. See you tonight! -Nick 614.657.0267 On Mon, Sep 28, 2009 at 4:33 PM, Bradford Stephens bradfordsteph...@gmail.com wrote: Hello everyone! Don't forget that the Meetup is THIS Wednesday! I'm looking forward to hearing about Hive from the Facebook team ... and there might be a few other interesting talks as well. Here's the details in the wiki: http://wiki.apache.org/hadoop/PNW_Hadoop_%2B_Apache_Cloud_Stack_User_Group Cheers, Bradford On Mon, Sep 14, 2009 at 11:35 AM, Bradford Stephens bradfordsteph...@gmail.com wrote: Greetings, It's time for another Hadoop/Lucene/ApacheCloud Stack meetup! This month it'll be on Wednesday, the 30th, at 6:45 pm. We should have a few interesting guests this time around -- someone from Facebook may be stopping by to talk about Hive :) We've had great attendance in the past few months, let's keep it up! I'm always amazed by the things I learn from everyone. We're back at the University of Washington, Allen Computer Science Center (not Computer Engineering) Map: http://www.washington.edu/home/maps/?CSE Room: 303 -or- the Entry level. If there are changes, signs will be posted. More Info: The meetup is about 2 hours (and there's usually food): we'll have two in-depth talks of 15-20 minutes each, and then several lightning talks of 5 minutes. If no one offers, We'll then have discussion and 'social time'. we'll just have general discussion. Let net know if you're interested in speaking or attending. We'd like to focus on education, so every presentation *needs* to ask some questions at the end. We can talk about these after the presentations, and I'll record what we've learned in a wiki and share that with the rest of us. Contact: Bradford Stephens, 904-415-3009, bradfordsteph...@gmail.com Cheers, Bradford -- http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science -- http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science
Re: Sharded Index Creation Magic?
I do, but you raise an interesting point. I had named the field incorrectly. I'm a little puzzled as to why individual search worked with the broken field name, but now all is well! On Tue, Jul 14, 2009 at 12:03 AM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Tue, Jul 14, 2009 at 2:00 AM, Nick Dimiduk ndimi...@gmail.com wrote: However, when I search across all deployed shards using the shards= query parameter ( http://host00:8080/solr/select?shards=host00:8080/solr,host01:8080/solrq=body \%3A%3Aterm), I get a NullPointerException: java.lang.NullPointerException at org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:421) at org.apache.solr.handler.component.QueryComponent.handleResponses(QueryComponent.java:265) at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:264) at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131) at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204) at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232) Debugging into the QueryComponent.mergeIds() method reveals the instance sreq.responses (line 356) contains one response for each shard specified, each with the number of results received by the independant queries. The problems begin down at line 370 because the SolrDocument instance has only a score field -- which proves problematic in the following line where the id is requested. The SolrDocument, only containing a score, lacks the designated ID field (from my schema) and thus the document cannot be added to the results queue. Because the example on the wiki works by loading the documents directly into Solr for indexing, I have come to the conclusion that there is some extra magic happening in this index generation process which my process lacks. Do you have a uniqueKey defined in your schema.xml? -- Regards, Shalin Shekhar Mangar.
Sharded Index Creation Magic?
Hello! I'm working with Solr-1.3.0 using a sharded index for distributed, aggregated search. I've successfully run through the example described in the DistributedSearch wiki page. I have built an index from a corpus of some 50mil documents in an HBase table and created 7 shards using the org.apache.hadoop.hbase.mapred.BuildTableIndex. I can deploy any one of these shards to a single Solr instance and happily search the index after tweaking the schema appropriately. However, when I search across all deployed shards using the shards= query parameter ( http://host00:8080/solr/select?shards=host00:8080/solr,host01:8080/solrq=body\%3A%3Aterm), I get a NullPointerException: java.lang.NullPointerException at org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:421) at org.apache.solr.handler.component.QueryComponent.handleResponses(QueryComponent.java:265) at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:264) at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131) at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204) at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232) Debugging into the QueryComponent.mergeIds() method reveals the instance sreq.responses (line 356) contains one response for each shard specified, each with the number of results received by the independant queries. The problems begin down at line 370 because the SolrDocument instance has only a score field -- which proves problematic in the following line where the id is requested. The SolrDocument, only containing a score, lacks the designated ID field (from my schema) and thus the document cannot be added to the results queue. Because the example on the wiki works by loading the documents directly into Solr for indexing, I have come to the conclusion that there is some extra magic happening in this index generation process which my process lacks. Thanks for the help!