Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!

2010-02-25 Thread Nick Dimiduk
Not that I'm aware of.

2010/2/25 Tim TerlegÄrd tim.terleg...@gmail.com

 2010/2/25 Bradford Stephens bradfordsteph...@gmail.com:
  Thanks for coming, everyone! We had around 25 people. A *huge*
  success, for Seattle. And a big thanks to 10gen for sending Richard.
 
  Can't wait to see you all next month.

 Did anyone record the event?

 /Tim



Re: Seattle Hadoop/Lucene/NoSQL Meetup; Wed Feb 24th, Feat. MongoDB

2010-02-19 Thread Nick Dimiduk
Reminder: this month's Seattle Hadoop Meetup is this Wednesday. Don't forget
to RSVP!

On Tue, Feb 16, 2010 at 6:09 PM, Bradford Stephens 
bradfordsteph...@gmail.com wrote:

 Greetings,

 It's time for another awesome Seattle Hadoop/Lucene/Scalability/NoSQL
 Meetup!

 As always, it's at the University of Washington, Allen Computer
 Science building, Room 303 at 6:45pm. You can find a map here:
 http://www.washington.edu/home/maps/southcentral.html?cse

 Last month, we had a great talk from Steve McPherson of Razorfish on
 their usage of Hadoop. This month, we'll have Richard Kreuter from
 MongoDB talking about, well, MongoDB. As well as assorted discussion
 on the Hadoop ecosystem.

 If you can, please RSVP here (not required, but very nice):
 http://www.meetup.com/Seattle-Hadoop-HBase-NoSQL-Meetup/

 My cell # is 904-415-3009 if you have questions/get lost.

 Cheers,
 Bradford

 --
 http://www.drawntoscalehq.com -- Big Data for all. The Big Data Platform.

 http://www.roadtofailure.com -- The Fringes of Scalability, Social
 Media, and Computer Science



Re: Seattle / PNW Hadoop/Lucene/HBase Meetup, Wed Sep 30th

2009-10-07 Thread Nick Dimiduk
Hey PNW Clouders! I'd really like to chat further with the crew doing
distributed Solr. Give me a ring or shoot me an email, let's do lunch!
-Nick

On Wed, Sep 30, 2009 at 2:10 PM, Nick Dimiduk ndimi...@gmail.com wrote:

 As Bradford is out of town this evening, I will take up the mantel of
 Person-on-Point. Contact me with questions re: tonight's gathering.

 See you tonight!

 -Nick
 614.657.0267


 On Mon, Sep 28, 2009 at 4:33 PM, Bradford Stephens 
 bradfordsteph...@gmail.com wrote:

 Hello everyone!
 Don't forget that the Meetup is THIS Wednesday! I'm looking forward to
 hearing about Hive from the Facebook team ... and there might be a few
 other
 interesting talks as well. Here's the details in the wiki:
 http://wiki.apache.org/hadoop/PNW_Hadoop_%2B_Apache_Cloud_Stack_User_Group

 Cheers,
 Bradford

 On Mon, Sep 14, 2009 at 11:35 AM, Bradford Stephens 
 bradfordsteph...@gmail.com wrote:

  Greetings,
 
  It's time for another Hadoop/Lucene/ApacheCloud  Stack meetup!
  This month it'll be on Wednesday, the 30th, at 6:45 pm.
 
  We should have a few interesting guests this time around -- someone from
  Facebook may be stopping by to talk about Hive :)
 
  We've had great attendance in the past few months, let's keep it up! I'm
  always
  amazed by the things I learn from everyone.
 
  We're back at the University of Washington, Allen Computer Science
  Center (not Computer Engineering)
  Map: http://www.washington.edu/home/maps/?CSE
 
  Room: 303 -or- the Entry level. If there are changes, signs will be
 posted.
 
  More Info:
 
  The meetup is about 2 hours (and there's usually food): we'll have two
  in-depth talks of 15-20
  minutes each, and then several lightning talks of 5 minutes. If no
  one offers, We'll then have discussion and 'social time'.  we'll just
  have general discussion. Let net know if you're interested in speaking
  or attending. We'd like to focus on education, so every presentation
  *needs* to ask some questions at the end. We can talk about these
  after the presentations, and I'll record what we've learned in a wiki
  and share that with the rest of us.
 
  Contact: Bradford Stephens, 904-415-3009, bradfordsteph...@gmail.com
 
  Cheers,
  Bradford
  --
  http://www.roadtofailure.com -- The Fringes of Scalability, Social
  Media, and Computer Science
 



 --
 http://www.roadtofailure.com -- The Fringes of Scalability, Social Media,
 and Computer Science





Re: Seattle / PNW Hadoop/Lucene/HBase Meetup, Wed Sep 30th

2009-09-30 Thread Nick Dimiduk
As Bradford is out of town this evening, I will take up the mantel of
Person-on-Point. Contact me with questions re: tonight's gathering.

See you tonight!

-Nick
614.657.0267

On Mon, Sep 28, 2009 at 4:33 PM, Bradford Stephens 
bradfordsteph...@gmail.com wrote:

 Hello everyone!
 Don't forget that the Meetup is THIS Wednesday! I'm looking forward to
 hearing about Hive from the Facebook team ... and there might be a few
 other
 interesting talks as well. Here's the details in the wiki:
 http://wiki.apache.org/hadoop/PNW_Hadoop_%2B_Apache_Cloud_Stack_User_Group

 Cheers,
 Bradford

 On Mon, Sep 14, 2009 at 11:35 AM, Bradford Stephens 
 bradfordsteph...@gmail.com wrote:

  Greetings,
 
  It's time for another Hadoop/Lucene/ApacheCloud  Stack meetup!
  This month it'll be on Wednesday, the 30th, at 6:45 pm.
 
  We should have a few interesting guests this time around -- someone from
  Facebook may be stopping by to talk about Hive :)
 
  We've had great attendance in the past few months, let's keep it up! I'm
  always
  amazed by the things I learn from everyone.
 
  We're back at the University of Washington, Allen Computer Science
  Center (not Computer Engineering)
  Map: http://www.washington.edu/home/maps/?CSE
 
  Room: 303 -or- the Entry level. If there are changes, signs will be
 posted.
 
  More Info:
 
  The meetup is about 2 hours (and there's usually food): we'll have two
  in-depth talks of 15-20
  minutes each, and then several lightning talks of 5 minutes. If no
  one offers, We'll then have discussion and 'social time'.  we'll just
  have general discussion. Let net know if you're interested in speaking
  or attending. We'd like to focus on education, so every presentation
  *needs* to ask some questions at the end. We can talk about these
  after the presentations, and I'll record what we've learned in a wiki
  and share that with the rest of us.
 
  Contact: Bradford Stephens, 904-415-3009, bradfordsteph...@gmail.com
 
  Cheers,
  Bradford
  --
  http://www.roadtofailure.com -- The Fringes of Scalability, Social
  Media, and Computer Science
 



 --
 http://www.roadtofailure.com -- The Fringes of Scalability, Social Media,
 and Computer Science



Re: Sharded Index Creation Magic?

2009-07-14 Thread Nick Dimiduk
I do, but you raise an interesting point. I had named the field incorrectly.
I'm a little puzzled as to why individual search worked with the broken
field name, but now all is well!

On Tue, Jul 14, 2009 at 12:03 AM, Shalin Shekhar Mangar 
shalinman...@gmail.com wrote:

 On Tue, Jul 14, 2009 at 2:00 AM, Nick Dimiduk ndimi...@gmail.com wrote:

  However, when I search across all
  deployed shards using the shards= query parameter (
 
 
 http://host00:8080/solr/select?shards=host00:8080/solr,host01:8080/solrq=body
  \%3A%3Aterm),
  I get a NullPointerException:
 
  java.lang.NullPointerException
 at
 
 org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:421)
 at
 
 org.apache.solr.handler.component.QueryComponent.handleResponses(QueryComponent.java:265)
 at
 
 org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:264)
 at
 
 org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
 at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204)
 at
 
 org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
 at
 
 org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
 
  Debugging into the QueryComponent.mergeIds() method reveals the instance
  sreq.responses (line 356) contains one response for each shard specified,
  each with the number of results received by the independant queries. The
  problems begin down at line 370 because the SolrDocument instance has
 only
  a
  score field -- which proves problematic in the following line where the
 id
  is requested. The SolrDocument, only containing a score, lacks the
  designated ID field (from my schema) and thus the document cannot be
 added
  to the results queue.
 
  Because the example on the wiki works by loading the documents directly
  into
  Solr for indexing, I have come to the conclusion that there is some extra
  magic happening in this index generation process which my process lacks.
 


 Do you have a uniqueKey defined in your schema.xml?

 --
 Regards,
 Shalin Shekhar Mangar.



Sharded Index Creation Magic?

2009-07-13 Thread Nick Dimiduk
Hello!

I'm working with Solr-1.3.0 using a sharded index for distributed,
aggregated search. I've successfully run through the example described in
the DistributedSearch wiki page. I have built an index from a corpus of some
50mil documents in an HBase table and created 7 shards using the
org.apache.hadoop.hbase.mapred.BuildTableIndex. I can deploy any one of
these shards to a single Solr instance and happily search the index after
tweaking the schema appropriately. However, when I search across all
deployed shards using the shards= query parameter (
http://host00:8080/solr/select?shards=host00:8080/solr,host01:8080/solrq=body\%3A%3Aterm),
I get a NullPointerException:

java.lang.NullPointerException
at 
org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:421)
at 
org.apache.solr.handler.component.QueryComponent.handleResponses(QueryComponent.java:265)
at 
org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:264)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204)
at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)

Debugging into the QueryComponent.mergeIds() method reveals the instance
sreq.responses (line 356) contains one response for each shard specified,
each with the number of results received by the independant queries. The
problems begin down at line 370 because the SolrDocument instance has only a
score field -- which proves problematic in the following line where the id
is requested. The SolrDocument, only containing a score, lacks the
designated ID field (from my schema) and thus the document cannot be added
to the results queue.

Because the example on the wiki works by loading the documents directly into
Solr for indexing, I have come to the conclusion that there is some extra
magic happening in this index generation process which my process lacks.

Thanks for the help!