Re: Informal Mahout MeetUp at ApacheCon Friday

2009-11-05 Thread Isabel Drost
On Friday 06 November 2009 04:27:03 Ted Dunning wrote:
> Pacific Coast Brewery is just down the street.  I am already meeting some
> folks there at about 5 (halfway related to Mahout, but only halfway).

+1

Isabel

-- 
  |\  _,,,---,,_   Web:   
  /,`.-'`'-.  ;-;;,_  
 |,4-  ) )-,_..;\ (  `'-' 
'---''(_/--'  `-'\_) (fL)  IM:  



signature.asc
Description: This is a digitally signed message part.


Re: Informal Mahout MeetUp at ApacheCon Friday

2009-11-05 Thread Ted Dunning
I will stand the group for the first round of beers.

On Thu, Nov 5, 2009 at 7:31 PM, Jake Mannix  wrote:

> On Thu, Nov 5, 2009 at 7:27 PM, Ted Dunning  wrote:
>
> > Pacific Coast Brewery is just down the street.  I am already meeting some
> > folks there at about 5 (halfway related to Mahout, but only halfway).
> >
>
> Did I hear "brewery"?
>
> +1
>
>  -jake
>



-- 
Ted Dunning, CTO
DeepDyve


Re: Informal Mahout MeetUp at ApacheCon Friday

2009-11-05 Thread Jake Mannix
On Thu, Nov 5, 2009 at 7:27 PM, Ted Dunning  wrote:

> Pacific Coast Brewery is just down the street.  I am already meeting some
> folks there at about 5 (halfway related to Mahout, but only halfway).
>

Did I hear "brewery"?

+1

  -jake


Re: Informal Mahout MeetUp at ApacheCon Friday

2009-11-05 Thread Ted Dunning
Pacific Coast Brewery is just down the street.  I am already meeting some
folks there at about 5 (halfway related to Mahout, but only halfway).

How does that sound as a meeting place?  Massimo is enthused about meeting
more Lucene and Mahout people on this trip so it would be nice if we could
have both kinds of meeting happen at the same time.

On Thu, Nov 5, 2009 at 2:55 PM, Jake Mannix  wrote:

> So 16:00-ish?  I can do that, but I'm probably going to want to eat around
> then or a little later - a day of talks is pretty
> hunger-inducing.  We should at least plan some place to either be, or a
> place to at least leave a big note telling latecomers
> where we are.  We don't want to lose any folk who miss the last couple of
> talks, as well as the people who may be *only*
> coming for meetups, and not be paying for the 'con.
>
>  -jake
>
> On Thu, Nov 5, 2009 at 2:39 PM, Grant Ingersoll 
> wrote:
>
> > How about Friday after the last Lucene talk?  We can probably just grab
> > some tables in the big room or find an empty room.
> >
> > -Grant
> >
> >
> > On Nov 4, 2009, at 12:42 AM, Jake Mannix wrote:
> >
> >  So let's meet up (two words, no capitals) Friday night, because the
> >> official
> >> MeetUps tend to run long on already long days, and so Isabel (and Ted!)
> >> can
> >> promote it in their Friday talks?
> >>
> >> Do we want to just stake out some bar space on the 2nd floor?  What time
> >> makes sense on friday?  Ted, are you able to hang out friday night?
> >>
> >>  -jake
> >>
> >> On Mon, Nov 2, 2009 at 6:47 PM, Isabel Drost  wrote:
> >>
> >>  On Monday 02 November 2009 23:22:18 Jake Mannix wrote:
> >>>
>  Quick roll-call: who's already in Oakland or are planning to be there
> 
> >>> this
> >>>
>  week for ApacheCon?
> 
> >>>
> >>> Arrived this afternoon.
> >>>
> >>>
> >>>  In particular:
> 
>   * ) how many of us will be at Lucene MeetUp tuesday night?
> 
>  - Jake Mannix
>  - Grant Ingersoll
>  - Isabel Drost
> 
>   * ) how many of us will be at the Hadoop MeetUp thursday night?
> 
>  - Jake Mannix
>  - Isabel Drost
> 
> >>>
> >>>
> >>> Isabel
> >>>
> >>> --
> >>>  |\  _,,,---,,_   Web:   
> >>> /,`.-'`'-.  ;-;;,_
> >>> |,4-  ) )-,_..;\ (  `'-'
> >>> '---''(_/--'  `-'\_) (fL)  IM:  
> >>>
> >>>
> >>>
> > --
> > Grant Ingersoll
> > http://www.lucidimagination.com/
> >
> > Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using
> > Solr/Lucene:
> > http://www.lucidimagination.com/search
> >
> >
>



-- 
Ted Dunning, CTO
DeepDyve


Re: Low priority: analytics for lucene.apache.org/mahout?

2009-11-05 Thread Grant Ingersoll




On Nov 5, 2009, at 3:29 AM, Sean Owen wrote:


I'm a stats junkie and like knowing about where people find out about
my project, what they look for when they arrive, etc. I had always
instrumented my project site with Google Analytics to track this -- in
particular, which pages of javadoc were getting looked at.

Does anyone think this is useful for lucene.apache.org/mahout? is
there any policy against it?


We just need a privacy policy if we do this.  See Apache JackRabbit  
for a template.  FWIW, others have brought this up for Lucene in  
general (see general@)




I could easily set it up and share access to anyone that's interested.
It is of course quite low priority.



I think you could make it so committers have admin access and others  
can have view access.




Re: Uncommons/Watchmaker in Maven

2009-11-05 Thread Isabel Drost
On Friday 06 November 2009 01:57:29 Grant Ingersoll wrote:
> FYI: https://watchmaker.dev.java.net/issues/show_bug.cgi?id=18
>
> I've asked Watchmaker to publish artifacts into Maven repo.

Great! Thanks: One less artifact we need to publish.

Isabel

-- 
  |\  _,,,---,,_   Web:   
  /,`.-'`'-.  ;-;;,_  
 |,4-  ) )-,_..;\ (  `'-' 
'---''(_/--'  `-'\_) (fL)  IM:  



signature.asc
Description: This is a digitally signed message part.


Uncommons/Watchmaker in Maven

2009-11-05 Thread Grant Ingersoll

FYI: https://watchmaker.dev.java.net/issues/show_bug.cgi?id=18

I've asked Watchmaker to publish artifacts into Maven repo.


Re: Informal Mahout MeetUp at ApacheCon Friday

2009-11-05 Thread Jake Mannix
So 16:00-ish?  I can do that, but I'm probably going to want to eat around
then or a little later - a day of talks is pretty
hunger-inducing.  We should at least plan some place to either be, or a
place to at least leave a big note telling latecomers
where we are.  We don't want to lose any folk who miss the last couple of
talks, as well as the people who may be *only*
coming for meetups, and not be paying for the 'con.

  -jake

On Thu, Nov 5, 2009 at 2:39 PM, Grant Ingersoll  wrote:

> How about Friday after the last Lucene talk?  We can probably just grab
> some tables in the big room or find an empty room.
>
> -Grant
>
>
> On Nov 4, 2009, at 12:42 AM, Jake Mannix wrote:
>
>  So let's meet up (two words, no capitals) Friday night, because the
>> official
>> MeetUps tend to run long on already long days, and so Isabel (and Ted!)
>> can
>> promote it in their Friday talks?
>>
>> Do we want to just stake out some bar space on the 2nd floor?  What time
>> makes sense on friday?  Ted, are you able to hang out friday night?
>>
>>  -jake
>>
>> On Mon, Nov 2, 2009 at 6:47 PM, Isabel Drost  wrote:
>>
>>  On Monday 02 November 2009 23:22:18 Jake Mannix wrote:
>>>
 Quick roll-call: who's already in Oakland or are planning to be there

>>> this
>>>
 week for ApacheCon?

>>>
>>> Arrived this afternoon.
>>>
>>>
>>>  In particular:

  * ) how many of us will be at Lucene MeetUp tuesday night?

 - Jake Mannix
 - Grant Ingersoll
 - Isabel Drost

  * ) how many of us will be at the Hadoop MeetUp thursday night?

 - Jake Mannix
 - Isabel Drost

>>>
>>>
>>> Isabel
>>>
>>> --
>>>  |\  _,,,---,,_   Web:   
>>> /,`.-'`'-.  ;-;;,_
>>> |,4-  ) )-,_..;\ (  `'-'
>>> '---''(_/--'  `-'\_) (fL)  IM:  
>>>
>>>
>>>
> --
> Grant Ingersoll
> http://www.lucidimagination.com/
>
> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using
> Solr/Lucene:
> http://www.lucidimagination.com/search
>
>


Re: Informal Mahout MeetUp at ApacheCon Friday

2009-11-05 Thread Grant Ingersoll
How about Friday after the last Lucene talk?  We can probably just  
grab some tables in the big room or find an empty room.


-Grant

On Nov 4, 2009, at 12:42 AM, Jake Mannix wrote:

So let's meet up (two words, no capitals) Friday night, because the  
official
MeetUps tend to run long on already long days, and so Isabel (and  
Ted!) can

promote it in their Friday talks?

Do we want to just stake out some bar space on the 2nd floor?  What  
time

makes sense on friday?  Ted, are you able to hang out friday night?

 -jake

On Mon, Nov 2, 2009 at 6:47 PM, Isabel Drost   
wrote:



On Monday 02 November 2009 23:22:18 Jake Mannix wrote:
Quick roll-call: who's already in Oakland or are planning to be  
there

this

week for ApacheCon?


Arrived this afternoon.



In particular:

 * ) how many of us will be at Lucene MeetUp tuesday night?

 - Jake Mannix
 - Grant Ingersoll
 - Isabel Drost

 * ) how many of us will be at the Hadoop MeetUp thursday night?

 - Jake Mannix
 - Isabel Drost



Isabel

--
 |\  _,,,---,,_   Web:   
/,`.-'`'-.  ;-;;,_
|,4-  ) )-,_..;\ (  `'-'
'---''(_/--'  `-'\_) (fL)  IM:  




--
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:

http://www.lucidimagination.com/search



Re: Feedback on release candidate for 0.2

2009-11-05 Thread Grant Ingersoll



On Nov 5, 2009, at 12:52 PM, Sean Owen wrote:

Committed. Re-roll the artifacts and let's finish up? you want me to  
try it?


Sure, go for it.



On Thu, Nov 5, 2009 at 10:56 AM, Sean Owen  wrote:

OK so I shall commit these unless I hear back, and that more or less
band-aids us to proceed with 0.2, yes?

On Wed, Nov 4, 2009 at 1:44 PM, Sean Owen  wrote:
OK, I "gpg --clearsign"-ed all the .jar files in lib and core/lib,  
and

have all the .asc files. Just commit those?

And roll back the maven-gpg-plugin to maven-deploy-plugin -- I see  
the

CL you are talking about?

I can commit this now, sure.







Re: Feedback on release candidate for 0.2

2009-11-05 Thread Sean Owen
Committed. Re-roll the artifacts and let's finish up? you want me to try it?

On Thu, Nov 5, 2009 at 10:56 AM, Sean Owen  wrote:
> OK so I shall commit these unless I hear back, and that more or less
> band-aids us to proceed with 0.2, yes?
>
> On Wed, Nov 4, 2009 at 1:44 PM, Sean Owen  wrote:
>> OK, I "gpg --clearsign"-ed all the .jar files in lib and core/lib, and
>> have all the .asc files. Just commit those?
>>
>> And roll back the maven-gpg-plugin to maven-deploy-plugin -- I see the
>> CL you are talking about?
>>
>> I can commit this now, sure.
>


[jira] Resolved: (MAHOUT-195) doubt about SlopeOneRecommender

2009-11-05 Thread Sean Owen (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen resolved MAHOUT-195.
--

   Resolution: Fixed
Fix Version/s: 0.3
 Assignee: Sean Owen

OK committed my javadoc change, and getDiff() improvement. TYVM

> doubt about SlopeOneRecommender
> ---
>
> Key: MAHOUT-195
> URL: https://issues.apache.org/jira/browse/MAHOUT-195
> Project: Mahout
>  Issue Type: Question
>  Components: Collaborative Filtering
>Reporter: Jens Grivolla
>Assignee: Sean Owen
>Priority: Minor
> Fix For: 0.3
>
>
> Looking through the SlopeOne code in order to make some changes, I am having 
> some doubts about how MemoryDiffStorage handles things.
> It looks to me like buildAverageDiffs(), or rather processOneUser() inserts 
> the item pairs in the order they appear in userPreferences, as obtained from 
> dataModel.getPreferencesFromUser(userID).
> So if user A has items (X,Y,Z) we obtain the pairs (X,Y),(X,Z),(Y,Z) and 
> update their averages,
> if user B has items (Z,X,Y) we obtain (Z,X),(Z,Y),(X,Y).
> When using getDiff for (Y,Z) it will not look for the (Z,Y) average that user 
> B contributes to, as the average for (Y,Z) is not null.
> Unless we know that preferences are always ordered, e.g. by itemID, this 
> seems like a bug.  I have not found any mention of it being ordered in the 
> documentation of DataModel or PreferenceArray.  If the items are ordered it 
> would seem to be easier to check the order in getDiff(x,y) instead of trying 
> one, then the other.
> P.s.: I tried to ask on mahout-users, but my message never appeared on the 
> list. There might be some kind of filter rejecting the plus sign in my 
> address or something like that, but it's the one where I receive the list 
> messages.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: Build failed on Hudson - Failures in Tests

2009-11-05 Thread Sean Owen
Yep just found the small but bone-headed typo in my last CL that
caused the break. Submitting a fix now. I'm pretty good about running
tests before submitting so not sure how I dropped the ball on this
one.

On Thu, Nov 5, 2009 at 7:15 PM, Sean Owen  wrote:
> yes not sure what happened there, I have no idea how the change could
> have caused these failures. Obviously it did. :)
>
> On Thu, Nov 5, 2009 at 6:53 PM, Isabel Drost  wrote:
>>
>> Seems like the most recent changes broke some of our unit tests:
>>
>> http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/442/
>>
>> http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/442/console
>>
>> Sean, are you aware of that?
>>
>> Isabel
>>
>> --
>>  |\      _,,,---,,_       Web:   
>>  /,`.-'`'    -.  ;-;;,_
>>  |,4-  ) )-,_..;\ (  `'-'
>> '---''(_/--'  `-'\_) (fL)  IM:  
>>
>>
>


[jira] Commented: (MAHOUT-196) bounded values for RecommenderEvaluator

2009-11-05 Thread Sean Owen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12774025#action_12774025
 ] 

Sean Owen commented on MAHOUT-196:
--

It's an interesting question, yeah. One approach would be to cap this in the 
recommender, which makes some sense. Why would I ever estimate a movie was 
rated 6 stars? the only catch is then you lose some ordering information that 
the estimates provide. A 5.5 star movie should still be recommended before 5.4.

Let me think about a way to incorporate this. I imagine it is indeed just a 
matter of exposing some way to express limits.

> bounded values for RecommenderEvaluator
> ---
>
> Key: MAHOUT-196
> URL: https://issues.apache.org/jira/browse/MAHOUT-196
> Project: Mahout
>  Issue Type: Improvement
>  Components: Collaborative Filtering
>Reporter: Jens Grivolla
>Priority: Minor
>
> When evaluating a recommender using RMSRecommenderEvaluator (or some others) 
> on e.g. Netflix data, a recommender gets heavily penalized for predicting 
> values below 1 or above 5 (that are known to be out of the permitted bounds).
> It seems to me that it makes no sense to change the recommender to avoid 
> those predictions, since an estimated 6 probably has a greater chance to be 
> highly rated than a predicted 5.1.  I therefore propose to allow truncating 
> predictions to those "legal" values directly in the evaluator and leave the 
> recommenders unchanged, since it is more of a post-processing step than part 
> of the recommender itself.
> I added those boundaries to the constructor of RMSRecommenderEvaluator and 
> limit estimatedPreference to the allowed range before calculating 
> "realPref.getValue() - estimatedPreference" and seem to get slightly better 
> scores.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.