Thanks Sean,
I'm just looking at section "3.3 Coping without preference values" of
Mahout in Action book, that explains how to work with
GenericBooleanPrefDataModel.




On 8 June 2013 14:20, Sean Owen <sro...@gmail.com> wrote:

> Use an implementation that doesn't expect a rating. These are
> so-called 'boolean' implementations, like GenericBooleanPrefDataModel.
> For example you can build and item-based recommender with the boolean
> version of item based recommender and a log-likelihood similarity.
>
> Or, yes you can calculate some meaningful edge weight to add more info
> to your model. Maybe the number of times the two users interacted? the
> resulting number can be used as a 'rating' although I don't know if
> you will get great results since it doesn't act a lot like a rating.
> Instead, use the log of this number.
>
> Or, use an algorithm that is comfortable with count-like input, like
> ALS with the "implicit data" option turned on.
>
> Sean
>
> On Sat, Jun 8, 2013 at 2:15 PM, Peter Holland <d99991...@mydit.ie> wrote:
> > Hi All,
> > I am trying to use Mahout for Link Prediction in a Social Network.
> >
> > The data I have is an edges list with 9.4 million rows. The edge list is
> a
> > csv vile where each node is an integer value and a row represents a edge
> > between two nodes. For example;
> >
> > 3432, 5098
> > 3423, 6710
> > 4490, 5843
> > 4490, 2039
> > .....
> >
> > This is a directed graph so row 1 means that node 3432 follows node 5098.
> >
> > I would like to build a recommender to calculate the top 10 nodes a user
> > might like to connect to next. The problem I have is that the recommender
> > classes needs input in the form (user, item, value).  So, how can I first
> > calculate a value to represent the 'weight' of an edge? For example
> > EdgeRank?
> >
> > Any help would be greatly appreciated.
> > Thank you,
> > Peter
>

Reply via email to