Unfairly weighted charmstore results

2016-03-19 Thread Tom Barber
Cross posted from IRC:

Hello folks,

I have a gripe about the charm store search. Mostly because its really
badly weighted towards recommended charms, and finding what you(an end user
wants is really hard unless they know what they are doing).

Take this example:

https://jujucharms.com/q/pentaho

Now I'm writing a charm called Pentaho Data Integration, so why do I have
to scroll past 55 recommended charms that have nothing to do with what I
have looked for?

But

https://jujucharms.com/q/etl

Shows me exactly what I need at the top, with no recommended charms
blocking the view.

So I guess its weighted towards tags, then names, sorta.

Im not against recommended charms being dumped at the top, they are
recommended after all but it appears the ranking could be vastly improved.

Off the top of my head a ranking combo of something like, keyword
relevance, recommended vs non recommended, times deployed, age, tags and
last updated. would give a half decent weighting for the charms and would
hopefully stop 55 unrelated charms appearing at the top of the list.

Now I guess, I could dump pentaho in as a tag to get me top of the SEO
rankings, but it seems like generally the method could be improved as the
amount of charms increases, quite plausibly using something like Apache
Nutch to crawl the available charms and build a proper search facility
would improve things.

Cheers

Tom


--

Director Meteorite.bi - Saiku Analytics Founder
Tel: +44(0)5603641316

(Thanks to the Saiku community we reached our Kickstart

goal, but you can always help by sponsoring the project
)
-- 
Juju mailing list
Juju@lists.ubuntu.com
Modify settings or unsubscribe at: 
https://lists.ubuntu.com/mailman/listinfo/juju


Re: Unfairly weighted charmstore results

2016-03-19 Thread Nate Finch
BTW, I reported a very similar problems in this bug:
https://github.com/CanonicalLtd/jujucharms.com/issues/192

On Thu, Mar 17, 2016 at 10:18 AM Uros Jovanovic <
uros.jovano...@canonical.com> wrote:

> Hi Tom,
>
> We currently bump the recommended charms over the community ones. The
> reason other shows is due to using N-grams (3-N) in search and the ranking
> logic using that puts recommended charms over the non-recommended ones. And
> we're not only searching over names of charms but a bunch of content that a
> charm has.
>
> The system works relatively well for recommended charms if you know the
> name (or close to what name is), but not in cases where a name is long and
> the charm is only in community space. That's why you get better results
> with short query vs a longer one.
>
> We're working on providing better search results in the following weeks.
>
>
>
>
> On Thu, Mar 17, 2016 at 2:18 PM, Tom Barber 
> wrote:
>
>> Cross posted from IRC:
>>
>> Hello folks,
>>
>> I have a gripe about the charm store search. Mostly because its really
>> badly weighted towards recommended charms, and finding what you(an end user
>> wants is really hard unless they know what they are doing).
>>
>> Take this example:
>>
>> https://jujucharms.com/q/pentaho
>>
>> Now I'm writing a charm called Pentaho Data Integration, so why do I have
>> to scroll past 55 recommended charms that have nothing to do with what I
>> have looked for?
>>
>> But
>>
>> https://jujucharms.com/q/etl
>>
>> Shows me exactly what I need at the top, with no recommended charms
>> blocking the view.
>>
>> So I guess its weighted towards tags, then names, sorta.
>>
>> Im not against recommended charms being dumped at the top, they are
>> recommended after all but it appears the ranking could be vastly improved.
>>
>> Off the top of my head a ranking combo of something like, keyword
>> relevance, recommended vs non recommended, times deployed, age, tags and
>> last updated. would give a half decent weighting for the charms and would
>> hopefully stop 55 unrelated charms appearing at the top of the list.
>>
>> Now I guess, I could dump pentaho in as a tag to get me top of the SEO
>> rankings, but it seems like generally the method could be improved as the
>> amount of charms increases, quite plausibly using something like Apache
>> Nutch to crawl the available charms and build a proper search facility
>> would improve things.
>>
>> Cheers
>>
>> Tom
>>
>>
>> --
>>
>> Director Meteorite.bi - Saiku Analytics Founder
>> Tel: +44(0)5603641316
>>
>> (Thanks to the Saiku community we reached our Kickstart
>> 
>> goal, but you can always help by sponsoring the project
>> )
>>
>> --
>> Juju mailing list
>> Juju@lists.ubuntu.com
>> Modify settings or unsubscribe at:
>> https://lists.ubuntu.com/mailman/listinfo/juju
>>
>>
> --
> Juju mailing list
> Juju@lists.ubuntu.com
> Modify settings or unsubscribe at:
> https://lists.ubuntu.com/mailman/listinfo/juju
>
-- 
Juju mailing list
Juju@lists.ubuntu.com
Modify settings or unsubscribe at: 
https://lists.ubuntu.com/mailman/listinfo/juju


Re: Unfairly weighted charmstore results

2016-03-19 Thread Uros Jovanovic
Hi Tom,

We currently bump the recommended charms over the community ones. The
reason other shows is due to using N-grams (3-N) in search and the ranking
logic using that puts recommended charms over the non-recommended ones. And
we're not only searching over names of charms but a bunch of content that a
charm has.

The system works relatively well for recommended charms if you know the
name (or close to what name is), but not in cases where a name is long and
the charm is only in community space. That's why you get better results
with short query vs a longer one.

We're working on providing better search results in the following weeks.




On Thu, Mar 17, 2016 at 2:18 PM, Tom Barber  wrote:

> Cross posted from IRC:
>
> Hello folks,
>
> I have a gripe about the charm store search. Mostly because its really
> badly weighted towards recommended charms, and finding what you(an end user
> wants is really hard unless they know what they are doing).
>
> Take this example:
>
> https://jujucharms.com/q/pentaho
>
> Now I'm writing a charm called Pentaho Data Integration, so why do I have
> to scroll past 55 recommended charms that have nothing to do with what I
> have looked for?
>
> But
>
> https://jujucharms.com/q/etl
>
> Shows me exactly what I need at the top, with no recommended charms
> blocking the view.
>
> So I guess its weighted towards tags, then names, sorta.
>
> Im not against recommended charms being dumped at the top, they are
> recommended after all but it appears the ranking could be vastly improved.
>
> Off the top of my head a ranking combo of something like, keyword
> relevance, recommended vs non recommended, times deployed, age, tags and
> last updated. would give a half decent weighting for the charms and would
> hopefully stop 55 unrelated charms appearing at the top of the list.
>
> Now I guess, I could dump pentaho in as a tag to get me top of the SEO
> rankings, but it seems like generally the method could be improved as the
> amount of charms increases, quite plausibly using something like Apache
> Nutch to crawl the available charms and build a proper search facility
> would improve things.
>
> Cheers
>
> Tom
>
>
> --
>
> Director Meteorite.bi - Saiku Analytics Founder
> Tel: +44(0)5603641316
>
> (Thanks to the Saiku community we reached our Kickstart
> 
> goal, but you can always help by sponsoring the project
> )
>
> --
> Juju mailing list
> Juju@lists.ubuntu.com
> Modify settings or unsubscribe at:
> https://lists.ubuntu.com/mailman/listinfo/juju
>
>
-- 
Juju mailing list
Juju@lists.ubuntu.com
Modify settings or unsubscribe at: 
https://lists.ubuntu.com/mailman/listinfo/juju