GoranSMilovanovic added a comment.

  @MGerlach
  
  The authors of the paper that you cited in T282563#7419722 
<https://phabricator.wikimedia.org/T282563#7419722> use a similar, if not the 
same, approach to feature engineering for the prediction task as the one I used 
with the RF classifier in T282563#7251679 
<https://phabricator.wikimedia.org/T282563#7251679>. See p. 4 of the PDF: 
<https://parklize.github.io/publications/ISWC2021.pdf>
  
    3. Diversity of edit actions (Div_{edit·act}). To capture the diversity of
    different types of edit actions (see Section 4), we use the Shannon-Entropy
    [25] of different edit actions in the same manner as in [24] as:
    H(T) = -\sum_{i=1}^{n} P(t_i) \cdot \log P(t_i),
    where T indicates different types of edit actions, and |T| = n.
    
    4. Diversity of entities (Div_{ent}). We measure the diversity of edited
    entities of a user using the Shannon-Entropy. The intuition is that the
    diversity of edited entities of a user could also be different across
    active and inactive editors.
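
  For illustration, here is a minimal sketch of how both quoted features could 
be computed for a single user. It is not the code used in the paper or in my RF 
pipeline. The function name `shannon_entropy`, the sample action names and the 
entity IDs are hypothetical, and the natural logarithm is an assumption, since 
the quoted passage does not state the base.

```python
from collections import Counter
from math import log


def shannon_entropy(items):
    """Shannon entropy H = -sum_i P(t_i) * log P(t_i) of the observed items.

    P(t_i) is estimated as the relative frequency of each distinct item.
    Natural log is an assumption; the quoted passage does not give the base.
    """
    counts = Counter(items)
    total = sum(counts.values())
    if total == 0:
        # No edits observed: define the entropy as 0.
        return 0.0
    # p * log(1 / p) equals -p * log(p); written this way to avoid a -0.0 result.
    return sum((c / total) * log(total / c) for c in counts.values())


# Hypothetical edit history of one user: (edit action type, edited entity).
edits = [
    ("add_claim", "Q1"),
    ("add_claim", "Q1"),
    ("set_label", "Q2"),
    ("set_label", "Q3"),
]

div_edit_act = shannon_entropy(action for action, _ in edits)  # diversity of edit actions
div_ent = shannon_entropy(entity for _, entity in edits)       # diversity of entities
```

  A user who always performs the same action on the same entity gets 0 on both 
features, and the values grow as the edits spread more evenly over more action 
types or entities.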

TASK DETAIL
  https://phabricator.wikimedia.org/T282563
