GoranSMilovanovic added a comment.
@MGerlach The authors of the paper that you have cited in T282563#7419722 <https://phabricator.wikimedia.org/T282563#7419722> use a similar - if not the same - approach to feature engineering for the prediction task as I have used in T282563#7251679 <https://phabricator.wikimedia.org/T282563#7251679> w. the RF classifier, pp. 4 in the PDF: <https://parklize.github.io/publications/ISWC2021.pdf> 3. Diversity of edit actions (Divedit·act). To capture the diversity of different types of edit actions (see Section 4), we use the Shannon-Entropy [25] of different edit actions in the same manner as in [24] as: H(T) = − Pn i=1 P(ti)·log P(ti) where T indicates different types of edit actions, and |T| = n. 4. Diversity of entities (Divent). We measure the diversity of edited entities of a user using the Shannon-Entropy. The intuition is that the diversity of edited entities of a user could also be different across active and inactive editors. TASK DETAIL https://phabricator.wikimedia.org/T282563 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic Cc: Esh77, Pablo, Mohammed_Sadat_WMDE, Tobi_WMDE_SW, MGerlach, awight, WMDE-leszek, Manuel, Lydia_Pintscher, Aklapper, Jan_Dittrich, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org