AndrewTavis_WMDE added a comment.
⚠️ Currently WIP ⚠️
===================

Going through the files sent by @JAllemandou above <https://phabricator.wikimedia.org/T358311#9648470>. This message will be saved as I go so that I don't lose my progress. If I do find something worth documenting, then I'll also include it below so that this task can serve as a reference for later if need be.

stat1004
--------

None of the files are worth keeping. See descriptions and reasoning below:

total 28
Analytics
└─ NewEditors
   └─ adHoc (nothing of interest)
   └─ Compaigns
      └─ 2019 and 2020 email campaigns with R-based analysis (nothing of interest)
└─ WDCM
   └─ WDCM_Output
      └─ Lots of directories of CSVs (nothing of interest)
   └─ WDCM_Scripts
      └─ R-based scripts that would be archived on Gerrit if they were ever in production (nothing of interest)
└─ Wikidata
   └─ misc
      └─ Some ad hoc work (nothing of interest)
   └─ WD_languagesLandscape
      └─ R-based scripts that would be archived on Gerrit if they were ever in production (nothing of interest)
   └─ WD_ORES_ItemQuality (nothing of interest given Lift Wing migration)
   └─ WD_UsageCoverage
      └─ R and Python scripts that are doubtless versions of the WDCM UsageCoverage dashboard that's archived on Gerrit (nothing of interest)
Experiments
└─ Empty
_miscWMDE
└─ summerBannerCampaign2017_DataOUT
   └─ TSV files (nothing of interest)
└─ TWLBanner_2017
   └─ TSV files and simple HQL queries against `wmf.webrequest` for banner campaign hits (nothing of interest, easy to learn as needed)
      Example query:
      SELECT count(*)
      FROM wmf.webrequest
      WHERE uri_host = 'de.wikipedia.org'
        AND uri_query LIKE "$/wiki/Wikipedia:Umfragen/Technische_Wünsche_2017$"
        AND http_method = 'GET'
        AND is_pageview = TRUE
        AND YEAR = 2017 AND MONTH = 6 AND DAY = 1 AND HOUR = 20;
└─ TWLBanner_2017_DataOUT
   └─ TSV files (nothing of interest)
_miscWMDE_1004
└─ TWLBanner_2017
   └─ One HQL and one TSV file that are similar to the above (nothing of interest)
R
└─ x86_64-pc-linux-gnu-library (nothing of interest)
Research
└─ DydimusZengenene
   └─ Note: work to support a researcher (nothing of interest)
   └─ _analytics
   └─ _data
   └─ DydimusZengenene.Rproj
   └─ ParseTargetPage.R
wdUsagePerPage
└─ Related to the percentage usage dashboard, so would be archived on Gerrit if they were ever in production (nothing of interest)

stat1005
--------

total 964
Analytics └─
BotEdits_perProject.ipynb └─
crontabstat1005.txt └─
DataModelTerms_20210228_Updates.ipynb └─
dewiki_NewEds_2021.ipynb └─
QCF_M2_Test.ipynb └─
QuratorCuriousFacts_Separators.ipynb └─
Qurator_M1.ipynb └─
R └─
snapshot_query.hql └─
Untitled1.ipynb └─
untitled1.txt └─
Untitled2.ipynb └─
Untitled3.ipynb └─
Untitled4.ipynb └─
Untitled5.ipynb └─
Untitled.ipynb └─
untitled.txt └─
venv └─
wd_cluster_fetch_items_M2.ipynb └─
wd_cluster_fetch_items_M3.ipynb └─
WDCM_ETL_OTHER_TEST.ipynb └─
WDCM_Statements_Test.ipynb └─
WD_HumanEditsPerClass_RevisionTags.ipynb └─
WD_Inequality_Intake.ipynb └─
WD_Languages_Datamodel_CollectInit.ipynb └─
WD_Languages_Datamodel_EXP.ipynb └─
WD_MonthlyEditors.ipynb └─
WD_Sitelinks_WDAHP_202108.ipynb └─
wd_statements_HiveQL_Query.hql └─
WD_Translations.ipynb └─
WHEIP_exps.ipynb └─
wikidata_analytics_examples └─
WikidataRevisions_November2020.csv └─

stat1006
--------

total 48
misc_projects └─
myTemp └─
NewEds └─
nohup.out └─
R └─
RPckg └─
RScripts └─
sqlIn └─
sqlOut └─
WDCM_Credentials └─
WDCM_DataIN └─
WDCM_DataOUT └─
WDCM_sql └─

stat1007
--------

total 28
Analytics └─
crontabstat1007.txt └─
Experiments └─
Python3 └─
R └─
RScripts └─
venv └─

stat1008
--------

total 16
Analytics └─
R └─
renv └─
venv └─

stat1009
--------

total 0

stat1010
--------

total 0

HDFS
----

Found 55 items
/user/goransm/.Trash └─
/user/goransm/.metadata └─
/user/goransm/.sparkStaging └─
/user/goransm/.staging └─
/user/goransm/.temp └─
/user/goransm/Architectural-Structure_ItemIDs.csv └─
/user/goransm/Astronomical-Object_ItemIDs.csv └─
/user/goransm/Book_ItemIDs.csv └─
/user/goransm/Chemical-Entities_ItemIDs.csv └─
/user/goransm/Event_ItemIDs.csv └─
/user/goransm/Gene_ItemIDs.csv └─
/user/goransm/Geographical-Object_ItemIDs.csv └─
/user/goransm/Human_ItemIDs.csv └─
/user/goransm/ORESPredictions └─
/user/goransm/Organization_ItemIDs.csv └─
/user/goransm/Taxon_ItemIDs.csv └─
/user/goransm/Thoroughfare_ItemIDs.csv └─
/user/goransm/WDCM_Biases_ETL_Test └─
/user/goransm/WDCM_CollectedGeoItems └─
/user/goransm/WDCM_CollectedItems └─
/user/goransm/Wikimedia_Internal_ItemIDs.csv └─
/user/goransm/Work-Of-Art_ItemIDs.csv └─
/user/goransm/dewiki_revisions └─
/user/goransm/dfTrain1.csv └─
/user/goransm/dfTrain2.csv └─
/user/goransm/dfTrain3.csv └─
/user/goransm/dfTrain4.csv └─
/user/goransm/dfTrain5.csv └─
/user/goransm/flights.csv └─
/user/goransm/mysql-analytics-research-client-pw.txt └─
/user/goransm/refClassSubclasses.csv └─
/user/goransm/separators.csv └─
/user/goransm/singleValueConstraintProperties.csv └─
/user/goransm/subclasses.csv └─
/user/goransm/tfMatrixDF.csv └─
/user/goransm/tfMatrix_Human.csv └─
/user/goransm/wdORESQuality.csv └─
/user/goransm/wdORESQuality_Reuse.csv └─
/user/goransm/wdORESQuality_Reuse_Commons.csv └─
/user/goransm/wdORESQuality_Reuse_nonCommons.csv └─
/user/goransm/wd_dump_geocoded └─
/user/goransm/wd_dump_human_author └─
/user/goransm/wd_dump_human_creator └─
/user/goransm/wd_dump_human_gender └─
/user/goransm/wd_dump_human_occupation └─
/user/goransm/wd_dump_item_language └─
/user/goransm/wd_dump_labels_English └─
/user/goransm/wd_entity_reuse └─
/user/goransm/wd_extId_data_qual_.csv └─
/user/goransm/wd_extId_data_ref_.csv └─
/user/goransm/wd_extId_data_ref_snak_.csv └─
/user/goransm/wd_extId_data_stat_.csv └─
/user/goransm/wdcmsqoop └─
/user/goransm/wdtranslationsb └─
/user/goransm/wikidataRevisions_EXP.csv └─

Hive
----

Nothing found

TASK DETAIL
  https://phabricator.wikimedia.org/T358311

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: AndrewTavis_WMDE
Cc: brouberol, JAllemandou, MoritzMuehlenhoff, Manuel, Aklapper, AndrewTavis_WMDE,
Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, BTullis, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Dringsim, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331