dr0ptp4kt added a comment.

  I kicked off a run using the current version of the patch with the following 
command and backing table, and its status should be able to be followed here: 
https://yarn.wikimedia.org/cluster/app/application_1713178047802_16409
  
  So long as I haven't made an error somewhere in here that produces a runtime 
exception (e.g., pathing), we should be able to see after a couple hours how 
it's going.
  
    spark3-submit --master yarn --driver-cores 2 --conf 
spark.sql.autoBroadcastJoinThreshold=-1 --conf 
spark.dynamicAllocation.maxExecutors=128 --conf 
spark.sql.shuffle.partitions=512 --conf spark.executor.memoryOverhead=4g 
--executor-cores 4 --executor-memory 12g --driver-memory 16g  --name 
scholarly_article_split_manual__scholarly_article_split_triples__T362060_personal_namespace
  --conf spark.yarn.maxAppAttempts=1 --class 
org.wikidata.query.rdf.spark.transform.structureddata.dumps.ScholarlyArticleSplit
 --deploy-mode cluster 
/home/dr0ptp4kt/rdf-spark-tools-0.3.138-SNAPSHOT-jar-with-dependencies-T362060.jar
 --input-table-partition-spec 
discovery.wikibase_rdf_t337013/date=20231016/wiki=wikidata 
--output-table-partition-spec 
dr0ptp4kt.wikibase_rdf_scholarly_split_T362060/snapshot=20231016/wiki=wikidata
  
  Here was the manual table creation I did while `use`ing the `dr0ptp4kt` 
namespace.
  
    CREATE TABLE IF NOT EXISTS dr0ptp4kt.wikibase_rdf_scholarly_split_T362060 (
      `subject` string,
      `predicate` string,
      `object` string,
      `context` string
    )
    PARTITIONED BY (
        `snapshot` string,
        `wiki` string,
        `scope` string
    )
    STORED AS PARQUET
    LOCATION 
'hdfs://analytics-hadoop/user/dr0ptp4kt/wikibase_rdf_scholarly_split_T362060/wikidata/rdf_scholarly_split_T362060/'
    ;

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dr0ptp4kt
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Hook696, Kent7301, 
joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, Lahi, Gq86, 
Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, 
Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to