Tarrow created this task. Tarrow added a project: Wikidata - Reference Treasure Hunt. Restricted Application added a subscriber: Aklapper. Restricted Application added a project: Wikidata.
TASK DESCRIPTION Steps to Reproduce: run `python3 wikidatarefisland/run.py --step match --input "scraped_data.jsonl" --output "matched_references.jsonl"` Actual Results: `python3 wikidatarefisland/run.py --step match --input "scraped_data.jsonl" --output "matched_references.jsonl"` Traceback (most recent call last): File "wikidatarefisland/run.py", line 97, in <module> main(sys.argv, __file__) File "wikidatarefisland/run.py", line 75, in main simple_pump.run(pipe, args.input_path, args.output_path) File "/home/tom/src/wikimedia/reference-island/wikidatarefisland/pumps/pump.py", line 20, in run output = pipe.flow(line) File "/home/tom/src/wikimedia/reference-island/wikidatarefisland/pipes/value_matcher_pipe.py", line 33, in flow if not any(match(potential_match) for match in filters): File "/home/tom/src/wikimedia/reference-island/wikidatarefisland/pipes/value_matcher_pipe.py", line 33, in <genexpr> if not any(match(potential_match) for match in filters): File "/home/tom/src/wikimedia/reference-island/wikidatarefisland/data_model/wikibase/value_matchers.py", line 77, in match_datetime return value in reference["extractedData"] File "/home/tom/src/wikimedia/reference-island/wikidatarefisland/data_model/wikibase/value_types.py", line 79, in __eq__ date = isoparse(self.value) File "/home/tom/src/wikimedia/reference-island/venv/lib/python3.7/site-packages/dateutil/parser/isoparser.py", line 37, in func return f(self, str_in, *args, **kwargs) File "/home/tom/src/wikimedia/reference-island/venv/lib/python3.7/site-packages/dateutil/parser/isoparser.py", line 146, in isoparse return datetime(*components) ValueError: day is out of range for month Makefile:20: recipe for target 'data/matched_references.jsonl' failed Expected Results: No explosion but it skips TASK DETAIL https://phabricator.wikimedia.org/T254587 WORKBOARD https://phabricator.wikimedia.org/project/board/4635/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Tarrow Cc: Aklapper, Tarrow, Ferdinand0101, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Lydia_Pintscher, Mbch331
_______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs