dcausse added a comment.
happened a couple of times on a test run:
FailedOp(FullImport(Q93246620,2020-05-04T12:57:49Z,1173447691),org.wikidata.query.rdf.tool.exception.ContainedException:
Didn't get a revision id for [])
gerritbot added a comment.
Change 588670 **merged** by jenkins-bot:
[wikidata/query/rdf@master] Cleaning of entity ids was added, along with the
filtering of changes for property talk pages
https://gerrit.wikimedia.org/r/588670
TASK DETAIL
https://phabricator.wikimedia.org/T249099
gerritbot added a comment.
Change 588670 had a related patch set uploaded (by ZPapierski; owner:
ZPapierski):
[wikidata/query/rdf@master] Cleaning of entity ids was added, along with the
filtering of changes for property talk pages
https://gerrit.wikimedia.org/r/588670
TASK DETAIL
dcausse added a comment.
Are we emitting exceptions when the HTTP status is not what we expect, e.g.
404? If yes this is worrisome and we definitely need to look into what entity
and revision is producing such RDF.
TASK DETAIL
https://phabricator.wikimedia.org/T249099
EMAIL PREFERENCES
Zbyszko added a comment.
No, right now exceptions are just uncaught.
As for an error itself - triple list doesn't have to be empty. During
munging, statement list is being modified and by the time that this exception
is thrown, it can be empty. I'm not yet suggesting that this is the
dcausse added a comment.
Do you have the entity and its revision causing this?
The `[]` in `Didn't get a revision id for []` suggests that the list of
triple was empty, could it be some error handling that is missing when fetching
the RDF data?
TASK DETAIL