[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-20 Thread VladimirAlexiev
VladimirAlexiev added a comment.


  @ivan
  
  > see updated report immediately after fixing several items
  
  How would this work, on demand? Click a button and the page is regenerated?

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Ivan_A_Krestinin, Tacsipacsi, Bugreporter, Aklapper, VladimirAlexiev, 
Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-18 Thread Ivan_A_Krestinin
Ivan_A_Krestinin added a comment.


  I think rewriting KrBot is right idea. It has one significant limitation: it 
generates report periodically. As the reports user I want to see updated report 
immediately after fixing several items.
  
  But please do not lose several important features of KrBot:
  
  - Report is generated for property, not for item. Tasks like data import work 
with few numbers of properties, but with huge amount of items. So such 
processes can not be controlled using reports for individual items.
  - Bot processes all property values, not some subset of items.
  - Report contains all constraints for individual property. This allows review 
a property status quickly.
  - Bot processes deprecated values also. It is important for Format, Type and 
some other constraints.
  
  Maybe we should think about checking constraints in items edit API. This may 
make constraints reports redundant at all in future. I created T291335 
 to discuss the idea.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Ivan_A_Krestinin
Cc: Ivan_A_Krestinin, Tacsipacsi, Bugreporter, Aklapper, VladimirAlexiev, 
Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-16 Thread Tacsipacsi
Tacsipacsi added a comment.


  I don’t have access to the bot’s source code either, but I guess so.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Tacsipacsi
Cc: Tacsipacsi, Bugreporter, Aklapper, VladimirAlexiev, Invadibot, maantietaja, 
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-16 Thread VladimirAlexiev
VladimirAlexiev added a comment.


  @Tacsipacsi Labels are missing at 
https://www.wikidata.org/wiki/Wikidata:Database_reports/Constraint_violations/P2088.
 Maybe because it has a much bigger number of violations?

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Tacsipacsi, Bugreporter, Aklapper, VladimirAlexiev, Invadibot, maantietaja, 
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-15 Thread Bugreporter
Bugreporter added a comment.


  So: T291091: Snapshots for saved queries 
.
  
  In T290961#7352669 , 
@Tacsipacsi wrote:
  
  > For me, one of the best features of KrBot that it actually edits wiki pages 
instead of embedding the results from somewhere—this produces valuable diffs. 
Diffs are useful to see new violations; for example I may want to concentrate 
on them so that reverting an incomprehensible value is more likely to result in 
the author adding a correct value; or I may have intentionally skipped several 
constraint violations, which are extremal rather than wrong values, and I don’t 
want to go over them again and again. Page histories also provide nice 
statistics about the constraint violations over time: are we doing a good job, 
having less and less / a constant low number of constraint violations? Or do we 
get more new ones than what we clean up? They also show up on watchlists, 
reminding from time to time that there may be new violations. For these 
reasons, I’d like T67626 ’s result 
//not// to be used for the KrBot rewrite.
  >
  >> - An improvement is needed: print the labels of WD items in addition to 
`Q`
  >
  > KrBot supports this, see for example Wikidata:Database reports/Constraint 
violations/P4082 
.
 It’s actually a feature that it doesn’t produce output using the `{{Q}}` 
template on certain pages: these pages are so big that the template would break 
the display of the page (e.g. it may exceed the Lua memory/time limits).

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Bugreporter
Cc: Tacsipacsi, Bugreporter, Aklapper, VladimirAlexiev, Invadibot, maantietaja, 
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread Tacsipacsi
Tacsipacsi added a comment.


  For me, one of the best features of KrBot that it actually edits wiki pages 
instead of embedding the results from somewhere—this produces valuable diffs. 
Diffs are useful to see new violations; for example I may want to concentrate 
on them so that reverting an incomprehensible value is more likely to result in 
the author adding a correct value; or I may have intentionally skipped several 
constraint violations, which are extremal rather than wrong values, and I don’t 
want to go over them again and again. Page histories also provide nice 
statistics about the constraint violations over time: are we doing a good job, 
having less and less / a constant low number of constraint violations? Or do we 
get more new ones than what we clean up? They also show up on watchlists, 
reminding from time to time that there may be new violations. For these 
reasons, I’d like T67626 ’s result 
//not// to be used for the KrBot rewrite.
  
  > - An improvement is needed: print the labels of WD items in addition to 
`Q`
  
  KrBot supports this, see for example Wikidata:Database reports/Constraint 
violations/P4082 
.
 It’s actually a feature that it doesn’t produce output using the `{{Q}}` 
template on certain pages: these pages are so big that the template would break 
the display of the page (e.g. it may exceed the Lua memory/time limits).

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Tacsipacsi
Cc: Tacsipacsi, Bugreporter, Aklapper, VladimirAlexiev, Invadibot, maantietaja, 
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread Bugreporter
Bugreporter added a comment.


  I think we should wait until we can directly transclude query results into 
wiki pages (T67626: [Epic] Support for queries on-wiki (automated list 
generation) ).

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Bugreporter
Cc: Bugreporter, Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, 
Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread VladimirAlexiev
VladimirAlexiev updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread VladimirAlexiev
VladimirAlexiev updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread VladimirAlexiev
VladimirAlexiev updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread VladimirAlexiev
VladimirAlexiev updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread VladimirAlexiev
VladimirAlexiev added a parent task: T214362: RFC: Store 
WikibaseQualityConstraint check data in persistent storage.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T290961: rewrite KrBot to publish Constraint Violation pages

2021-09-14 Thread VladimirAlexiev
VladimirAlexiev created this task.
VladimirAlexiev added a project: Wikidata.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  WMDE declined to take over KrBot operations (T290635 
, T189747 
) for two reasons:
  
  - it's not open source
  - WMDE puts higher priority on being able to access violations through 
SPARQL/API (T214362 ), but that 
still needs to complete various tech tasks (eg T201150 
)
  
  KrBot generates violation reports like 
  
https://www.wikidata.org/wiki/Wikidata:Database_reports/Constraint_violations/P2088
 that are integrated in Property Discussion pages and are viewed as core part 
of WD.
  
  - These pages are the best way to work out data quality problems of specific 
props. Eg I'm now working out through 
https://www.wikidata.org/wiki/Wikidata:Database_reports/Constraint_violations/P2088#%22Single_value%22_violations
 to remove stale or wrong CrunchBase identifiers
  - Even when I can get all violation info with SPARQL, I'd prefer to work from 
a generated WD page because:
- all the info is available at a glance,
- it can be used by non-tech people (eg Getty Vocabulary Program editors 
will now use ULAN constraint violations to improve their own data)
- I can use it to generate QS corrections.
  - An improvement is needed: print the labels of WD items in addition to 
`Q`
  
  There is https://www.wikidata.org/wiki/Special:ConstraintReport/ (eg 
  https://www.wikidata.org/wiki/Special:ConstraintReport/Q389336 shows `P2088` 
violations) but big-data editors don't fix data problems item by item.

TASK DETAIL
  https://phabricator.wikimedia.org/T290961

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org