Feliciss created this task.
Feliciss added projects: Outreachy (Round 24), Wikidata.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  I'm documenting this because the bot can run on Toolforge for two days 
without breaking to add papers from ADS to Wikidata.
  
  What statements and properties is the bot adding to Wikidata:
  
  1. ADS bibcode
  2. author name string and authors found on Wikidata or created authors from 
ORCID iD in ADS database, with properties series ordinal, author given names, 
author last names, affiliation string, object stated as (only in authors).
  3. publication date
  4. DOI
  5. Published in, used with corresponded ISSN found on Wikidata
  6. issue
  7. page(s)
  8. number of pages
  9. volume
  10. title, with title in HTML property if it contains HTML tags
  11. arXiv ID, with arXiv classifications (if applicable)
  
  The bot first creates an item page on Wikidata and then adds these above 
statements and properties with retrieved date, database info, and ADS bibcode.
  
  Example: https://www.wikidata.org/wiki/Q113556225
  
  Be careful that the bot may fail if it adds thousands of authors to one paper 
due to a heavy server load on Toolforge, read and write edits in a large file 
on Wikidata.
  
  The bot uses two online links to get sources of information to feed into ADS:
  
  1. uses property on Q112684896 <https://www.wikidata.org/wiki/Q112684896>
  2. the open-sourced fields enum to query on adsabs-dev-api 
<https://raw.githubusercontent.com/adsabs/adsabs-dev-api/master/openapi/parameters.yaml>
  
  The first is to query property names to compare them with the enum listed in 
the second, with a name conversion file 
<https://github.com/outreachy-wasian/wasian/blob/main/wasian/sparqlwrapper/surname/wikidata_key_replace_map.json>.
 At the moment, there're 11 statements used to add to Wikidata from ADS.
  
  Detailed code on GitHub 
<https://github.com/outreachy-wasian/wasian/blob/main/wasian/sparqlwrapper/surname/import_papers_from_ads.py>.
  
  The bot runs on Toolforge on the 3rd day every month with a cronjob in the 
job framework on Kubernetes. The code is here 
<https://github.com/outreachy-wasian/wasian/blob/main/wasian/toolforge/kubernetes/jobs/jobs.yaml>.
  
  If you find errors or exceptions, start over it with a regular job here 
<https://github.com/outreachy-wasian/wasian/blob/main/wasian/toolforge/kubernetes/jobs/jobs-test.yaml>
 on Toolforge.
  
  If you find interested in this project, feel free to leave your developer 
account name here. One of our maintainers on wasian 
<https://toolsadmin.wikimedia.org/tools/id/wasian> will add you to the 
maintainer list.
  
  The way reaches out to the bot account is here 
<https://www.wikidata.org/wiki/User:ADSEnglishBot>.
  
  If you have any questions, please leave a message here or to @Mike_Peel

TASK DETAIL
  https://phabricator.wikimedia.org/T315660

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Feliciss
Cc: Pigsonthewing, Feliciss, Aklapper, Mike_Peel, Astuthiodit_1, karapayneWMDE, 
Invadibot, maantietaja, ItamarWMDE, Karma2902, Akuckartz, Nandana, Lahi, Gq86, 
contraexemplo, GoranSMilovanovic, QZanden, Xephyr826, Miriya52, srishakatux, 
merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331, Jay8g
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to