Hi David,

This question may be better suited to the legal team, as they maintain the source repository for the license list. However, I can answer it since I maintain the tools that produce the JSON data (and I'm on both lists). Sorry I didn't reply sooner; I was traveling when your original email was sent. I'll add the legal team to the distribution list as well.

> When comparing the licenseText and the licenseTextHtml I found a content difference beyond formatting/editorial remarks; the screenshot below shows it. Could it be that the HTML-text and the other text are not in sync? I assume the licenseText is considered as the original/main, and the HTML manually augmented with manual formatting and remarks.

It is entirely possible that the licenseText and licenseTextHtml are out of sync. However, we do verify that the text matches per the license matching guidelines <https://spdx.github.io/spdx-spec/v2.3/license-matching-guidelines-and-templates/> before publishing.

More detail:

The licenseText is intended to be the original text, including any text formatting. The source of the licenseText is the text file in the test/simpleTestForGenerator/ directory. The licenseTextHtml is generated from the XML file. Since the license XML file is generated from a number of different sources and tools, it is entirely possible that there will be some inconsistencies. However, in most cases the submitter of the license XML copies and pastes much of the information from the licenseText, which reduces the probability of inconsistencies.

The above is documented in the Accepted License Process <https://github.com/spdx/license-list-XML/blob/main/DOCS/request-new-license.md#accepted-license-process>.

When a new license is submitted, and before any data is published, an automated check is made to make sure the licenseText and the text associated with the licenseTextHtml match per the license matching guidelines.
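
To make the idea of such a comparison concrete, here is a minimal Python sketch. It is not the actual SPDX publishing tool and does not implement the full license matching guidelines. As a simplifying assumption, it only strips HTML tags, lowercases the text, and compares the sequence of words, ignoring whitespace and punctuation. The helper names (`html_to_text`, `normalize`, `word_differences`, `load_license_details`) are hypothetical; only the `licenseText` and `licenseTextHtml` field names and the JSON URL come from this thread. Editorial remarks or other annotations carried only in the HTML would be reported as differences by this sketch.

```python
import difflib
import json
import re
import urllib.request
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text content of an HTML fragment, dropping the tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # Treat each tag boundary as whitespace so adjacent words stay separate.
        self.parts.append(" ")

    def handle_endtag(self, tag):
        self.parts.append(" ")

    def handle_data(self, data):
        self.parts.append(data)


def html_to_text(html):
    """Return the plain text contained in an HTML fragment."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


def normalize(text):
    """Simplified assumption: compare lowercase words only."""
    return re.findall(r"\w+", text.lower())


def word_differences(license_text, license_text_html):
    """List (operation, words_in_text, words_in_html) for each mismatch."""
    a = normalize(license_text)
    b = normalize(html_to_text(license_text_html))
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    return [
        (tag, " ".join(a[i1:i2]), " ".join(b[j1:j2]))
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


def load_license_details(url="https://spdx.org/licenses/PSF-2.0.json"):
    """Download and parse one license details file (requires network access)."""
    with urllib.request.urlopen(url) as response:
        return json.load(response)
```

To try it on the license discussed here, call `details = load_license_details()` and then `word_differences(details["licenseText"], details["licenseTextHtml"])`. An empty list means the two fields contain the same words in the same order under this loose normalization; it does not prove a match under the official guidelines.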

If you find a situation where a published licenseText doesn't match the corresponding licenseTextHtml per the license matching guidelines, there would either be an issue with the source or a bug in the publishing tool. In that case, I would suggest submitting a new issue in the license list XML repository <https://github.com/spdx/license-list-XML/issues>.

Hope that helps.

Best regards,
Gary

From: Spdx-tech@lists.spdx.org <Spdx-tech@lists.spdx.org> On Behalf Of david.schumm via lists.spdx.org
Sent: Sunday, July 9, 2023 10:58 PM
To: spdx-tech@lists.spdx.org
Cc: christian.w...@mercedes-benz.com; ciaran.farr...@mercedes-benz.com
Subject: Re: [spdx-tech] Question on difference in License Text HTML vs. JSON of Python Software Foundation License 2.0 (PSF-2.0)

Dear SPDX Technical Team,

I wanted to ask whether you have already had time to look into the issue mentioned below, and whether the SPDX Technical Team is the right addressee or I should instead contact the SPDX Legal Team.

Many thanks,
David Schumm

From: Schumm, David (096)
Sent: Wednesday, June 28, 2023 11:31
To: 'spdx-tech@lists.spdx.org' <spdx-tech@lists.spdx.org>
Cc: Wege, Christian (096) <christian.w...@mercedes-benz.com>; Farrell, Ciaran (096) <ciaran.farr...@mercedes-benz.com>
Subject: Question on difference in License Text HTML vs. JSON of Python Software Foundation License 2.0 (PSF-2.0)

Dear SPDX Technical Team,

I have a question on the license data provided in JSON ("licenseText") compared to the data provided in HTML ("licenseTextHtml"). I was not sure whether this is a question for the SPDX Technical Team or for the SPDX Legal Team.

My question refers to the license text for the "Python Software Foundation License 2.0 (PSF-2.0)".

* HTML Display of Python Software Foundation License 2.0 (PSF-2.0): https://spdx.org/licenses/PSF-2.0.html
* JSON Source of Python Software Foundation License 2.0 (PSF-2.0) "licenseText": https://raw.githubusercontent.com/spdx/license-list-data/master/json/details/PSF-2.0.json (or https://spdx.org/licenses/PSF-2.0.json)

When comparing the licenseText and the licenseTextHtml I found a content difference beyond formatting/editorial remarks; the screenshot below shows it. Could it be that the HTML-text and the other text are not in sync? I assume the licenseText is considered as the original/main, and the HTML manually augmented with manual formatting and remarks.

The differences between licenseText and licenseTextHtml: [screenshot not preserved]

Thanks for looking into this, and best regards,

David Schumm
ITT/FA - FOSS Governance
mailto: david.sch...@mercedes-benz.com
mobile: +49 160 866 0365

View/Reply Online (#5230): https://lists.spdx.org/g/Spdx-tech/message/5230