Re: [Wikisource-l] Proposal
This is an ooold issue: Wikisource does not have proper metadata. I'm one of the people that Nemo Was mentioning: I'm a digital librarian working for MLOL, and in the past 2 years, among other dozens of digital libraries, I've managed to import Wikisource metadata on MLOL in 3 languages (English, French, Italian): http://openmlol.it/Media/Search?Publisher=620 It has been a pain in the ass because I actually had to web scrape the websites, from lists derived from categories. There is no easy way to have metadata about Wikisource books, at the moment. There are prototypes like WS-search, from Sam Wilson: http://tools.wmflabs.org/ws-search/ And I'm sure that Tpt has some scripts to parse Wikisource (also, there was OPDS). The Wikisource community tried somewhat to solve the issue via Wikidata, but discovered another black hole: modelling books on Wikidata it's very tricky, and even after 2 Wikicite conferences (with Tpt and other Wikisource people) I must confess I'm still confused... For personal reasons I've not worked on the matter in the last months, hopefully, will do in the near future. I remember Vigneron was braver than me and recently tried to revive the discussion: https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_Books IF (and it's a big IF) we find a workable solution with Wikidata (theoretical and practical) and IF (and it's a big IF) we find some skilled Wikidata people to help us with customised queries and bots to help with the transition (we need to import Wikisource data into Wikidata, and we need to clear existing Wikidata items following the standard model, and we need to maintain them in the future), then it's doable and we have solved the metadata problem. Unfortunately, I spent most of my adult and professional life complaining about this (also, I really tried), and nothing really changed... ;-) Aubrey On Thu, Oct 12, 2017 at 3:05 PM, Sam Wilsonwrote: > Sorry, Sam, the other thing I meant to say was: that's a brilliant idea! > :-) I'd love to help make it happen, if I can be of any use. :) > > > On Thu, 12 Oct 2017, at 08:51 PM, Sam Wilson wrote: > > It's slightly tricky at the moment to extract data for validated works, > for one because we don't have solid data linking Index pages to their > corresponding main namespace (i.e. "work") pages. The Index pages have the > status, but the mainspace pages are what we think of as the work. There's > P1957 now, which is the connextion we need, but the data for that isn't > complete. > > We've also got (incomplete) support for OPDS in the wsexport tool, which I > think is probably a brilliant way forward for sharing the Wikisource > catalogue with other systems. Once we have better structural support in > Wikisource itself (e.g. structured data for querying validation status) > then we'll much more efficiently and easily be able to produce all sorts of > output for sharing. > > > On Thu, 12 Oct 2017, at 08:37 PM, Sam Walton wrote: > > Hi all, > > I work on the Wikipedia Library program, and wanted to jump in with a > passing thought I'd had about Wikisource and TWL. We'll be building search > and discovery tools into the library card platform ( > https://wikipedialibrary.wmflabs.org/) that's currently under ongoing > development. They'll index all the usually-paywalled resources we have > access to, but also open access content. As part of that process it's a > desire of mine to index completed Wikisource works, though I haven't given > it much thought beyond 'that would be nice'. This might be able to function > as a kind of centralised search for all completed Wikisource works, if > implemented. > > If you're interested, the relevant Phab task is > https://phabricator.wikimedia.org/T169875, where your thoughts are very > welcome. It won't be worked on for a while and I can't guarantee that it > will definitely happen, but if it's something the Wikisource community > would benefit from, then that would absolutely increase the likelihood > we'll work on it. > > Best, > Sam > > On 12 October 2017 at 13:07, Federico Leva (Nemo) > wrote: > > Gerard Meijssen, 12/10/2017 15:04: > > Given the discussion about finished books on the Korean Wikisource, I this > demonstrates that we really need to advertise the finished books to a > reading public. > > > In Italy, after many years of talk with local libraries, the Wikisource > books are included in the catalogs of many libraries (also via a local > ebook provider, MLOL, who hired some wikimedians to work on the "open > collection", big kudos to them). > > Nemo > > ___ > Wikisource-l mailing list > Wikisource-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikisource-l > > > > > -- > Sam Walton > Partnerships Coordinator > The Wikipedia Library > > s...@wikipedialibrary.org / swal...@wikimedia.org > *___* > Wikisource-l mailing
Re: [Wikisource-l] Proposal
Sorry, Sam, the other thing I meant to say was: that's a brilliant idea! :-) I'd love to help make it happen, if I can be of any use. :) On Thu, 12 Oct 2017, at 08:51 PM, Sam Wilson wrote: > It's slightly tricky at the moment to extract data for validated > works, for one because we don't have solid data linking Index pages to > their corresponding main namespace (i.e. "work") pages. The Index > pages have the status, but the mainspace pages are what we think of as > the work. There's P1957 now, which is the connextion we need, but the > data for that isn't complete.> > We've also got (incomplete) support for OPDS in the wsexport tool, > which I think is probably a brilliant way forward for sharing the > Wikisource catalogue with other systems. Once we have better > structural support in Wikisource itself (e.g. structured data for > querying validation status) then we'll much more efficiently and > easily be able to produce all sorts of output for sharing.> > > On Thu, 12 Oct 2017, at 08:37 PM, Sam Walton wrote: >> Hi all, >> >> I work on the Wikipedia Library program, and wanted to jump in with a >> passing thought I'd had about Wikisource and TWL. We'll be building >> search and discovery tools into the library card platform >> (https://wikipedialibrary.wmflabs.org/) that's currently under >> ongoing development. They'll index all the usually-paywalled >> resources we have access to, but also open access content. As part of >> that process it's a desire of mine to index completed Wikisource >> works, though I haven't given it much thought beyond 'that would be >> nice'. This might be able to function as a kind of centralised search >> for all completed Wikisource works, if implemented.>> >> If you're interested, the relevant Phab task is >> https://phabricator.wikimedia.org/T169875, where your thoughts are >> very welcome. It won't be worked on for a while and I can't guarantee >> that it will definitely happen, but if it's something the Wikisource >> community would benefit from, then that would absolutely increase the >> likelihood we'll work on it.>> >> Best, >> Sam >> >> On 12 October 2017 at 13:07, Federico Leva (Nemo) >>wrote:>>> Gerard Meijssen, 12/10/2017 15:04: Given the discussion about finished books on the Korean Wikisource, I this demonstrates that we really need to advertise the finished books to a reading public.>>> >>> In Italy, after many years of talk with local libraries, the >>> Wikisource books are included in the catalogs of many libraries >>> (also via a local ebook provider, MLOL, who hired some wikimedians >>> to work on the "open collection", big kudos to them).>>> >>> Nemo >>> >>> ___ >>> Wikisource-l mailing list >>> Wikisource-l@lists.wikimedia.org >>> https://lists.wikimedia.org/mailman/listinfo/wikisource-l >> >> >> >> -- >> Sam Walton >> Partnerships Coordinator >> The Wikipedia Library >> >> s...@wikipedialibrary.org / swal...@wikimedia.org >> _ >> Wikisource-l mailing list >> Wikisource-l@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikisource-l > > _ > Wikisource-l mailing list > Wikisource-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikisource-l ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Proposal
It's slightly tricky at the moment to extract data for validated works, for one because we don't have solid data linking Index pages to their corresponding main namespace (i.e. "work") pages. The Index pages have the status, but the mainspace pages are what we think of as the work. There's P1957 now, which is the connextion we need, but the data for that isn't complete. We've also got (incomplete) support for OPDS in the wsexport tool, which I think is probably a brilliant way forward for sharing the Wikisource catalogue with other systems. Once we have better structural support in Wikisource itself (e.g. structured data for querying validation status) then we'll much more efficiently and easily be able to produce all sorts of output for sharing. On Thu, 12 Oct 2017, at 08:37 PM, Sam Walton wrote: > Hi all, > > I work on the Wikipedia Library program, and wanted to jump in with a > passing thought I'd had about Wikisource and TWL. We'll be building > search and discovery tools into the library card platform > (https://wikipedialibrary.wmflabs.org/) that's currently under ongoing > development. They'll index all the usually-paywalled resources we have > access to, but also open access content. As part of that process it's > a desire of mine to index completed Wikisource works, though I haven't > given it much thought beyond 'that would be nice'. This might be able > to function as a kind of centralised search for all completed > Wikisource works, if implemented.> > If you're interested, the relevant Phab task is > https://phabricator.wikimedia.org/T169875, where your thoughts are > very welcome. It won't be worked on for a while and I can't guarantee > that it will definitely happen, but if it's something the Wikisource > community would benefit from, then that would absolutely increase the > likelihood we'll work on it.> > Best, > Sam > > On 12 October 2017 at 13:07, Federico Leva (Nemo) >wrote:>> Gerard Meijssen, 12/10/2017 15:04: >>> Given the discussion about finished books on the Korean Wikisource, >>> I this demonstrates that we really need to advertise the finished >>> books to a reading public.>> >> In Italy, after many years of talk with local libraries, the >> Wikisource books are included in the catalogs of many libraries (also >> via a local ebook provider, MLOL, who hired some wikimedians to work >> on the "open collection", big kudos to them).>> >> Nemo >> >> ___ >> Wikisource-l mailing list >> Wikisource-l@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikisource-l > > > > -- > Sam Walton > Partnerships Coordinator > The Wikipedia Library > > s...@wikipedialibrary.org / swal...@wikimedia.org > _ > Wikisource-l mailing list > Wikisource-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikisource-l ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Proposal
Hoi, In my opinion this is a service not only for editors but for the complete Wikimedia reader public. So the notion of putting it in that platform is great but we need to advertise the finished books of Wikisource to the whole world. Thanks, GerardM On 12 October 2017 at 14:37, Sam Waltonwrote: > Hi all, > > I work on the Wikipedia Library program, and wanted to jump in with a > passing thought I'd had about Wikisource and TWL. We'll be building search > and discovery tools into the library card platform ( > https://wikipedialibrary.wmflabs.org/) that's currently under ongoing > development. They'll index all the usually-paywalled resources we have > access to, but also open access content. As part of that process it's a > desire of mine to index completed Wikisource works, though I haven't given > it much thought beyond 'that would be nice'. This might be able to function > as a kind of centralised search for all completed Wikisource works, if > implemented. > > If you're interested, the relevant Phab task is > https://phabricator.wikimedia.org/T169875, where your thoughts are very > welcome. It won't be worked on for a while and I can't guarantee that it > will definitely happen, but if it's something the Wikisource community > would benefit from, then that would absolutely increase the likelihood > we'll work on it. > > Best, > Sam > > On 12 October 2017 at 13:07, Federico Leva (Nemo) > wrote: > >> Gerard Meijssen, 12/10/2017 15:04: >> >>> Given the discussion about finished books on the Korean Wikisource, I >>> this demonstrates that we really need to advertise the finished books to a >>> reading public. >>> >> >> In Italy, after many years of talk with local libraries, the Wikisource >> books are included in the catalogs of many libraries (also via a local >> ebook provider, MLOL, who hired some wikimedians to work on the "open >> collection", big kudos to them). >> >> Nemo >> >> ___ >> Wikisource-l mailing list >> Wikisource-l@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikisource-l >> > > > > -- > Sam Walton > Partnerships Coordinator > The Wikipedia Library > > s...@wikipedialibrary.org / swal...@wikimedia.org > > ___ > Wikisource-l mailing list > Wikisource-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikisource-l > > ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Proposal
Hi all, I work on the Wikipedia Library program, and wanted to jump in with a passing thought I'd had about Wikisource and TWL. We'll be building search and discovery tools into the library card platform ( https://wikipedialibrary.wmflabs.org/) that's currently under ongoing development. They'll index all the usually-paywalled resources we have access to, but also open access content. As part of that process it's a desire of mine to index completed Wikisource works, though I haven't given it much thought beyond 'that would be nice'. This might be able to function as a kind of centralised search for all completed Wikisource works, if implemented. If you're interested, the relevant Phab task is https://phabricator. wikimedia.org/T169875, where your thoughts are very welcome. It won't be worked on for a while and I can't guarantee that it will definitely happen, but if it's something the Wikisource community would benefit from, then that would absolutely increase the likelihood we'll work on it. Best, Sam On 12 October 2017 at 13:07, Federico Leva (Nemo)wrote: > Gerard Meijssen, 12/10/2017 15:04: > >> Given the discussion about finished books on the Korean Wikisource, I >> this demonstrates that we really need to advertise the finished books to a >> reading public. >> > > In Italy, after many years of talk with local libraries, the Wikisource > books are included in the catalogs of many libraries (also via a local > ebook provider, MLOL, who hired some wikimedians to work on the "open > collection", big kudos to them). > > Nemo > > ___ > Wikisource-l mailing list > Wikisource-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikisource-l > -- Sam Walton Partnerships Coordinator The Wikipedia Library s...@wikipedialibrary.org / swal...@wikimedia.org ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Proposal
Gerard Meijssen, 12/10/2017 15:04: Given the discussion about finished books on the Korean Wikisource, I this demonstrates that we really need to advertise the finished books to a reading public. In Italy, after many years of talk with local libraries, the Wikisource books are included in the catalogs of many libraries (also via a local ebook provider, MLOL, who hired some wikimedians to work on the "open collection", big kudos to them). Nemo ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
[Wikisource-l] Proposal
Hoi, Given the discussion about finished books on the Korean Wikisource, I this demonstrates that we really need to advertise the finished books to a reading public. After all what is it that we do it for but for finding a public for the transcribed books? Thanks, GerardM http://ultimategerardm.blogspot.nl/2017/10/wikisource-proof-of-pudding.html ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
[Wikisource-l] Proposal to move the Oldwikisource projects to the Incubator
Someone (not me) has proposed to move the language projects hosted in Oldwikisource to the Incubator https://meta.wikimedia.org/wiki/Meta:Babel#Move_betawikiversity_and_oldwikisource_to_Incubator I'm not sure if is a good idea or if it is technically feasible at all, but you can comment it on the link. Thanks! Micru ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Proposal to move the Oldwikisource projects to the Incubator
I posted there a very bold idea - to assign to oldwikisourcea new and inspiring nick name: Common wikisource. :-) Alex 2013/6/24 David Cuenca dacu...@gmail.com Someone (not me) has proposed to move the language projects hosted in Oldwikisource to the Incubator https://meta.wikimedia.org/wiki/Meta:Babel#Move_betawikiversity_and_oldwikisource_to_Incubator I'm not sure if is a good idea or if it is technically feasible at all, but you can comment it on the link. Thanks! Micru ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Proposal: two desirable ParserFunctions
2010/12/14 Federico Leva (Nemo) nemow...@gmail.com Alex Brollo, 14/12/2010 09:10: 1. I'd like to have a {{nbsp:string}} function, to replace all spaces with nbsp; html entities into string. It would very useful to format table cell content where spaces must be avoided, while saving the original string. There's a class for this: http://en.wikipedia.org/wiki/Template:Nowrap 2. I'd like to generalize {{#titleparts:string}} transforming it into a general str.split() function something like {{split:string|char}} where string is the string to split, and char is the splitting character). I use a lot #titleparts as a string-splitter, and I can manage difficult issues with ease (i.e.: how to pass multiple values into a single parameter of a template) but sometimes I'd like to use a character different from / (i.e. splitting a list of page/subpage names). This should be http://www.mediawiki.org/wiki/Extension:StringFunctions , which will never be implemented in Wikimedia projects. 1. Thank you Nemo!! Problem solved. 2. There are lots of string functions already implemented - lc, uc, lcfirs, ucfirst, #titleparts... PAGENAME, FULLPAGENAME, BASEPAGENAME... - today. IMHO a generalized *split* function would be great to extend just a little bit the field of #titleparts to implement an efficient, simple way to manage simple arrays. StringFunctions could fire fantasy - *split* function would only increase efficiency to solve common issues. Alex ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] proposal to extend the ref tag syntax
For the moment it is only a proposal. Since the Cite extension is used at most wikimedia wikis, it has to be discussed with the developers. This is why I posted the original message to wikitech-l ; wikisource-l was only in cc. I realize now that it was not a good idea to send the same message to wikisource-l, because users of this ML do not know that there is a discussion at wikitech-l Thomas ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
[Wikisource-l] proposal to extend the ref tag syntax
Sorry, I forgot a topic ; reposting the previous message: I would like to extend the syntax of the ref tag (Cite extension), in order to deal with footnotes that are spread on several transcluded pages. Since the Cite extension is widely used, I guess I better ask here first. Here is an illustration of the problem : http://en.wikisource.org/wiki/Page:Robert_the_Bruce_and_the_struggle_for_Scottish_independence_-_1909.djvu/49 On the bottom of the scan you can see the second half of a footnote. That footnote begins at the previous page : http://en.wikisource.org/wiki/Page:Robert_the_Bruce_and_the_struggle_for_Scottish_independence_-_1909.djvu/48 Wikisourcers currently have no way to deal with these cases in a clean way. I have written a patch for this (the code is here : http://dpaste.org/QOMH/ ). This patch extends the ref syntax by adding a follow parameter, like this : ref follow=foobar/ref After two pages are transcluded, the wikitext passed to the parser will look like this : blah blah blah blah blah blahref name=note1beginning of note 1/ref blah blah blah blah blah blah blah blah blahref follow=note1end of note/ref blah blah blah This wikitext is rendered as a single footnote, located in the text at the position of the parent ref. If the parent ref is not found (as is the case when you render only the second page), then the text inside the tag is rendered at the beginning of the list of references, with no number and no link. does this make sense ? Thomas ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l