Hello everyone. I'm happy to tell you that, right now, the Italian Wikisource has been indexed by MediaLibrary Online, a digital library platform used by over 4000 libraries in Italy: http://www.medialibrary.it/media/ricerca.aspx?seltip=310&selarg=-1&k...
As an employee of MediaLibrary Online, I worked in these months to get all the metadata and index the texts. A bit of perspective
* All the texts have been either proofread or validated. They are not all whole books: we indexed the texts directly in namespace 0 (and not in the Index: namespace), in order to provide the user a better search experience and findability. This, in our opinion, is very important: we used the MediaWiki API to retrieve all the data, and some HTML scraping for the rest :-) * We link directly the EPUB generated by the awesome tool from Tpt, but also to the page in Wikisource. * We automatically generated the EPUB covers for every text which didn't have one. Ex: http://www.medialibrary.it/media/scheda.aspx?id=850275912
MediaLibraryOnline is a digital platform that provide Italian libraries with the possibility of lend digital resource, as ebook or audiobooks. It's not just "a portal on the Internet", but a service used and managed by single libraries for their uses. It also has an "Open" collection, freely accessible and downloadable for everyone (and not only the patrons of the libraries which have access to MediaLibrary).
I've been hired few months ago to develop such collection, and this is a major milestone for us (and for me :-). I'm expecially excited by the fact that now Wikisource ebooks "enter" in the collection of libraries, and often in their very catalog. I think it is a very good step forward for our project, and I'm eager to replicate this project with other Wikisources as well :-)
If needed, I can explain some details.
Cheers,
Aubrey
Kudos !
Did you documented this with further details somewhere ? (a patern on meta for instance).
Cdlt, ~nicolas
Nope :-)
I plan on publish the (small) scripts I used on GithHub, but, the Learning Pattern is a good idea. Not sure how useful, though, because it really depends on the "receiving" library/service, and also on the Wikisource you are using. Everyone is different and the "works" (the texts in namespace 0) have different categories... But thanks for the suggestion.
Aubrey
On Thu, Apr 16, 2015 at 6:04 PM, Nicolas VIGNERON < vigneron.nicolas@gmail.com> wrote:
Kudos !
Did you documented this with further details somewhere ? (a patern on meta for instance).
Cdlt, ~nicolas
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Yeah, this looks like a brilliant project. Well done!
I agree, the variety (to put it nicely) of categorisation for works' pages in NS0 makes for interesting working, with automated tools. I've been attempting to do various things with enws validated works for a while... :)
Do you know much about OPDS? http://opds-spec.org/about/ It seems, on the surface, a good way to expose WS works to the outside world. I've wondered about implementing it (limited, as you say, to validated and proofread works only). Would that sort of thing work nicely with your library systems?
sam.
On Fri, April 17, 2015 12:13 am, Andrea Zanni wrote:
Nope :-)
I plan on publish the (small) scripts I used on GithHub, but, the Learning Pattern is a good idea. Not sure how useful, though, because it really depends on the "receiving" library/service, and also on the Wikisource you are using. Everyone is different and the "works" (the texts in namespace 0) have different categories... But thanks for the suggestion.
Aubrey
On Thu, Apr 16, 2015 at 6:04 PM, Nicolas VIGNERON < vigneron.nicolas@gmail.com> wrote:
Kudos !
Did you documented this with further details somewhere ? (a patern on meta for instance).
Cdlt, ~nicolas
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
On 17.04.2015 02:07, Sam Wilson wrote:
Do you know much about OPDS? http://opds-spec.org/about/ It seems, on the surface, a good way to expose WS works to the outside world. I've wondered about implementing it (limited, as you say, to validated and proofread works only). Would that sort of thing work nicely with your library systems?
Already done, at least for WSFR: http://wsexport.wmflabs.org/opds/wikisource-fr-good.atom
Emmanuel
Sorry I didn't respond earlier. I don't know anything regarding OPDS, studying it right now.
I also plan to use the scripts directly for EN, FR, and ES Wikisource, asap.
@emmanuel do you have some insights or even scripts to share? Are you using OPDS in some way?
Aubrey
On Fri, Apr 17, 2015 at 6:00 PM, Emmanuel Engelhart kelson@kiwix.org wrote:
On 17.04.2015 02:07, Sam Wilson wrote:
Do you know much about OPDS? http://opds-spec.org/about/ It seems, on the surface, a good way to expose WS works to the outside world. I've wondered about implementing it (limited, as you say, to validated and proofread works only). Would that sort of thing work nicely with your library systems?
Already done, at least for WSFR: http://wsexport.wmflabs.org/opds/wikisource-fr-good.atom
Emmanuel
Kiwix - Wikipedia Offline & more
- Web: http://www.kiwix.org
- Twitter: https://twitter.com/KiwixOffline
- more: http://www.kiwix.org/wiki/Communication
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Hi Andrea
On 05.05.2015 17:40, Andrea Zanni wrote:
Sorry I didn't respond earlier. I don't know anything regarding OPDS, studying it right now.
I also plan to use the scripts directly for EN, FR, and ES Wikisource, asap.
@emmanuel do you have some insights or even scripts to share? Are you using OPDS in some way?
For now, the answer is "no" but: * We have behind us the scrapping/re-rendering of the Gutenberg project https://github.com/kiwix/gutenberg * We want to continue that effort by adding new sources, like Wikisource * I have told TPT that we were interested to include Wikisource and because he has already all the scripts to generate good ebooks, I want to base our scraper on his work through OPDS * We are currently continuing the project through the "Projet Césaire", focused on French ebooks, so the WSFR OPDS feed will be used by our scripts later this autumn.
Emmanuel
wikisource-l@lists.wikimedia.org