Dear list,
We have included parts of Wikibooks (the languages with the most content) in our academic search engine.
The pages are generated by taking the title, URL and abstract from the Wikibooks abstract.xml dump.
This is combined with pages-articles.xml to build metadata pages for indexing.
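
To make the extraction step concrete, here is a minimal sketch in Python. It assumes the abstract dump is a flat list of <doc> elements, each with <title>, <url> and <abstract> children; those element names, the function name iter_abstract_records and the sample record are assumptions for illustration, not our production code.

```python
import io
import xml.etree.ElementTree as ET


def iter_abstract_records(source):
    """Yield one dict per <doc> element with its title, url and abstract.

    `source` is a file name or a binary file object. The element names
    (doc, title, url, abstract) are an assumption about the dump layout.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "doc":
            continue
        yield {
            "title": (elem.findtext("title") or "").strip(),
            "url": (elem.findtext("url") or "").strip(),
            "abstract": (elem.findtext("abstract") or "").strip(),
        }
        elem.clear()  # drop the element's children and text once consumed


# Tiny invented sample standing in for a real dump file.
SAMPLE = b"""<feed>
<doc>
<title>Wikibooks: Example Book</title>
<url>https://en.wikibooks.org/wiki/Example_Book</url>
<abstract>An invented abstract used only for this sketch.</abstract>
</doc>
</feed>"""

records = list(iter_abstract_records(io.BytesIO(SAMPLE)))
```

For a compressed dump, the same function could be given the file object returned by gzip.open(path, "rb") from the standard library, so the file is streamed rather than loaded whole.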
Unfortunately, generation of wikibooks-latest-abstract.xml.gz was discontinued last year.
We currently have 389,429 records indexed across 24 languages, but they are now over a year old.
Is there any replacement for the "Page abstract for Yahoo" dumps?
If not, is there an easy way to fetch or generate this data?
Kind regards
Bernd
--
*************************************************************
Bernd Fehling                    Bielefeld University Library
Dipl.-Inform. (FH)                LibTec - Library Technology
Universitätsstr. 25                  and Knowledge Management
33615 Bielefeld
Tel. +49 521 106-4060       bernd.fehling(at)uni-bielefeld.de
https://www.ub.uni-bielefeld.de/~befehl/
BASE - Bielefeld Academic Search Engine - www.base-search.net
*************************************************************