Hello M.Srilasya,
The XML data dumps of all the Wikipedias are free to download and use as
per the licensing discussed here <https://dumps.wikimedia.org/legal.html>.
So you can just download anything you'd like from the website here:
https://dumps.wikimedia.org/backup-index.html.
If you let me know a specific language you're interested in, I can point
you to the exact download link. But since you asked for a smaller download,
let me offer simplewiki, which is a smaller English wiki that uses
"Simplified English'', yet it is big enough to be interesting to do proof
of concepts with:
All pages with complete page edit history (.bz2)
- simplewiki-20240201-pages-meta-history.xml.bz2
<https://dumps.wikimedia.org/simplewiki/20240201/simplewiki-20240201-pages-meta-history.xml.bz2>
2.9
GB
-
All pages, current versions only.
- simplewiki-20240201-pages-meta-current.xml.bz2
<https://dumps.wikimedia.org/simplewiki/20240201/simplewiki-20240201-pages-meta-current.xml.bz2>
356.7
MB
On Thu, Feb 22, 2024 at 1:10 AM 21131A0564 MANCHUKONDA SRILASYA <
21131a0564(a)gvpce.ac.in> wrote:
Dear xmldatadumps owner,
I'm a student working on a search engine project for which i
need the xml data dumps. i do not have excess storage capabilities. so, I
just need a small xml data dump. so that I can use it for my project.
I will make sure that I will not misuse the data provided by
you. please consider my request.
Yours obediently,
M.Srilasya
--
Xabriel J. Collazo Mojica (he/him, pronunciation
<https://commons.wikimedia.org/wiki/File:Xabriel_Collazo_Mojica_-_pronunciation.ogg>
)
Sr Software Engineer
Wikimedia Foundation