Hello M.Srilasya, The XML data dumps of all the Wikipedias are free to download and use as per the licensing discussed here <https://dumps.wikimedia.org/legal.html>.
So you can just download anything you'd like from the website here: https://dumps.wikimedia.org/backup-index.html. If you let me know a specific language you're interested in, I can point you to the exact download link. But since you asked for a smaller download, let me offer simplewiki, which is a smaller English wiki that uses "Simplified English'', yet it is big enough to be interesting to do proof of concepts with: All pages with complete page edit history (.bz2) - simplewiki-20240201-pages-meta-history.xml.bz2 <https://dumps.wikimedia.org/simplewiki/20240201/simplewiki-20240201-pages-meta-history.xml.bz2> 2.9 GB - All pages, current versions only. - simplewiki-20240201-pages-meta-current.xml.bz2 <https://dumps.wikimedia.org/simplewiki/20240201/simplewiki-20240201-pages-meta-current.xml.bz2> 356.7 MB On Thu, Feb 22, 2024 at 1:10 AM 21131A0564 MANCHUKONDA SRILASYA < 21131a0...@gvpce.ac.in> wrote: > Dear xmldatadumps owner, > I'm a student working on a search engine project for which i > need the xml data dumps. i do not have excess storage capabilities. so, I > just need a small xml data dump. so that I can use it for my project. > I will make sure that I will not misuse the data provided by > you. please consider my request. > > Yours obediently, > M.Srilasya > -- Xabriel J. Collazo Mojica (he/him, pronunciation <https://commons.wikimedia.org/wiki/File:Xabriel_Collazo_Mojica_-_pronunciation.ogg> ) Sr Software Engineer Wikimedia Foundation
_______________________________________________ Xmldatadumps-l mailing list -- xmldatadumps-l@lists.wikimedia.org To unsubscribe send an email to xmldatadumps-l-le...@lists.wikimedia.org