On Thu, Oct 23, 2008 at 4:06 PM, Andre Engels <andreengels(a)gmail.com> wrote:
On Thu, Oct 23, 2008 at 3:40 PM, Rula Sayaf <rula.sayaf(a)gmail.com> wrote:
I would like to ask for a specific kind of download, if possible ...
Let me start by describing what exactly I'm working on. I am building a
specifically structured semantic network of words in Wikipedia
biographies. The network will store the relation between a given
occupation and the words related to that occupation (according to what
is mentioned in the articles in the corpus). It would therefore be very
efficient to have the biographies tagged with the profession/occupation.
From what I have noticed, the biographies are ordered either by
occupation or by name. So is it possible for me to get the biographies
ordered by occupation?
For this purpose you can use the categories. For example, if you want
only the biographies of pediatricians, you go to
http://en.wikipedia.org/wiki/Category%3APediatricians, and download
all pages listed there, as well as all pages in the subcategories
listed there (and in the subcategories of those subcategories, and so
on).
I guess you would want to start here:
and then go deeper and deeper. I don't know whether there is a handy
tool to download all pages in a given category.
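In the absence of a ready-made tool, the category walk described above
could be sketched roughly as follows. This is only a minimal sketch and
not an established tool: it assumes the MediaWiki web API at
en.wikipedia.org/w/api.php with its list=categorymembers query, and the
function names, the User-Agent string and the max_depth cut-off are
made up for illustration. The depth limit is there because category
trees can be very deep and may loop back on themselves.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint of the MediaWiki web API for the English Wikipedia.
API_URL = "https://en.wikipedia.org/w/api.php"


def fetch_json(params):
    """Send one GET request to the API and decode the JSON reply."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # The User-Agent value is a placeholder; use one that identifies you.
    request = urllib.request.Request(
        url, headers={"User-Agent": "category-walk-sketch/0.1"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def category_members(category, fetch=fetch_json):
    """Yield every direct member of one category, following continuation."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmlimit": "500",
        "format": "json",
    }
    while True:
        data = fetch(params)
        yield from data["query"]["categorymembers"]
        if "continue" not in data:
            break
        # Feed the continuation values back in to get the next batch.
        params = {**params, **data["continue"]}


def pages_in_category_tree(root, fetch=fetch_json, max_depth=3):
    """Return the sorted article titles in root and its subcategories."""
    pages = set()
    seen = {root}           # categories already queued, to avoid loops
    queue = [(root, 0)]     # breadth-first walk: (category, depth)
    while queue:
        category, depth = queue.pop(0)
        for member in category_members(category, fetch):
            title = member["title"]
            if member["ns"] == 14:      # namespace 14 = Category
                if depth < max_depth and title not in seen:
                    seen.add(title)
                    queue.append((title, depth + 1))
            elif member["ns"] == 0:     # namespace 0 = articles
                pages.add(title)
    return sorted(pages)
```

Calling pages_in_category_tree("Category:Pediatricians") would then give
the list of article titles to download; the occupation tag for each
biography is simply the root category the walk started from. Please
keep the number of requests moderate when running something like this
against the live site.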
-
Bence Damokos
--
André Engels, andreengels(a)gmail.com
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l