Dear Sir/Madame
I would like to ask for a specific kind of download if possible ... Let me first start mentioning what exactly I'm working on. I am building a specific-structured semantic network of words in biographies in Wikipedia. The network will save the relation between a certain occupation and words related to this occupation (according to what is mentioned in the articles in the corpus). Hence it would be so efficient to have the biographies with tagging of the profession/occupation. From what I noticed the biographies are ordered according to the occupation or to the name. So is it possible for me to have the biographies ordered according to occupation?
Last but not least, I am working on this project for my thesis in master of Artificial Intelligence in Katholieke Universiteit Leuven - Belgium.
Any help will be highly appreciated Thanks and Best Regards Rula
On Thu, Oct 23, 2008 at 3:40 PM, Rula Sayaf rula.sayaf@gmail.com wrote:
I would like to ask for a specific kind of download if possible ... Let me first start mentioning what exactly I'm working on. I am building a specific-structured semantic network of words in biographies in Wikipedia. The network will save the relation between a certain occupation and words related to this occupation (according to what is mentioned in the articles in the corpus). Hence it would be so efficient to have the biographies with tagging of the profession/occupation. From what I noticed the biographies are ordered according to the occupation or to the name. So is it possible for me to have the biographies ordered according to occupation?
For this purpose you can use the categories. For example, if you want to have biographies of pediatricians only, you go to http://en.wikipedia.org/wiki/Category%3APediatricians, and download all pages mentioned there, as well as all pages in the subcategories mentioned there (and in the subcategories of those subcategories etcetera).
On Thu, Oct 23, 2008 at 4:06 PM, Andre Engels andreengels@gmail.com wrote:
On Thu, Oct 23, 2008 at 3:40 PM, Rula Sayaf rula.sayaf@gmail.com wrote:
I would like to ask for a specific kind of download if possible ... Let me first start mentioning what exactly I'm working on. I am building
a
specific-structured semantic network of words in biographies in
Wikipedia.
The network will save the relation between a certain occupation and words related to this occupation (according to what is mentioned in the
articles
in the corpus). Hence it would be so efficient to have the biographies
with
tagging of the profession/occupation. From what I noticed the biographies are ordered according to the occupation or to the name. So is it possible for me to have the biographies ordered according to occupation?
For this purpose you can use the categories. For example, if you want to have biographies of pediatricians only, you go to http://en.wikipedia.org/wiki/Category%3APediatricians, and download all pages mentioned there, as well as all pages in the subcategories mentioned there (and in the subcategories of those subcategories etcetera).
I guess you would want to start here:
http://en.wikipedia.org/wiki/Category:People_by_occupation and then go deeper and deeper. I don't know, if there is a handy tool to download all pages in a given category.
- Bence Damokos
-- André Engels, andreengels@gmail.com _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
I guess you would want to start here:
http://en.wikipedia.org/wiki/Category:People_by_occupation and then go deeper and deeper. I don't know, if there is a handy tool to download all pages in a given category.
Special:Export can process pages by category, but does not automatically descend into subcategories.
-- daniel
wikitech-l@lists.wikimedia.org