2010/6/13 杨杰 <xtyangjie(a)gmail.com>om>:
Hi, everyone,
Hello Yang Jie,
I want to do some experiments on classification using web pages of
wikipedia. Now that I have got the web page archive, the experiment
needs the following category information:
1. what is the category (or categories) of a web page (an article)?
eg. once I can get the two tips, the information is enough.
a. Web page P1 belongs to category C1;
b. Category C1 is under two parent categories CC1 and CC2, while
the two categories own their parent category chains seperately.
Then I can build a tree, which leaves are the web pages.
2. how do guys in wikipedia deal with the category work upon the huge
amount of articles, for example, category method, level or inheritance
between categories.
Could you give me some adivces or URLs to find them ?
The best URL to know how Wikipedia users use the categories is [1]
I'm not sure I understand your questions well so, don't hesitate to
ask more precise questions once you have read [1]
[1]
http://en.wikipedia.org/wiki/Wikipedia:Categories
Xi’an Jiaotong University
Hey! I've been there!
once i didn't know software is not free, but found it days later; now
i realize that it's indeed free.
Yes, it's free!
Yours sincerely
--
Peter Potrowl
http://www.mediawiki.org/wiki/User:Peter17