<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

<HTML><HEAD>

<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">

<META content="MSHTML 6.00.2800.1400" name=GENERATOR>

<STYLE></STYLE>

</HEAD>

<BODY>

<DIV><FONT face=Arial size=2>1. The list</FONT></DIV>

<DIV>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><FONT face=Arial 

size=2>Wikipedia, other encyclopedias, dictionaries, etc. as we know it, have 

the following in common: they are a list of usually single words (entry words, 

items, index words, etc.) associated with longer texts (articles) for which 

connection (whether explicit or implicit) the whole collection is sought. The 

way the number of items in the list is growing is either by inclusion of a new 

item that an author knows enough, or by the inclusion of a new word as a stub 

that a curious reader is not knowledgeable enough and wants to know more of. 

Each entry word first is found somewhere out there (in the context) and is then 

decontextualised, lemmatised, etc.) to be included in that list. If there are 

homonyms, the original context of each occurence is partially reconstructed and 

marked by disambiguation. </FONT></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><FONT face=Arial size=2>Context, 

however, should be considered to introduce additional relevant points (sometimes 

the originator for example), and if you go on without exactly specifying the 

relations that a given context reflects, you should have a lot more contexts 

shown in a structured fasion than normally found in dictionaries and/or in 

encyclopedias (compare: categories, senses, semantic web, etc.).</FONT></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><FONT face=Arial size=2>Given 

that wikipedia and its sister projects are word-centered devices you can hardly 

<SPAN style="mso-spacerun: yes">&nbsp;</SPAN>go beyond basic grammar terms and 

the alphabet used a) to bring an entry word to a common form (noun, single 

word), and b) to sort them according to a single („senseless”) criterion that 

keeps resulting in an alphabetic index only to rely on in searching. </FONT></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><SPAN 

style="mso-spacerun: yes"><FONT face=Arial size=2>&nbsp; </FONT></SPAN></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><FONT face=Arial size=2>2. The 

text/article</FONT></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><?xml:namespace prefix = o ns = 

"urn:schemas-microsoft-com:office:office" /><o:p><FONT face=Arial 

size=2>&nbsp;</FONT></o:p></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><FONT face=Arial size=2>Since on 

learning about the world and/or words all of us progress from known items toward 

unknown items by establishing various new connections and sorting the input 

individually, it may be desirable to include a ring of pointers that a) take you 

from scratch (trivia, dictionary entry) to the latest advances connected to an 

entry word, b) allow you to check for covering the complete process of acquiring 

knowledge in a given subject in succession. And context therefore 

should&nbsp;not be dispersed aroud the free-floating text. In other words, 

knowledge associated with an item should be graded and presented accordingly. 

Using a hypertext structure is fine, but not using <SPAN 

style="mso-spacerun: yes">&nbsp;</SPAN>other text processing tools such as 

concordance (KWIC/KWOC) programs to recreate and/or remodel textual information 

is not quite comprehensible with so many computers at hand.</FONT></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><o:p><FONT face=Arial 

size=2>&nbsp;</FONT></o:p></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><o:p><FONT face=Arial 

size=2>&nbsp;apogr</FONT></o:p></P>

<P class=MsoNormal style="MARGIN: 0cm 0cm 0pt"><o:p><FONT face=Arial 

size=2></FONT></o:p>&nbsp;</P></DIV>

<DIV><FONT face=Arial size=2>----- Original Message ----- </FONT>

<DIV><FONT face=Arial size=2>From: "Ray Saintonge" &lt;</FONT><A 

href="mailto:saintonge@telus.net"><FONT face=Arial 

size=2>saintonge@telus.net</FONT></A><FONT face=Arial size=2>&gt;</FONT></DIV>

<DIV><FONT face=Arial size=2>To: &lt;</FONT><A 

href="mailto:wikipedia-l@Wikimedia.org"><FONT face=Arial 

size=2>wikipedia-l@Wikimedia.org</FONT></A><FONT face=Arial 

size=2>&gt;</FONT></DIV>

<DIV><FONT face=Arial size=2>Sent: Saturday, June 19, 2004 11:45 PM</FONT></DIV>

<DIV><FONT face=Arial size=2>Subject: Re: [Wikipedia-l] Categories considered 

harmful</FONT></DIV></DIV>

<DIV><FONT face=Arial><BR><FONT size=2></FONT></FONT></DIV><FONT face=Arial 

size=2>&gt; Jakob wrote:<BR>&gt; <BR>&gt; &gt;Categories considered 

harmful<BR>&gt; &gt;<BR>&gt; I generally appreciate your comments, but I would 

never go so far as to <BR>&gt; consider this initiative harmful.<BR>&gt; 

<BR>&gt; &gt;Since Version 1.3 of MediaWiki we have the nice category function. 

In the <BR>&gt; &gt;german wikipedia there is a lot of confusion and struggle on 

how to use <BR>&gt; &gt;categories in the right way. As a student of library 

science I could tell <BR>&gt; &gt;several methods how to classify, index and 

sort things but none of them <BR>&gt; &gt;seems to be applicable easily with the 

current implementation of categories.<BR>&gt; &gt;<BR>&gt; The confusion and 

struggle that you are experiencing on the german <BR>&gt; Wikipedia is being 

faced by all the projects.&nbsp; Each is likely to find <BR>&gt; its own way of 

dealing with the issue, and the solutions are likely to <BR>&gt; show 

considerable variation.&nbsp; That's fine and very wiki.&nbsp; Some will 

<BR>&gt; undoubtedly be discarded at a later stage, as a part of a normal 

<BR>&gt; evolutionary process.&nbsp; We need to avoid being overly critical of 

those <BR>&gt; who experiment with other possibilities.<BR>&gt; <BR>&gt; I also 

believe that effective categorization depends on having a <BR>&gt; properly 

functioning internal search function.&nbsp; Hopefully, the day will <BR>&gt; 

come when our developpers will be able to get past the constant stresses 

<BR>&gt; on the system, and find something that does not depend on Google. 

:-)<BR>&gt; <BR>&gt; &gt;As far as I can tell there are three main reasons for 

Wikipedia's success:<BR>&gt; &gt;<BR>&gt; &gt;1. It's very easy to contribute 

(Wikitax, everybody can edit)<BR>&gt; &gt;2. Every edit is monitored in 

watchlists and list of lasts edits<BR>&gt; &gt;&nbsp;&nbsp; so we can control 

each other<BR>&gt; &gt;3. There is a clear common mission - to create an 

encyclopedia (+NPOV)<BR>&gt; &gt;<BR>&gt; Agreed.<BR>&gt; <BR>&gt; &gt;As far as 

I also can see the category-function contradicts all of them:<BR>&gt; 

&gt;<BR>&gt; To a significant extent, yes.<BR>&gt; <BR>&gt; &gt;1. It's not 

easy. <BR>&gt; &gt;<BR>&gt; &gt;It's not easy to know how to do it in the right 

way because subject <BR>&gt; &gt;indexing is a complex issue and it's not easy 

because of lacks in the <BR>&gt; &gt;implementation (no rename, no redirects, no 

assignment of articles to <BR>&gt; &gt;categories without editing every single 

the article pages). Editing an <BR>&gt; &gt;article I have to guess which 

categories are existing, how they are <BR>&gt; &gt;spelled and the rules what to 

classify into them and what not.<BR>&gt; &gt;<BR>&gt; Perhaps it's too 

easy.&nbsp; Anybody can propose any new category, including <BR>&gt; misspelled 

ones.&nbsp; Doing it without creating chaos is a different thing.&nbsp; <BR>&gt; 

It's especially difficult for people who specialize in a particular area 

<BR>&gt; of knowledge.&nbsp; To categorize effectively at its top level requires 

an <BR>&gt; ability to grasp the "big picture" of the Wikipedia.<BR>&gt; 

<BR>&gt; &gt;2. It's not controllable.<BR>&gt; &gt;<BR>&gt; &gt;You cannot watch 

a category to get noticed on new articles or when <BR>&gt; &gt;somebody removes 

an article from the category. <BR>&gt; &gt;<BR>&gt; Mostly yes.&nbsp; Building 

categories is a top-down activity.&nbsp; Categorizing <BR>&gt; articles is a 

bottom-up activity.&nbsp; The challenge lies in establishing an <BR>&gt; 

interface between these two activities.<BR>&gt; <BR>&gt; &gt;3. There is no 

common mission<BR>&gt; &gt;<BR>&gt; &gt;Can anybody tell the purpose of 

categories? Finding articles (without <BR>&gt; &gt;a coordinated search 

function?!) Browsing in topics (without a clear<BR>&gt; &gt;overview of all 

categories?!) Are we trying to index articles with <BR>&gt; &gt;subject heading, 

using a thesaurus, a classification or even a structure <BR>&gt; &gt;ontology? 

Library science has invented several kind of schemes like that <BR>&gt; &gt;but 

at the moment everybody is muddling this and that trying to invent <BR>&gt; 

&gt;the already invented wheels of documentation (by the way there are 

also<BR>&gt; &gt;methods of automatic indexing, clustering and 

classification).<BR>&gt; &gt;<BR>&gt; Whatever the mission of categories it is a 

subordinate mission motivated <BR>&gt; by a desire to make the information in 

the projects more accessible.&nbsp; <BR>&gt; Categories have no meaning in an 

information vacuum..&nbsp; I would answer <BR>&gt; your questions by saying, 

"All of the above, and more."<BR>&gt; <BR>&gt; Library science has indeed 

invented numerous schemes.&nbsp; Any such scheme <BR>&gt; designed for general 

application is as good as its competitors.&nbsp; Each <BR>&gt; developped 

independently to address the priorities of the originating <BR>&gt; 

library.&nbsp; Any of them may thus be validly criticized for its nationalist 

<BR>&gt; tendencies.&nbsp; Nevertheless, choosing one of them to serve as a 

starting <BR>&gt; point need not be a nationalist act.&nbsp; That choice is more 

likely to be <BR>&gt; driven by the availability of detailed data, and the 

willingness of some <BR>&gt; individual(s) to do the work of adapting that 

system to serve wiki purposes.<BR>&gt; <BR>&gt; The muddling and the 

re-invention of the wheel implicit in most people's <BR>&gt; approach to 

categorization was completely forseeable.&nbsp; I say this <BR>&gt; without 

finding fault.&nbsp; It was just one of those miseries that had to be <BR>&gt; 

gone through; system convergence comes later.&nbsp; Let's just keep away from 

<BR>&gt; automated system until we know what we want.&nbsp; A premature 

application of <BR>&gt; automation will only support the muddling.<BR>&gt; 

<BR>&gt; &gt;And: In classification there is no NPOV because there is no "right" 

way <BR>&gt; &gt;to classify the world but it depends on the special needs and 

questions <BR>&gt; &gt;I want to answer with a special system of subject 

indexing.<BR>&gt; &gt;<BR>&gt; I agree with what you seem to be saying but I 

would not put it in terms <BR>&gt; of NPOV.&nbsp; "Wiki is not paper," is a far 

more useful principle.&nbsp; A <BR>&gt; traditional librarian may want to 

classify a single copy of a book on <BR>&gt; German libraries, and must decide 

whether that book should be shelved <BR>&gt; with books about Germany or books 

about libraries.&nbsp; We do not have that <BR>&gt; restriction.<BR>&gt; 

<BR>&gt; &gt;Given the reasons I strongly recommend to stop using the categories 

and<BR>&gt; &gt;to focus on writing and improving good articles. Many categories 

can easily<BR>&gt; &gt;be replaced with normal links between articles. Adding 

and removing <BR>&gt; &gt;categories do not change an article's content a bit. 

If you want to <BR>&gt; &gt;keep track of all articles in some area use 

(Wiki)Projects, article <BR>&gt; &gt;series, portals and learn how to use the 

"what links here"-function! <BR>&gt; &gt;A good article is an article that can 

be found easily without categories.<BR>&gt; &gt;<BR>&gt; I don't arrive at the 

same conclusion about stopping the use of <BR>&gt; categories.&nbsp; The 

techniques that you mention are all good and effective, <BR>&gt; and they should 

obviously continue to be used.&nbsp; Categories are a way of <BR>&gt; providing 

a comprehensive overview.&nbsp; It is easy to see at an appropriate <BR>&gt; 

place on an article just how that article has been categorized, or <BR>&gt; 

indeed '''if''' it has been categorized.&nbsp; A list system has its uses, 

<BR>&gt; but needs to be manually maintained.&nbsp; It is not evident on the 

face of <BR>&gt; the article that it has been properly listed.&nbsp; A 

non-contributing reader <BR>&gt; may not be aware of the list's existence, or of 

the purpose of "What <BR>&gt; links here."&nbsp; Even a contributor is not going 

to be inclined to check <BR>&gt; every article to see if it has been properly 

listed, but without <BR>&gt; checking he has no way of knowing.&nbsp; This 

brings me back to my earlier <BR>&gt; point about categorization of articles 

being a bottom-up procedure.<BR>&gt; <BR>&gt; &gt;Indeed classifying wikipedia 

articles is very interesting and will <BR>&gt; &gt;become more important, but 

this should be an independent project - maybe <BR>&gt; &gt;in a "Classifipedia" 

or "Categorypedia" that links to wikipedia articles.<BR>&gt; &gt;<BR>&gt; 

No.&nbsp; the categories are meaningless in isolation.<BR>&gt; <BR>&gt; &gt;You 

know - librarians normally do not write the books they organize and <BR>&gt; 

&gt;search engine experts do not write the websites they crawl, so let's 

focus<BR>&gt; &gt;on what we can do the best: creating the most detailed, most 

understandable<BR>&gt; &gt;and freest encyclopedia in the history of 

mankind!<BR>&gt; &gt;&nbsp; <BR>&gt; &gt;<BR>&gt; Your premise here is the most 

important thing that you say.&nbsp; No <BR>&gt; professional librarian would 

tolerate an author who goes around <BR>&gt; insisting that his books be 

classified in a particular way.&nbsp; The <BR>&gt; authors, the editors and the 

classifiers all have their own roles on the <BR>&gt; wiki.&nbsp; All are working 

toward the goal that you specify, but not in <BR>&gt; complete 

isolation.<BR>&gt; <BR>&gt; Ec<BR>&gt; <BR>&gt; 

_______________________________________________<BR>&gt; Wikipedia-l mailing 

list<BR>&gt; </FONT><A href="mailto:Wikipedia-l@Wikimedia.org"><FONT face=Arial 

size=2>Wikipedia-l@Wikimedia.org</FONT></A><BR><FONT face=Arial size=2>&gt; 

</FONT><A href="http://mail.wikipedia.org/mailman/listinfo/wikipedia-l"><FONT 

face=Arial 

size=2>http://mail.wikipedia.org/mailman/listinfo/wikipedia-l</FONT></A></BODY></HTML>