Dear list,
do you provide more information about the pages-logging dump somewhere? While parsing it we came across some questions that we are trying to clarify:
* Why are there only ~40 million logs (at over 450 million revisions)? Which logs does the pages-logging dump / the "public" logging table (not) contain? (We double-checked the number of logs on the database, using our Toolserver-Account (Logging table).)
* Can you give us more information about the TextElement that is defined for <logitem> in the XML Schema? Definition: <element name="text" type="mw:TextType"/>
In pages-logging it sometimes occurs as: <text deleted="deleted" />
How is this being used?
Kind regards, Katja Mueller
On 06/01/12 10:42, Katja Mueller wrote:
Dear list,
do you provide more information about the pages-logging dump somewhere? While parsing it we came across some questions that we are trying to clarify:
- Why are there only ~40 million logs (at over 450 million revisions)?
Which logs does the pages-logging dump / the "public" logging table (not) contain? (We double-checked the number of logs on the database, using our Toolserver-Account (Logging table).)
The logs live at Special:Log. Most edits don't produce log entries, although some actions produce both a revision and a log entry (upload, protect, move...), and others only leave a log entry (such as a block or patrol).
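To see which log types the dump actually contains, the entries can be tallied by their <type> element. What follows is only a minimal sketch: the function names, the inline sample and the export-0.6 namespace in it are assumptions for illustration, and the real pages-logging file would be opened (and decompressed) instead of the sample.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def local(tag):
    """Strip an XML namespace prefix such as '{http://...}logitem'."""
    return tag.rsplit("}", 1)[-1]


def count_log_types(source):
    """Stream through a dump and count <logitem> entries per <type>."""
    counts = Counter()
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local(elem.tag) == "logitem":
            # Only direct children are inspected; None if <type> is missing.
            log_type = next(
                (child.text for child in elem if local(child.tag) == "type"),
                None,
            )
            counts[log_type] += 1
            elem.clear()  # drop the children we no longer need
    return counts


# Hypothetical miniature dump, not real data.
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.6/">
  <logitem><id>1</id><type>block</type><action>block</action></logitem>
  <logitem><id>2</id><type>move</type><action>move</action></logitem>
  <logitem><id>3</id><type>block</type><action>unblock</action></logitem>
</mediawiki>"""

if __name__ == "__main__":
    for log_type, n in count_log_types(io.BytesIO(SAMPLE)).most_common():
        print(log_type, n)
```

Streaming with iterparse keeps memory use low, which should matter for a file with tens of millions of entries.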
- Can you give us more information about the TextElement that is defined for <logitem> in the XML Schema? Definition: <element name="text" type="mw:TextType"/>
In pages-logging it sometimes occurs as:
<text deleted="deleted" />
How is this being used?
It's used to hide log entries when they contain personal info. It replaces <logtitle> and <params>. It's not used for anything in non-revdeleted entries.
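For a parser, this means a <logitem> may lack <logtitle> and <params> and carry <text deleted="deleted" /> instead. A minimal sketch of how such entries could be detected follows; the function names and the sample entries are made up for illustration, and the assumption that other elements (contributor, comment) can carry the same attribute comes from the export schema, not from this thread.

```python
import io
import xml.etree.ElementTree as ET


def local(tag):
    """Strip an XML namespace prefix such as '{http://...}logitem'."""
    return tag.rsplit("}", 1)[-1]


def find_hidden(source):
    """Map each log entry id to the child elements marked deleted="deleted"."""
    hidden = {}
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local(elem.tag) != "logitem":
            continue
        # Direct children only, so the contributor's own <id> is not picked up.
        entry_id = next(
            (child.text for child in elem if local(child.tag) == "id"), None
        )
        hidden[entry_id] = [
            local(child.tag)
            for child in elem
            if child.get("deleted") == "deleted"
        ]
        elem.clear()
    return hidden


# Hypothetical entries, not real data.
SAMPLE = b"""<mediawiki>
  <logitem>
    <id>1</id>
    <contributor><username>Example</username><id>7</id></contributor>
    <comment>example comment</comment>
    <type>delete</type>
    <action>delete</action>
    <logtitle>Example page</logtitle>
    <params />
  </logitem>
  <logitem>
    <id>2</id>
    <contributor deleted="deleted" />
    <comment deleted="deleted" />
    <type>block</type>
    <action>block</action>
    <text deleted="deleted" />
  </logitem>
</mediawiki>"""

if __name__ == "__main__":
    for entry_id, fields in find_hidden(io.BytesIO(SAMPLE)).items():
        print(entry_id, fields or "nothing hidden")
```

An entry with an empty list is a normal, fully visible log entry; one that lists "text" has its title and parameters suppressed.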
- Why are there only ~40 million logs (at over 450 million revisions)?
Which logs does the pages-logging dump / the "public" logging table (not) contain? (We double-checked the number of logs on the database, using our Toolserver-Account (Logging table).)
The logs live at Special:Log. Most edits don't produce log entries, although some actions produce both a revision and a log entry (upload, protect, move...), and others only leave a log entry (such as a block or patrol).
Ok. Is there an overview of the log types somewhere that goes beyond, or includes more information than, Special:Log?
- Can you give us more information about the TextElement that is defined for <logitem> in the XML Schema? Definition: <element name="text" type="mw:TextType"/>
In pages-logging it sometimes occurs as:
<text deleted="deleted" />
How is this being used?
It's used to hide log entries when they contain personal info. It replaces <logtitle> and <params>. It's not used for anything in non-revdeleted entries.
Which kind of personal information would this be? Is there a flag somewhere (i.e., in user settings)?
Kind regards, Katja
wikitech-l@lists.wikimedia.org