Hello,
Thank you for your work on MediaWiki!
I'm interested in representing individual diffs by digest values.
Is there a generic or standard format for representing the difference
between two versions of a MediaWiki article?
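One possible way to get a digest for a diff, sketched in Python with only the standard library. This assumes a unified diff is an acceptable representation of the change; the function name `diff_digest` and the choice of SHA-256 are illustrative assumptions, not a MediaWiki format.

```python
import difflib
import hashlib


def diff_digest(old_text: str, new_text: str) -> str:
    """Return a SHA-256 hex digest of the unified diff between two texts.

    Hypothetical helper: the diff format (unified) and hash (SHA-256)
    are assumptions made for illustration only.
    """
    diff_lines = difflib.unified_diff(
        old_text.splitlines(),
        new_text.splitlines(),
        fromfile="old",
        tofile="new",
        lineterm="",
    )
    # Join with a fixed separator so the digest is reproducible.
    diff_text = "\n".join(diff_lines)
    return hashlib.sha256(diff_text.encode("utf-8")).hexdigest()
```

The digest is only comparable between parties that agree on the same diff algorithm, context size, and line-ending handling.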
Thanks!
J. Patrick Bedell
jpbedell(a)gmail.com
http://infoeng.sourceforge.net
With the start of the school season, the request rate and thus load have gone
way up in the last few weeks. A few more Apache boxen are being planned for, but
we also have some optimization to do in storage access patterns.
On Domas's urging I've added a cache for extracted revision text using
memcached. This can reduce both the load on the external storage servers and
the time spent decompressing and extracting individual items from a bulk
storage blob.
According to live profiling data on en.wikipedia and de.wikipedia, the share of
profiled time spent in Revision::getRevisionText has dropped from the 40s to the
10s (in percent), with a corresponding increase in memcached::get from about 8%
to about 18%. That's a drop of about 200ms of real time per average profiled
request during the US-daytime/Europe-evening rush.
Hopefully it keeps up and doesn't cause any other problems; I've set it to cache
for an hour, but this could be reduced or increased as necessary.
The setting is $wgRevisionCacheExpiry and is available in 1.8 on SVN trunk,
disabled by default. To be useful, it requires the main $wgMemc cache to be set
up, and you should have a fair amount of external storage in use or else it
might just be slower to make all those memcached queries. :)
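The general pattern (check the cache, fall back to storage, store the result with an expiry) can be sketched as follows. This is a Python illustration, not the MediaWiki code: `ExpiringCache` is an in-process stand-in for memcached, and the key format and function names are assumptions.

```python
import time


class ExpiringCache:
    """In-process stand-in for memcached: get/set with a per-item expiry."""

    def __init__(self, clock=time.time):
        self._items = {}
        self._clock = clock

    def get(self, key):
        entry = self._items.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self._clock() >= expires_at:
            # Expired: drop the item and report a miss.
            del self._items[key]
            return None
        return value

    def set(self, key, value, expiry_seconds):
        self._items[key] = (value, self._clock() + expiry_seconds)


REVISION_CACHE_EXPIRY = 3600  # one hour, the value mentioned in the message


def get_revision_text(rev_id, cache, load_from_storage,
                      expiry=REVISION_CACHE_EXPIRY):
    """Return revision text, consulting the cache before slow storage."""
    key = f"revisiontext:{rev_id}"  # hypothetical key format
    text = cache.get(key)
    if text is None:
        # Cache miss: do the expensive fetch/decompress, then remember it.
        text = load_from_storage(rev_id)
        cache.set(key, text, expiry)
    return text
```

Every lookup costs one cache round trip, so the pattern only pays off when `load_from_storage` is noticeably slower than the cache, which matches the caveat about external storage above.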
-- brion vibber (brion @ pobox.com)
An automated run of parserTests.php showed the following failures:
Running test TODO: Table security: embedded pipes (http://mail.wikipedia.org/pipermail/wikitech-l/2006-April/034637.html)... FAILED!
Running test TODO: Link containing double-single-quotes '' (bug 4598)... FAILED!
Running test TODO: Template with thumb image (with link in description)... FAILED!
Running test Template infinite loop... FAILED!
Running test TODO: message transform: <noinclude> in transcluded template (bug 4926)... FAILED!
Running test TODO: message transform: <onlyinclude> in transcluded template (bug 4926)... FAILED!
Running test BUG 1887, part 2: A <math> with a thumbnail- math enabled... FAILED!
Running test TODO: HTML bullet list, unclosed tags (bug 5497)... FAILED!
Running test TODO: HTML ordered list, unclosed tags (bug 5497)... FAILED!
Running test TODO: HTML nested bullet list, open tags (bug 5497)... FAILED!
Running test TODO: HTML nested ordered list, open tags (bug 5497)... FAILED!
Running test TODO: Parsing optional HTML elements (Bug 6171)... FAILED!
Running test TODO: Inline HTML vs wiki block nesting... FAILED!
Running test TODO: Mixing markup for italics and bold... FAILED!
Running test TODO: 5 quotes, code coverage +1 line... FAILED!
Running test TODO: HTML Hex character encoding.... FAILED!
Running test TODO: dt/dd/dl test... FAILED!
Passed 412 of 429 tests (96.04%) FAILED!
Hi.
If I set MediaWiki:cite_reference_link to just $3 (so <ref>...</ref>
will output the plain number), can I do
{{#expr: <ref/> + <ref/>}}
to get "3"?
(I don't want to change the global reference format just to check this,
which is why I haven't tried it myself. If somebody knows...)
I ask because in es.wiktionary we are planning to use <ref/>
for counters. We need counters, and we don't want to accumulate
references at the bottom of the page, so it's a good use for this feature.
Wiktionary definitions will be internally labeled and the numbers will
be generated automatically. So, we can rearrange them and the numbering
will be automatically updated as well as the references in the
translations box. Example (w/o the necessary templates for formatting,
etc.):
{{lemma|avis}}
<ref name="bird"/> Bird.
<ref name="company"/> Car rental company.
Translations: Spanish: <ref name="bird"/> Ave; ...
(I'm hoping a developer will jump in and say "this is a kludge! here! have
real counters!" :)
Sometimes we have more than one lemma per page (because it has a
different etymology, or because the word is used in more than one
language). In that case, we need to reset the <ref> counter. I thought
of doing that this way:
{{lemma|avis|1}}
{{def|bird|1}} Bird.
{{lemma|avis|2}}
{{def|company|2}} Car rental company.
Where {{lemma}} would output an invisible <ref name="last{{{2}}}"/> and
the number output by {{def}} would be {{#expr: <ref name="{{{1}}}"/>
- <ref name="last{{{2}}}"/>}}. That's the reason for the question in the
subject.
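The arithmetic behind that scheme can be modelled outside the wiki. The Python sketch below only checks the subtraction idea; it says nothing about whether the parser will actually evaluate <ref/> inside #expr. It assumes named refs are numbered in order of first use and that a repeated name reuses its number; `RefCounter`, `lemma` and `definition` are hypothetical names.

```python
class RefCounter:
    """Hands out 1, 2, 3, ... to names in order of first use.

    Assumption: a repeated name gets the number it was first given.
    """

    def __init__(self):
        self._numbers = {}

    def ref(self, name):
        if name not in self._numbers:
            self._numbers[name] = len(self._numbers) + 1
        return self._numbers[name]


def lemma(counter, lemma_index):
    """Model {{lemma}}: emit the invisible marker ref for this lemma."""
    return counter.ref(f"last{lemma_index}")


def definition(counter, name, lemma_index):
    """Model {{def}}: the definition's number relative to its lemma marker."""
    return counter.ref(name) - counter.ref(f"last{lemma_index}")


counter = RefCounter()
lemma(counter, 1)
first = definition(counter, "bird", 1)      # numbering starts at 1
lemma(counter, 2)
second = definition(counter, "company", 2)  # restarts at 1 after the new lemma
```

Under these assumptions both `first` and `second` come out as 1, so the per-lemma numbering restarts as intended.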
Whew. Does all this make any sense? Is there any better way to do it?
Thanks.
--
I've implemented the feature that's been in discussion at
[[wikisource:project_talk:Labeled_section_transclusion]], and attached the
patch to [[bugzilla:5881]], since I don't have commit access.
I just wanted to see if anyone has feedback on that, and what the next step
is to move that along.
Thanks
-Steve
Hello all,
Is there an extension that lets users add tags to pages?
Thanks
Ting
--
"The bane of my existence is doing things
that I know the computer could do for me"
-- Dan Connolly
Okay, so we (or those few of us that spoke up about it) generally agree that
category intersections will be popular. I also have the impression that SQL
queries against the existing table will likely not be fast enough (Domas,
Brion, anyone else care to comment more explicitly?). Based on a few tests
on my wiki (only about 25,000 rows in my categorylinks table), the SQL query
alone to get the intersection of 2 categories takes about 0.2 seconds, so
it will certainly take longer for more categories and more rows. Does anyone
have suggestions for optimising SQL queries against the existing table? I
tried a couple of methods, and they took the same amount of time.
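For reference, one common way to phrase such an intersection query is GROUP BY with HAVING. The sketch below runs it from Python against an in-memory SQLite table. It uses the cl_from and cl_to columns of categorylinks; everything else (the index name, sample rows, function name) is made up for illustration, and it is not necessarily one of the methods tried above. SQLite timings say nothing about MySQL performance.

```python
import sqlite3


def pages_in_all_categories(conn, categories):
    """Return page ids (cl_from) that appear in every given category."""
    wanted = sorted(set(categories))
    if not wanted:
        return []
    placeholders = ",".join("?" for _ in wanted)
    query = (
        "SELECT cl_from FROM categorylinks "
        f"WHERE cl_to IN ({placeholders}) "
        "GROUP BY cl_from "
        # A page qualifies only if it matched all requested categories.
        "HAVING COUNT(DISTINCT cl_to) = ? "
        "ORDER BY cl_from"
    )
    rows = conn.execute(query, [*wanted, len(wanted)]).fetchall()
    return [row[0] for row in rows]


# Toy data standing in for a real categorylinks table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE categorylinks (cl_from INTEGER, cl_to TEXT)")
conn.execute("CREATE INDEX cl_to_from ON categorylinks (cl_to, cl_from)")
conn.executemany(
    "INSERT INTO categorylinks VALUES (?, ?)",
    [
        (1, "Birds"), (1, "Spain"),
        (2, "Birds"),
        (3, "Spain"), (3, "Birds"), (3, "Companies"),
    ],
)
both = pages_in_all_categories(conn, ["Birds", "Spain"])
```

The query still has to scan every row for each requested category, so its cost grows with category size; that is the cost the precomputed approach tries to avoid.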
So, *if* throwing SQL at the existing table is too slow (and it looks like
it probably is), then the next alternative is to create records for the
intersections themselves, so the retrieval query is a simple one. This
would make the write time longer, but save a lot on the many reads. So,
here's a table of how many combinations would have to be written for up
to 10 categories. Anybody have a histogram of how many categories are on a
given page? I'd also guess pages would tend to get *more* categories once
the intersection tool is available.
categories    distinct combinations
 1               1
 2               3
 3               7
 4              15
 5              31
 6              63
 7             127
 8             255
 9             511
10            1023
So you can see that the number of combinations would get to be quite a lot.
I would propose writing them into the categorylinks table, and making the
existing category pages smart enough to deal with them, rather than writing
them into a new table. I think sorting the categories on a given page, then
concatenating each possible combination with some character (perhaps a tab or
newline) between them, and then posting them into the database would work.
The only downside I can think of is that I'd guess it would take a *long* time
to post hundreds of records. I guess I'll try it and see. Limiting the number
of categories in an intersection to 3 or 4 would reduce the number of
combinations very significantly.
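A minimal Python sketch of the key scheme described above: sort the categories, join each combination with a separator, and optionally cap the combination size. The tab separator follows the suggestion in the text; the function names and the `max_size` parameter are assumptions for illustration.

```python
from itertools import combinations
from math import comb

SEPARATOR = "\t"  # tab, one of the separators suggested in the text


def intersection_keys(categories, max_size=None):
    """Return one key per non-empty combination of the page's categories.

    Categories are sorted first so a given set always yields the same key.
    """
    ordered = sorted(set(categories))
    if max_size is None:
        max_size = len(ordered)
    keys = []
    for size in range(1, min(max_size, len(ordered)) + 1):
        for combo in combinations(ordered, size):
            keys.append(SEPARATOR.join(combo))
    return keys


def combination_count(n_categories, max_size=None):
    """Number of keys a page with n_categories needs, without building them."""
    if max_size is None:
        max_size = n_categories
    top = min(max_size, n_categories)
    return sum(comb(n_categories, k) for k in range(1, top + 1))
```

With no cap, `combination_count(n)` equals 2^n - 1, matching the table. For a page with 10 categories, a cap of 3 gives 175 keys and a cap of 4 gives 385, against 1023 uncapped.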
Any ideas? Feedback?
Best Regards,
Aerik