Re: [Wikitech-l] Re: Chat about Wikipedia performance?

1 May 2003


      ...
(David A. Wheeler david_a_wheeler@yahoo.com):
...
...

Perhaps for simple reads of the current article (cur), you

could completely skip using MySQL and use the filesystem instead.
In other words, caching.
Sorry, I wasn't clear.
I wasn't thinking of caching - I was thinking of accessing the
filesystem INSTEAD of MySQL when getting the current wikitext.
No, you were clear. I am using "caching" in the plain English
sense of the word. Using the file system as a cache in front of
the database is just one possible implementation of the idea.
...
...
It isn't. And there's no reason to expect flex to be any
faster than any other language.
...
Actually, for some lexing applications flex can be MUCH faster.
That's because it can pre-compile a large set of patterns
into C, and compile the result.  Its "-C" option can, for
some applications, result in blazingly fast operations.
I suppose that's true. I do want to formalize the wikitext
grammar at some point, and using something like Lex/Yacc
code compiled and linked into PHP as a module is certainly
a possibility.
...
Oh, one note - if you want to simply store whether or not a
given article entry exists or not, and quickly check it, one
fancy way of doing this is by using a Bloom filter.
You can hash the article title, and then using a fancy data
structure can store its existance or non-existance.
Yes, that's a very good idea. I just recompiled the PHP on
the server to have the shared memory extensions, so putting
a Bloom filter into that memory is probably better than a
more typical hash table.
-- 
Lee Daniel Crocker lee@piclab.com http://www.piclab.com/lee/
"All inventions or works of authorship original to me, herein and past,
are placed irrevocably in the public domain, and may be used or modified
for any purpose, without permission, attribution, or notification."--LDC

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] Re: Chat about Wikipedia performance?