I'm adding wikipedia stuff to the LDP site. I will definitely pass the result past you before any publication, to make sure you're happy with how I've implemented it - since the official "policy" has never been finally established.
I hope you know by now I will do everything I reasonably can to make sure what I do benefits the Wikipedia and doesn't "fork" it in any meaningful way. I'll be driving traffic to you, for one thing, and won't do any actual "editing" on my site. Clicking the "Edit this page" link will do just what it says -- but will take you to the Wikipedia to do your edits. I think the LDP probably gets a lot more traffic right now than the Wikipedia does (6 million page views a month), so that should be a good thing for you.
I do have a problem though, that you can help me with if you will. It seems the tarball is not being updated regularly. I want to download and process it regularly, probably weekly. An rsync feed would be really ideal, because it would minimize the bandwidth requirements. It's not a problem on my end, but it could be on yours.
But if an rsync feed is not in the cards, at least update the tarball. Can you get the tarballs updated on a more regular basis? It should just take a few minutes to write a script and put it in /etc/cron.daily.
Thanks,
David, that's great news!
Actually, Wikipedia did 14,780,010 pageviews in December, and is on track to break 17 million this month. :-)
I do have a problem though, that you can help me with if you will. It seems the tarball is not being updated regularly.
In my opinion, the tarball is updated nightly, and has been for a few weeks. I just looked, and it looks good! Have you checked it lately?
It's not a problem on my end, but it could be on yours.
Oh, I love rsync. It's one of my favorite inventions, it's just so clever. :-)
What do I need to do to set that up for you?
--Jimbo
Jimmy Wales wrote:
Actually, Wikipedia did 14,780,010 pageviews in December, and is on track to break 17 million this month. :-)
I obviously had a mental breakdown when I wrote this 10 minutes ago.
That's 1,478,010 pageviews in December, and on track to break 1.7 million this month. I looked at the numbers, and my eyes and enthusiasm tricked me.
So, if LDP has 6 million pageviews a month, then, yes you DO have a lot more traffic than us! :-)
--Jimbo
On ven, 2002-01-25 at 10:24, Jimmy Wales wrote:
I do have a problem though, that you can help me with if you will. It seems the tarball is not being updated regularly.
In my opinion, the tarball is updated nightly, and has been for a few weeks. I just looked, and it looks good! Have you checked it lately?
http://www.wikipedia.com/tarballs/ Error 404 File Not Found
Hmm...
http://www.wikipedia.com/ Warning: MySQL Connection Failed: Too many connections in ./databaseFunctions.php on line 6
Warning: Supplied argument is not a valid MySQL-Link resource in ./wikiPage.php on line 46 ......
:((
-- brion vibber (brion @ pobox.com)
Brion Vibber wrote:
http://www.wikipedia.com/tarballs/ Error 404 File Not Found
Darn, this is due to the upgrade. I'll fix it now.
http://www.wikipedia.com/ Warning: MySQL Connection Failed: Too many connections in ./databaseFunctions.php on line 6
Warning: Supplied argument is not a valid MySQL-Link resource in ./wikiPage.php on line 46 ......
I'm working on changing the config file to allow more connections.
--Jimbo
On ven, 2002-01-25 at 14:20, Jimmy Wales wrote:
Brion Vibber wrote:
http://www.wikipedia.com/tarballs/ Error 404 File Not Found
Darn, this is due to the upgrade. I'll fix it now.
Forbidden You don't have permission to access /tarballs/ on this server.
Well, we're on the right track at least. :)
http://www.wikipedia.com/ Warning: MySQL Connection Failed: Too many connections in ./databaseFunctions.php on line 6
Warning: Supplied argument is not a valid MySQL-Link resource in ./wikiPage.php on line 46 ......
I'm working on changing the config file to allow more connections.
Seems better now, thanks.
Testing.... Uh-oh. Looks like the entities bug is still there. :( This is a serious bug for pages containing HTML entities for non-Latin1 characters (for instance the Japanese names in http://wikipedia.com/wiki/Miyazaki_Hayao ). The &s are not escaped into &s when the text is inserted into edit box, and the browser thus interprets character entities as the actual characters. When the text is submitted for save or preview, everything after the first non-Latin1 character gets cut off from the page (ie, MASSIVE DATA LOSS).
I thought this had been fixed... leastwise, it works perfectly in the copy of the source I grabbed earlier this month to work on. (basicFunctions.php from 9 Janurary attached for comparison, see lines 67 and 89)
-- brion vibber (brion @ pobox.com)
On Fri, Jan 25, 2002 at 10:24:49AM -0800, Jimmy Wales wrote:
David, that's great news!
Actually, Wikipedia did 14,780,010 pageviews in December, and is on track to break 17 million this month. :-)
I do have a problem though, that you can help me with if you will. It seems the tarball is not being updated regularly.
In my opinion, the tarball is updated nightly, and has been for a few weeks. I just looked, and it looks good! Have you checked it lately?
Checked it last night. I thought it was outdated because there is no Andrew_S._Tanenbaum.kp file. Perhaps I misunderstand the file structure? Which subdir is the official, latest-and-greatest? Feel free to tell me to rtfm if you also give me the location of the fm.
:-)
I think /pages/ is the latest and greatest. The 'keep' files are, uh, I'm not sure what exactly.
I guess everything is changing pronto, though! I'll make sure your tarball is downloadable, but since we've changed software the old tarball will be increasingly irrelevant.
We'll have to put together a new tarball of the mysql database.
David Merrill wrote:
On Fri, Jan 25, 2002 at 10:24:49AM -0800, Jimmy Wales wrote:
David, that's great news!
Actually, Wikipedia did 14,780,010 pageviews in December, and is on track to break 17 million this month. :-)
I do have a problem though, that you can help me with if you will. It seems the tarball is not being updated regularly.
In my opinion, the tarball is updated nightly, and has been for a few weeks. I just looked, and it looks good! Have you checked it lately?
Checked it last night. I thought it was outdated because there is no Andrew_S._Tanenbaum.kp file. Perhaps I misunderstand the file structure? Which subdir is the official, latest-and-greatest? Feel free to tell me to rtfm if you also give me the location of the fm.
:-)
-- David C. Merrill http://www.lupercalia.net Linux Documentation Project david@lupercalia.net Collection Editor & Coordinator http://www.linuxdoc.org
Asked how small software companies could compete on products that Microsoft wants to fold into Windows, [Microsoft COO Bob] Herbold told Bloomberg News they could either fight a losing battle, sell out to Microsoft or a larger company or 'not go into business to begin with.' --Newsweek, March 1998 [Wikipedia-l] To manage your subscription to this list, please go here: http://www.nupedia.com/mailman/listinfo/wikipedia-l
On Fri, Jan 25, 2002 at 04:22:38PM -0800, Jimmy Wales wrote:
I think /pages/ is the latest and greatest. The 'keep' files are, uh, I'm not sure what exactly.
I guess everything is changing pronto, though! I'll make sure your tarball is downloadable, but since we've changed software the old tarball will be increasingly irrelevant.
We'll have to put together a new tarball of the mysql database.
I assume the format will not be changing? If so, I'll wait until it's settled down.
I will also look at the /pages/ directory. I don't remember why I didn't think that was right. IIRC, it was because it contained lots of:
foo
<bar
type of stuff, which made me assume it was "changes". But I'll look at it again. Is there a webpage which describes the file formats and stuff, or do I need to go through the code?
Thanks a lot, Jimbo. I appreciate the help.
On ven, 2002-01-25 at 16:22, Jimmy Wales wrote:
I think /pages/ is the latest and greatest. The 'keep' files are, uh, I'm not sure what exactly.
I guess everything is changing pronto, though! I'll make sure your tarball is downloadable, but since we've changed software the old tarball will be increasingly irrelevant.
We'll have to put together a new tarball of the mysql database.
The tarballs are still broken. http://www.wikpedia.com/tarballs/ gives a 403 Forbidden error; if you know the filenames you can still get a particular tarball, but they don't seem to be getting updated. The Esperanto wikipedia has added a lot of content since January 9; it would be nice to be able to get a current tarball for backup and testing the conversion to the new software.
-- brion vibber (brion @ pobox.com)
On Thu, 24 Jan 2002, David Merrill wrote:
I'm adding wikipedia stuff to the LDP site. I will definitely pass the result past you before any publication, to make sure you're happy with how I've implemented it - since the official "policy" has never been finally established.
I hope you know by now I will do everything I reasonably can to make sure what I do benefits the Wikipedia and doesn't "fork" it in any meaningful way. I'll be driving traffic to you, for one thing, and won't do any actual "editing" on my site. Clicking the "Edit this page" link will do just what it says -- but will take you to the Wikipedia to do your edits. I think the LDP probably gets a lot more traffic right now than the Wikipedia does (6 million page views a month), so that should be a good thing for you.
Yes, and yes. Sounds great!
I do have a problem though, that you can help me with if you will. It seems the tarball is not being updated regularly.
...and, we need to check on this *again*, now that the new software is uploaded. I thought the tarball was being updated either daily, or weekly. It turns out that http://www.wikipedia.com/tarballs is currently inaccessible (probably due to the software change, eh, or the server switchover?). Anyway, there should be a notice on wikipedia.com/tarballs stating (accurately!) how often it's updated. This is very important for our credibility as an open content/free project.
I want to download and process it regularly, probably weekly. An rsync feed would be really ideal, because it would minimize the bandwidth requirements. It's not a problem on my end, but it could be on yours.
But if an rsync feed is not in the cards, at least update the tarball. Can you get the tarballs updated on a more regular basis? It should just take a few minutes to write a script and put it in /etc/cron.daily.
We *were* doing it regularly. :-(
Larry
wikipedia-l@lists.wikimedia.org