-----Original Message-----
From: Venu Durgam [mailto:vdurgam@yahoo.com]
Sent: Friday, October 10, 2003 5:06 PM
To: wikien-l(a)Wikipedia.org
Subject: [WikiEN-l] Java code to convert Wiki text to HTML ?
Hello All,
Is there any open source Java code to convert Wiki text to HTML?
Thanks.
_______________________________________________
WikiEN-l mailing list
WikiEN-l(a)Wikipedia.org
http://mail.wikipedia.org/mailman/listinfo/wikien-l
>> do we go about setting up a clean round robin DNS that's fairly tolerant of one of the servers dropping out for "unplanned maintenance"? <<
You don't, for the reasons already outlined. Consider this alternative:
o router (cheap home type or something better, anything with remote console) set to primary IP and forward traffic to an internal IP.
o IP-addressable remote power switch for the router so you can reset it if necessary, and for the servers if they don't already have one. Costs $500 or so for 4 remote switches, half that for a single. Maybe less on eBay.
o each server running Squid as a reverse proxy server set up to do load balancing/failover between the boxes on their internal IP addresses.
Point the router to the preferred Squid IP/port. If that server goes down, tell the router to point the unchanging external IP to the machine which is up. If the other server goes down, Squid will automatically switch the traffic to the machine it is running on. If the router goes down, use the remote power switch.
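The "point the router at whichever Squid box is up" step can be sketched as a small health check. This is only an illustration under assumptions: the internal addresses, port 3128, and the function names are made up, and the actual router reconfiguration (done through its remote console) is not shown.

```python
import socket
from typing import Callable, Optional, Sequence, Tuple

Backend = Tuple[str, int]  # (internal IP, port); the addresses used below are hypothetical


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def pick_forward_target(
    backends: Sequence[Backend],
    is_up: Callable[[str, int], bool] = port_is_open,
) -> Optional[Backend]:
    """Return the first backend, in order of preference, that answers; None if none do."""
    for host, port in backends:
        if is_up(host, port):
            return (host, port)
    return None


if __name__ == "__main__":
    # Preferred Squid box first, the other one as fallback.
    squids = [("10.0.0.2", 3128), ("10.0.0.3", 3128)]
    print(pick_forward_target(squids))
```

A result of None means neither Squid box answered, which is the point where a person (or the remote power switch) has to step in.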
Squid should do well on the database server most of the time. It'll offload some of the CPU load from the main web server and use it on the disk-bottlenecked database server with the less used CPU.
To add speed these are some options:
o more web servers, same database, helps spread page building load with Squid doing the load balancing.
o squid on a different machine, probably more reliable than shared service, so site reliability should go up. Also helps with caching, since Squid does that and could offload that chore from the page builders.
o if connection levels really rise, the router can be switched to a load balancer pointing to a set of Squid machines offloading connections from a page builder farm.
I'm assuming that memcached keeps database load under control.
I've tweaked up the unfinished importUseModWiki.php to make what seems
to be a workable minimalist version: it doesn't do any conversion of
subpage links, import of images, or character references that could be
turned to straight unicode; it doesn't move talk pages into talk
namespace; it doesn't fix links broken by case insensitivity; it
doesn't (yet) try to import user accounts; it doesn't try to create
interlanguage links from existing ones pointing in.
It just dumps pages as-is into cur and old, and assumes that humans
will clean it up. This should be enough for many of the smaller wikis
to get going on MediaWiki in the near future.
The new version is in the maintenance directory of both the test and stable
branches in CVS.
Caveat: after the pages are imported to a newly created wiki, all the
support tables will need to be rebuilt. I'm having some trouble getting
the link rebuild script to work; I'll debug that a bit.
-- brion vibber (brion @ pobox.com)
Yay-- as of 21:00 UTC Wikipedia is fairly flyin (I say fairly because a few
page loads still need to think about it for a moment before taking the
plunge). Any chance we can get a bullet list of server additions/changes?
I'm especially interested in whether you have any round robin happening. I'm
hoping to free up time to help out with some programming and maybe some
server/DB admin, if you need it-- just want a feel for the new lay of the
land.
--
JDG
From: "Poor, Edmund W" <Edmund.W.Poor(a)abc.com>
Subject: RE: [Wikitech-l] Re: Spam and enlarging our capacity
>Wikien-l is configured to "pass" only messages from
>subscribers. All
>other messages are put on "hold" until an
>administrator deals with
>them.
We do the same
>I don't know who was dealing with this before my
>vacations this year,
>but recently I've been looking once a week and
>deleting anything that
>didn't look like a legitimate message.
Once a week...
>I **!! NEVER !!** communicate with spammers. Telling
>them not to send more mail only alerts them to the
>fact that they've found a real live human being,
>someone who might read their, um, merde. Until the US
>Congress passes an anti-spam law, it's best just to
>delete unwanted e-mail.
Please DO PASS that law ! :-)
But if you automatically reject their mail, do
they think there is a human being?
>If you need help configuring Wiki-fr you could e-mail
>the password privately and I could take a look.
Thanks, I made some changes on Jimbo's
recommendations. I'll see. I fear I totally forgot the
fr password btw. I must have written it somewhere :-(
Ed Poor
From: Brion Vibber <brion(a)pobox.com>
>I installed SpamAssassin on the server (from RPMs)
>but never quite got to figuring out how to get
>things filtered through it.
>If anyone's got experience with it, please advise.
Yes, if anyone with experience....please advise Brion
From: Jimmy Wales <jwales(a)bomis.com>
>I recommend to go to 'privacy options' and then
>'sender filters'.
>Scroll to the bottom. For the setting for what
>happens when people
>who are not subscribed send email to the list, click
>on either
>'reject' or 'discard'. And then set the next setting
>so that when
>that automatic action happens, you are not notified.
>The decision between 'reject' and 'discard' is based
>on this: If you
>discard, it is kinder to the world at large in the
>sense that you
>don't bother responding with a rejected message to
>the 99% of what you
>get that is totally useless spam from invalid
>messages. If you
>'reject', it is kinder to the unfortunate list member
>who accidentally
>doesn't realize that they posted from the wrong
>address.
>A discard does not send out a notification to the
>sender at all, it
>just discards their message and that's the end of
>that.
>So I slightly favor 'reject'.
Oh, right. Thank you. I looked at these pages. I made
a change "just" not to receive notice on my mail for
now. When I am bored and the ml rather sleeping, I
will switch to "reject". "Discard" is imho impolite.
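The difference between the 'reject' and 'discard' settings discussed above can be sketched as follows. This is not Mailman's API; the function, the Outcome type, and the notice text are hypothetical and only model the behaviour described in the quoted message.

```python
from typing import NamedTuple, Optional, Set


class Outcome(NamedTuple):
    delivered: bool                   # True if the post goes out to the list
    notice_to_sender: Optional[str]   # text sent back to the sender, if any


def handle_post(sender: str, subscribers: Set[str], nonmember_action: str = "reject") -> Outcome:
    """Model what happens to a post depending on who sent it.

    `subscribers` is assumed to hold lower-cased addresses.
    """
    if sender.lower() in subscribers:
        return Outcome(delivered=True, notice_to_sender=None)
    if nonmember_action == "reject":
        # Kinder to a member posting from the wrong address: they are told about it.
        return Outcome(False, "Your message was rejected: this address is not subscribed.")
    if nonmember_action == "discard":
        # Kinder to the world at large: no reply goes to forged spam addresses.
        return Outcome(False, None)
    raise ValueError(f"unknown action: {nonmember_action!r}")
```

In both non-member cases the message never reaches the list; the only difference is whether the sender hears back.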
__________________________________
Do you Yahoo!?
The New Yahoo! Shopping - with improved product search
http://shopping.yahoo.com
From: "Poor, Edmund W" <Edmund.W.Poor(a)abc.com>
Subject: RE: [Wikitech-l] spam
>Anthere,
>I am not getting any of the mailing list spam you are
>talking about --
>at least, not in noticeable amounts.
That is *totally* unfair.
One or two weeks ago, I had a sort of rebouncing mail
which filled up my mailbox with more than 100 mails in
a couple of hours. When I went to the admin page, most
had to be treated individually to remove them from the
list. Thank God Brion helped me there to clean and to
block the mail.
Today, I just caught one soon after it started.
Fortunately, all had the same sending address so it was
quick to clean.
But how come you admin a list and have no spam ???
>I use Microsoft Outlook, and I'm behind a corporate
>firewall, so my
>experience may be different from yours. But I have
>configured my mail
>client (MS Outlook) to move e-mail containing words
>like:
>[Wikitech-l]
>Into a folder marked Wikipedia Developers.
Is it possible then, in the list admin, to
prevent the administrator from receiving the individual
mails, just the reminders perhaps ?
Mind you, when I find more than 100 mails in my mail
box in the evening, I do not always dig for
unauthorized messages sent by careless wikipedia
editors under an unregistered email address.
>My anti-spam strategy is a poor man's version of
>filtering.
>I leave everything in "Inbox", except e-mail from
>known sources like:
>* Business colleagues (address contains the same
>domain as mine)
>* Friends and relatives (added to a list, one by one)
>* Wikipedia (subject line contains "Wiki")
(I hope I am in the list :-))
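The "poor man's" filtering rules quoted above amount to a short chain of checks. A rough sketch, assuming made-up folder names and a case-insensitive match on "Wiki" (the original message does not say whether case matters):

```python
from typing import Set


def choose_folder(sender: str, subject: str, my_domain: str, friends: Set[str]) -> str:
    """Pick a folder for a message; anything unrecognised stays in the Inbox.

    `friends` is assumed to hold lower-cased addresses added one by one.
    """
    sender = sender.lower()
    domain = sender.rpartition("@")[2]   # text after the last "@"
    if domain == my_domain.lower():
        return "Colleagues"              # same domain as mine
    if sender in friends:
        return "Friends"                 # on the hand-maintained list
    if "wiki" in subject.lower():
        return "Wikipedia"               # subject line contains "Wiki"
    return "Inbox"                       # everything else, spam included
```

The weakness is visible in the last line: spam is not identified, it is merely what is left over once the known senders have been moved out.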
Aren't you the listadmin for wikitech ? Do you
sometimes have messages to authorize ?
>I've heard of some sophisticated techniques using
>"Bayesian statistics"
>but I haven't tried them yet.
>Is there anything I can do, as mailing list
>administrator, to help get
>rid of spam?
>Ed Poor
>Wiki-tech admin
>Wikien admin
Explain to me why wikien-l and wikitech-l have no spam
perhaps :-)
And explain to the spammers I alone receive these
mails, ain't enlarge my penis, and my lips are ok, so
they can just drop the matter ?
Dunno :-(
Wikifr admin
------------------------------
From: Andre Engels <engels(a)uni-koblenz.de>
Subject: Re: [Wikitech-l] Re: Wikitech-l spam
On Thu, 9 Oct 2003, Anthere wrote:
> Is it possible to add some words to an anti-spam list,
> and to have any message containing "sex", "enlarge",
> "lips", "money", "daemon", "free business
> proposition", "free flower" (if only, sigh) and such
> just be automatically sent back ? (or just plainly
> dumped)
>And how are we going to discuss getting MONEY to
>ENLARGE the capacity of our hardware then?
>Andre Engels
Err...french people do not discuss sex and money on
the mailing list André. They discuss "sexe" (not even)
and "argent". Let's not be desperate upon enlarging
the hardware :-)
Soooo...perhaps a mail subject with a bunch of
classical english words might be enough to get rid of
the worst.
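A subject-line word filter of the kind suggested above could look roughly like this. It is a sketch only: the word list is a subset of the examples from the thread, the function name is made up, and matching whole words case-insensitively is an assumption.

```python
import re
from typing import Iterable

# Lower-cased whole words taken from the examples in the thread.
BLOCKED_WORDS = {"sex", "enlarge", "lips", "money"}


def subject_is_blocked(subject: str, blocked: Iterable[str] = BLOCKED_WORDS) -> bool:
    """Return True if any blocked word appears as a whole word in the subject."""
    words = set(re.findall(r"[a-z]+", subject.lower()))
    return not words.isdisjoint(blocked)


if __name__ == "__main__":
    print(subject_is_blocked("Enlarge your lips today"))
    print(subject_is_blocked("Discussion sur le sexe et l'argent"))
    # Andre's objection: a legitimate English subject trips the filter too.
    print(subject_is_blocked("Getting MONEY to ENLARGE the capacity of our hardware"))
```

Because only whole English words are matched, "sexe" and "argent" pass, while the hardware-funding subject is caught, which is the false positive Andre pointed out.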
There was general agreement on wikipedia-l that the
current donation text on each Wikipedia page should be
changed to:
Find out how you can help [support] Wikipedia's
phenomenal growth.
with [support] being a link to
http://wikimediafoundation.org/fundraising
Please 'make it so!' somebody. This message does not
apply to other projects for obvious reasons (another
message will have to be developed for them).
-- mav