(Take with some grains of salt: never used FreeBSD or Solaris shells myself and am not a ts-admin. Also, this is a cell phone and I shouldn't even be awake now!)
Also, what do you lose with FreeBSD?
so far I can think of: * zfs dedupe * there's no longer a single "goto" company to get support from (guessing...) so you lose that annual cost but it may be non trivial to find the right hacker to find/fix a problem in an emergency. of course much of that support is likely to come for free (a guess) * you're still a different platform from the rest of wikimedia (basically the foundation) and so lose economy of scale * you mentioned SGE would still be available but it may be unmaintained (last I checked its enwp article, it said oracle was close sourcing it) just to keep in mind, I wouldn't stay just for SGE
things you lose with linux vs. FreeBSD: * zfs is gone and (assuming lvm2 + traditional RAID vs. zfs) generally thin storage provisioning and snapshotting is much more limited and wasteful and fragile/more room for error. also most storage expansions will end up w/ transition periods that have *no* redundancy. * zones/jails
I haven't kept up with the status in the last ~8 months but debian-kfreebsd may have matured enough to warrant a look. (See #debian-kbsd on oftc)
-Jeremy On Jun 7, 2011 2:58 AM, "Daniel Kinzler" daniel@brightbyte.de wrote:
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Jeremy Baron:
Also, what do you lose with FreeBSD?
so far I can think of:
- zfs dedupe
ZFS version 28, including dedup, was integrated into FreeBSD head (9-CURRENT) in February. I imagine it will make it back into 8.x eventually, but although dedup is nice on paper, I'm not sure it's actually very useful. You need so much memory to cache the DDT (since putting it on disk kills performance) that it's usually cheaper to just buy more storage.
It's also unlikely that we would ever migrate the /home NFS server to FreeBSD (at least in the near-term future), since it lacks the HA clustering and volume management that Solaris has. (This is an example of an area where Linux *does* have an advantage over FreeBSD.)
It's *also* unlikely that we would ever put /home on ZFS, whatever OS we used. I don't mind ZFS for root, and I use it at home (on both FreeBSD and Solaris) without problems, but we've had some unfortunate issues with it at the TS that have made me a bit wary.
- there's no longer a single "goto" company to get support from
(guessing...) so you lose that annual cost but it may be non trivial to find the right hacker to find/fix a problem in an emergency. of course much of that support is likely to come for free (a guess)
This is true, and is something that came up the last time we discussed this internally. I'm not too worried about this; the login server environment doesn't really tax the OS, and I can't think of many (any?) problems we've had there that would require vendor support.
There are places like iXsystems http://www.ixsystems.com/bsdsupport which offer third-party support for FreeBSD. I don't have any experience with that, or any idea how it compares price-wise.
- you're still a different platform from the rest of wikimedia (basically
the foundation) and so lose economy of scale
Not sure what economies we could gain here. The Toolserver is completely separate from Wikimedia's infrastructure (except that we use the same colo), and both of us prefer it that way. IOW, even if we used Linux, TS systems could not become just another Wikimedia server.
- you mentioned SGE would still be available but it may be unmaintained
(last I checked its enwp article, it said oracle was close sourcing it) just to keep in mind, I wouldn't stay just for SGE
SGE is dead on every platform, including Solaris. The replacement is either Oracle Grid Engine (which costs money), or one of the various forks of the open-source SGE (which should work on both FreeBSD and Solaris). We're still using SGE because I want to see which (if any) fork gains acceptable before switching.
things you lose with linux vs. FreeBSD:
- zfs is gone and (assuming lvm2 + traditional RAID vs. zfs) generally thin
storage provisioning and snapshotting is much more limited and wasteful and fragile/more room for error. also most storage expansions will end up w/ transition periods that have *no* redundancy.
ZFS snapshots are nice, but OTOH ZFS can't remove storage from a pool or do any kind of online relayout, both of which are standard features in VxVM (and I assume in Linux LVM as well, although I have no experience of it.)
In any case we don't store data on login servers.
- zones/jails
Linux has lxc and OpenVZ, which are both very similar to zones, as well as Xen or KVM for full virtualisation.
I haven't kept up with the status in the last ~8 months but debian-kfreebsd may have matured enough to warrant a look. (See #debian-kbsd on oftc)
Never really saw the point of that. If you want Linux, use Linux. (I guess at the moment it's the only way for Linux users to get a useful ZFS?)
- river.
On Tue, Jun 7, 2011 at 5:22 AM, River Tarnell river.tarnell@wikimedia.de wrote:
ZFS snapshots are nice, but OTOH ZFS can't remove storage from a pool or do any kind of online relayout, both of which are standard features in VxVM (and I assume in Linux LVM as well, although I have no experience of it.)
Linux LVM can do anything you want online, in my experience: grow and shrink logical volumes, move logical volumes between physical volumes within the same volume group, and add or remove physical volumes, for instance. I've found that moving physical volumes is much slower than it should be, though, and I've had bad experiences once or twice moving a PV that's being actively used (system hang, but no data loss and the operation continued automatically after power-cycling and completed successfully).
md isn't quite so good about what it can do online, but it's getting better recently. It can add or remove devices from RAID1/5/6 online, and even change the RAID level online sometimes. If it doesn't do what you want, and you have enough storage attached, you can always create the new md device and move the old one's contents to it online using LVM, possibly leaving one or both RAID arrays degraded while you do so to get the extra space necessary.
ext4 can grow online if you grow the device it's sitting on, but it can't shrink online, and (AFAIK) nor can any other high-profile Linux filesystem except btrfs.
So in practice, on Linux you can do all your I/O-related stuff online except shrinking filesystems. This is usually enough, but the inability to shrink filesystems online can occasionally be a pain. btrfs is slated to fix this, in addition to solving world hunger and curing cancer.
toolserver-l@lists.wikimedia.org