Alan Cox Quits As Linux TTY Maintainer — "I've Had Enough"

spoondan · on July 30, 2009

What I find impressive is that Alan continues to assist on the issue after he has "had enough", providing in-detail responses to questions and correcting part of Linus's patch. I think he also responded to Linus's initial complaint with grace and aplomb.

I'd like to think I am poised and dedicated, but I doubt I am even half so as Alan.

rythie · on July 30, 2009

He has been doing it since 1991, and spent 10 years working at RedHat.

zaph0d · on July 30, 2009

From TFA - "After reading the thread on a mirror, it's important to note that after the argument, Alan and Linus continued to debate the technical merits of how to patch the bugs. The ongoing conversation was civilized and concise. Of course Linus is too much of a pompous ass to apologize to Alan for completely misunderstanding the problem and proposing dangerous and useless ways of fixing it, then arguing about it ad nauseum. Oddly, this doesn't seem to bother Alan or maybe he's just used to it."

DarkShikari · on July 30, 2009

This is one thing I have found true of great open source developers in general: regardless of their attitude, regardless of past events, regardless of the situation, they can carry on technical debates on how to deal with an issue.

In other words, for such people, the technical merits of an idea supersede all other aspects.

pbhjpbhj · on July 30, 2009

I read the thread (didn't understand most of it) but Torvalds describes Cox work as "crap" and tells him "you're full of crap". Way to stick to the technical merits there ...

That's some attitude.

DarkShikari · on July 30, 2009

Did I ever say I was referring to Linus? ;)

pbhjpbhj · on July 31, 2009

No, I didn't assume you did. My comment was partially tangential (now there's a geeky oxymoron).

rythie · on July 30, 2009

If you want to read some actual detail on this, read this LWN article: http://lwn.net/SubscriberLink/343828/d298eb0f131f2039/

barrkel · on July 30, 2009

A telling piece of opinion in that article:

> This code worked with the older TTY code, but broke with 2.6.31. There is probably no way to fix it which doesn't saddle the kernel with maintaining weird legacy bug-compatibility code

"Maintaining weird legacy bug-compatibility code" is the essence of professional and responsible platform software engineering - the software that other software relies upon. It's one of the reasons why platform software development is so hard and why software engineers are paid as much as they are.

There's another misguided notion mentioned that I see a lot in both open source and Wikipedia, two kinds of crowd-sourcing efforts which generally desperately seek authority-figures of some kind, like "standards":

> Alan thinks this behavior is reasonable; it complies with the applicable standards

Compliance to the "applicable standards" isn't worth anything if it breaks software and makes users' lives worse. It's not for nothing than POSIXLY_CORRECT applies to many GNU apps, and is not enabled by default.

corbet · on July 30, 2009

(I'm the author of the article in question)

I disagree completely. Maintaining ABI compatibility is crucially important, and the kernel development community understands that. But ABI compatibility does not mean going to all lengths to enabling buggy code to continue to work. In this case, "fixing" kdesu would have required putting in code to recognize this specific data stream and ensure that a read terminated exactly where it always used to so kdesu wouldn't get confused and throw away good data. It would be an albatross around the neck of any developer working in the TTY layer for years, and those developers are already up to their asses in albatrosses. All to perpetuate an easily-fixed bug in an open-source application.

That kind of "compatibility" wrecks your platform over the course of a few years; you simply can't maintain code like that. A "professional and responsible" software engineer needs to recognize that and avoid it.

Gormo · on July 30, 2009

Maybe Linux (and other similar projects) ought to divide into separate streams - one "research" trunk, that focuses on cleanliness, elegance, and standards compliance and is not constrained by backward compatibility commitments, and from which multiple "practical" branches might diverge, each always maintaining compatibility to its point of origin, despite the tendency to accumulate cruft as it evolves.

davidw · on July 30, 2009

You're free to fork it:-)

Quick answers aside, those guys work hard to try and make everyone happy with the resources they have available, and it's not an easy job.

gaius · on July 30, 2009

Maintaining ABI compatibility is crucially important, and the kernel development community understands that.

History suggests otherwise.

Compare that to Sun's attitude, which is if your old binary doesn't run on the latest Solaris then it's a bug in our code and we'll fix it.

moe · on July 30, 2009

Well, also compare where solaris is today.

Software development is about trade-offs and apparently linux wasn't hurt as bad by the occassional ABI breakage as solaris was hurt by the "backwards compatibility above all" mindset (which ofcourse reaches way beyond the basic ABI).

It's all about the albatrosses.

gaius · on July 30, 2009

Dtrace and ZFS and other features that Linux is struggling to copy (e.g. SystemTAP and BTRFS).

Backwards compatibility in Solaris is a matter of minor annoyances (e.g. ksh93) it hasn't hindered it being a cutting-edge Unix.

moe · on July 30, 2009

I was arguing more in terms of market- and mindshare than in terms of technical merit. Yes, Solaris grew a few very strong subsystems. The problem is that your list ends very shortly after Dtrace, ZFS and Zones (which is already matched by OpenVZ anyways). Those few bright spots are simply not enough reason for most people to put up with with an OS that is otherwise firmly stuck in the 90s (horrible userland, absence of a package manager, no worthwhile mass provisioning options, etc.).

gaius · on July 30, 2009

What came first, Jumpstart or Kickstart? Again, Linux is playing catch-up.

And the userland/package manager problem is neatly addressed by Blastwave.org.

moe · on July 30, 2009

This is leading offtopic, but anyways.

What came first, Jumpstart or Kickstart? Again, Linux is playing catch-up.

Jumpstart, Kickstart? Well, I guess they were hot in 2001, yes.

All sizable solaris deployments I know today are struggling with FAI, puppet and a lot of custom glue scriptery to enable at least a basic grasp on their deployments. JET and the ilk is just not even touching a worthwhile real-world problem.

And the userland/package manager problem is neatly addressed by Blastwave.org.

Likewise I can only scratch my head over reading blastwave and "neat" in the same sentence. Have you even used it?

I'm in fact a bit surprised it still exists because most solaris admins seem to either scoff at the idea altogether and stick with the regular pkgadd, or look for greener pastures in OpenSolaris - which wisely refrained from any attempts to re-invent apt.

Well, but as said, my initial point wasn't even about discussing implementation details in one way or another. There is no doubt solaris displays great technical excellence in some areas. Unfortunately too often these are not the areas that people in the real world care about.

gaius · on July 31, 2009

Yes of course I've used it. Updated all my packages yesterday in fact, went perfectly smoothly. YMMV but all the full-time Solaris admins I know are Blastwave fans.

The point I am making it: a serious OS doesn't break backwards compatibility on an individual developer's whim, no matter how famous they are.

moe · on July 31, 2009

a serious OS doesn't break backwards compatibility on an individual developer's whim, no matter how famous they are.

Well, most people seem to like their OS a little bit less serious nowadays.

Or, more seriously, over here in the real world this is simply a non-issue for almost everybody. Binary compatibility between OS revisions was important when compile-runs would take days. Nowadays you just compile for your target platform and move on.

lallysingh · on July 30, 2009

Indeed, and OpenSolaris covers most of the minor annoyances pretty well. A pretty gnu-friendly userland, with new package management.

barrkel · on July 30, 2009

That's why (for example) Windows has compartmentalized and "pluginized" its AppCompat infrastructure. ABI compatibility trumps almost every other concern, IMHO.

rbanffy · on July 30, 2009

""Maintaining weird legacy bug-compatibility code" is the essence of professional and responsible platform software engineering"

I couldn't disagree more with your point.

Open source is about openness. This is about evolving a platform in a clean and open way. If someone's program is buggy and does weird things it shouldn't do, then the bug should get fixed, not a bug be added to another piece of software in order to make the buggy program (that worked properly only by accident) still work. A feature that is not needed is a bug all in itself.

If your old program won't work with your new OS, then don't upgrade. There is no pressing need to run the latest and greatest OS, at least not in "professional and responsible" environments.

JeremyChase · on July 30, 2009

Being a fan of the BSDs I have to point out that this type of decision would not happen there. The reason being that the kernel and userland are not separate entities; they are a system. If you update your kernel the userland has to update with it, and that is to accommodate changes just like this.

Saavedro · on July 30, 2009

I would point out that the userland in BSDs that would update with the kernel would be the CORE utilities and programs. Say, like GNU utils on a linux distro. kdesu is part of KDE, which is an application installed on top of the OS, not something maintained as part of the OS. Would you say there wouldn't be any issue with a BSD kernel update breaking an extremely widely used application such as, say, Apache?

Malus · on July 30, 2009

There would be issues, but the BSDs tend to break the ABIs on the development branches only, not the stable branches (e.g. CURRENT vs STABLE in FreeBSD). This usually gives port maintainers enough time to patch their ports appropriately or to send their patches upstream. Users should never see any breakage unless they choose to use the development branches.

JeremyChase · on July 30, 2009

Your point is precisely the example I mean. In *BSD the kernel, core, and packages/ports are all tied to one ABI at any time. Linux/GNU's system is more free from, and as a result can result in these problems.

rbanffy · on July 30, 2009

Evolving the API is a fact of life. It your critical program breaks with the most current bleeding-edge version, then either fix it (or have your vendor do it for you) or don't run the latest release. It's not like someone will die for having an earlier release.

That's also why Linux distros don't use the latest kernels: to allow the userland programs they package into them to run until they get fixed and the latest kernel can be added. While I know my Ubuntu doesn't run yesterday's kernel, I am happy it runs perfectly well with last month's kernel.

akkartik · on July 30, 2009

Alan: http://linux.slashdot.org/comments.pl?sid=1319387&cid=28...

Linus: http://linux.slashdot.org/comments.pl?sid=1319387&cid=28...

Coda: http://linux.slashdot.org/comments.pl?sid=1319387&cid=28...

sachinag · on July 30, 2009

Wait, none of these comments are actually made by the people you say they are.

akkartik · on July 30, 2009

Yeaaaah.. they were summaries of positions. Thanks for insinuating that I was being misleading. Did you really think 'Coda' was a person?

I've been watching parent get upmodded and my comment get downmodded with increasing disbelief. Time was that +16 used to mean it was not to be missed, and -4 was a troll or ad hominem.

benreesman · on July 30, 2009

The voting is completely out of whack these days. I'm not sure if the constituency of the community has changed, or if people are just feeling ornery.

smithjchris · on July 30, 2009

This is the typical example where Linux suffers from lack of true engineering. The kernel API and libs should be stable, predictable and well documented. There should be no hacks to support odd behaviour because odd behaviour should not be specified. Isn't that why we have POSIX?

What with recent issues with egotistical libc maintainers, people throwing their pacifiers over having to support junk, people moaning about ext4 (even though it works fine and is poor app behaviour), it doesn't give people much confidence.

Confidence is key.

davidw · on July 30, 2009

> The kernel API and libs should be stable, predictable and well documented. There should be no hacks to support odd behaviour because odd behaviour should not be specified.

There should be a pony too.

How can the kernel not have lots of hacks? It deals with hardware, which is often weird.

smithjchris · on July 30, 2009

NetBSD comes with a Pony :)

POSIX rarely changes. They should start there and work backwards.

UNIX nearly died because vendors wouldn't allow interoperability. Don't let it happen again.

Look at Win32 - still works like it did back in '91 (with Win16).

jasonkester · on July 30, 2009

Certainly you understand why Win32 still runs old Windows 3.1 code from 1991? It does so because of countless little hacks and "compatabilty mode" switches in the OS that emulate old bugs that existed in old versions that badly behaved software was taking advantage of.

http://www.joelonsoftware.com/articles/APIWar.html

That's a good overview of how much "Maintaining weird legacy bug-compatibility code" you need in an OS with any history whatsoever.

smithjchris · on July 30, 2009

I have seen the win32 subsystem source (legitimately through shared source - I used to work for the British Ministry of Defence). It's very clean and tidy and not full of junk like that believe it or not.

The reason it still works is that they thought about it properly to start with. There have been very few extensions. The main one being adjusting pointer sizes (which are hidden behind a typedef like LPSTR anyway so that is irrelevent).

And for the record I take anything Joel says with a pinch of salt. The guy is a trendy blog mouth piece.

jasonkester · on July 30, 2009

In that case, I should have linked you directly to the Raymond Chen articles that Joel was paraphrasing:

http://blogs.msdn.com/oldnewthing/archive/2003/10/15/55296.a... http://blogs.msdn.com/oldnewthing/archive/2003/12/23/45481.a...

Here's a quote:

This is just the tip of the iceberg with respect to application compatibility. I could probably write for months solely about bad things apps do and what we had to do to get them to work again (often in spite of themselves). Which is why I get particularly furious when people accuse Microsoft of maliciously breaking applications during OS upgrades. If any application failed to run on Windows 95, I took it as a personal failure. I spent many sleepless nights fixing bugs in third-party programs just so they could keep running on Windows 95.

gaius · on July 30, 2009

Windows 95 and Windows NT are not the same thing.

smithjchris · on July 30, 2009

Precisely. I forget that the 9x series of operating systems existed sometimes!

undees · on July 30, 2009

I've seen dozens of those "personal failures." :-)

Granted, backwards ABI compatibility is a huge, complicated, thankless task for which you get no notice if you do the job perfectly. The ideal spot for the "break old programs" vs. "cripple new programs to avoid breaking old ones" dial is different for every project. Sometimes the right thing to do is keep the code base from degenerating into a pile of back-hacks. Sometimes it's all about bending over backwards to keep a long-gone vendor's bug from biting your end users.

smithjchris · on July 30, 2009

Windows 9x was a mess. That same is not true with NT which is really nicely put together.

I've dragged some stuff which we wrote for NT4, recompiled it with VC++ 2008 (plus spent some time frigging VS settings) and it just worked on Win2k8. The original binaries, even though they were compiled 12 years ago, still run on Win2k3 fine and past the test suite (!).

tezza · on July 30, 2009

It sounds like your default state is "Full Confidence", and so Confidence is 'eaten away' in your description

The better definition concerning Linux is that Confidence is "Built Up". Linux started as an unknown and has forced itself into people's confidence.

Linux operates well 24x7x365 in many places, and that rate of Confidence build up is way more than any rate of loss due to Steering Committee squabbles.

varjag · on July 30, 2009

If Linux had its API frozen in 1991 there would be very little use for it now. Sometimes you have to change things to move on.

smithjchris · on July 30, 2009

Read this from end to end:

http://msdn.microsoft.com/en-us/library/ms632680%28VS.85%29....

Unchanged since 27 July 1993. Used trillions of times a day by millions/billions of computers.

varjag · on Aug 3, 2009

Except that you are wrong, since the call includes special cases for Windows 98, ME, 2000 and XP.

radioactive21 · on July 30, 2009

nerd fights make great drama. let me know when episode two starts.

I wanted to edit that it would be freaking hilarious if Alan went and worked with Microsoft.

RyanMcGreal · on July 30, 2009

Is this a bad time for me to adopt my most gravitasy James Earl Jones voice and intone, "The perfect is the enemy of the good"?

RyanMcGreal · on July 30, 2009

Boy, did I ever misfire in my attempt at humour!

smithjchris · on July 30, 2009

I +1'ed you to make you feel better :)