Why ISS Computers Failed 324
Geoffrey.landis writes "It was only a small news item four months ago: all three of the Russian computers that control the International Space Station failed shortly after the Space Shuttle brought up a new solar array. But why did they fail? James Oberg, writing in IEEE Spectrum, details the detective work that led to a diagnosis." The article has good insights into the role the ISS plays as a laboratory for US-Russian technology cooperation — something that is likely to be crucial in any manned Mars mission.
Urgh. (Score:5, Insightful)
Duct Tape (Score:5, Insightful)
They also decided to rig a thermal barrier out of a surplus reference book and all-purpose gray tape
Almost certainly, this was the duct tape we all know and love. They probably thought it was better not to actually say that, though. Pretty funny. And as an added side-benefit, they should be safe from terrorists.
Redundancy != Safety (Score:5, Insightful)
I think NASA should have learned this lesson by now. After all, the Challenger disaster showed this principle as well. In that case, the same cold temperature that weakened the primary seal on the solid rocket booster weakened the secondary as well, sapping its ability to provide redundant backup. In this case, the same condensation affected all three computers equally.
Its troubling to see them taking shortcuts on safety and redundancy, when such measures have resulted in loss of life before. How hard would it have been to have had three shut-off cables?
Re:The REAL reason they failed (Score:1, Insightful)
You're joking, I hope?
Every discussion of Vista is FULL of astroturfers defending the OS, and they're always modded up. It's almost impossible to discuss its real flaws because of all the Microsoft-sponsored noise.
Proper debugging technique (Score:5, Insightful)
The author is obviously way more qualified than I to assess the situation and he may well be right but from the content of the article I came away thinking, wow, I would have looked first at all the recent changes to the station and the power supply too.
It's interesting... (Score:5, Insightful)
1) unexpected failure modes
2) political battles
Which really isn't a whole lot different than 1) the unexpected failure modes I see every day at work, and 2) the political wrangling (fingerpointing) that takes place when they happen. Apparently NASA and its Russian equivalent are no better than any old software company.
The lesson being, people are people, and people are still the ones that design these things.
Hate to break it to you... (Score:4, Insightful)
Jingoism (Score:2, Insightful)
I hope they don't (Score:5, Insightful)
The article has good insights into the role the ISS plays as a laboratory for US-Russian technology cooperation -- something that is likely to be crucial in any manned Mars mission.
No offense to Russia or the US, both who produce good space gear, but technology cooperation is probably a bad idea unless it is tested more thoroughly than in the ISS. The ISS is a great example of how to screw up international cooperation. The station has been delayed for more than a decade (and cost NASA around $50 billion so far) due to redesign and indecision, reliance on a single launch vehicle for key components (the Shuttle), and the inclusion of the Russians. There are parts of the station that can only communicate with the Russians and parts that can only communicate with NASA. Aside from basic utility hookup (electricity), there's no connection between the different parties on the ISS (at least between the Russians and NASA, the ESA and Japanese parts might work better with NASA's stuff). And if you want to make changes that affect more than one party, it becomes by default an international issue. Finally, there's no easy way to transfer ownership. NASA's communication system is integral (TDRSS [wikipedia.org]) to the NASA parts and is also a national secret (so I understand). So the communication system can't be transfered to another party like the Russians or the ESA.
If there's any international cooperation between space agencies, it probably should be at a rather trivial and manageable level. Say including foreign astronauts or using off the shelf equipment that is know to work under the circumstances.
Re:The REAL reason they failed (Score:4, Insightful)
Re:Proper debugging technique (Score:4, Insightful)
Look, the Russians as people are all right. But their management in the space program is obsessed with face. They feel that admitting any faults demeans the Russian nation and the Russian people. You can laugh but that's how it is.
Re:Redundancy != Safety (Score:5, Insightful)
Its troubling to see them taking shortcuts on safety and redundancy, when such measures have resulted in loss of life before. How hard would it have been to have had three shut-off cables?
Re:The REAL reason they failed (Score:4, Insightful)
Why? Is defending a MS operating system for honest reasons impossible to believe anymore?
Re:Urgh. (Score:4, Insightful)
This item is hugely biased. It looks to me like a simple case of corrosion, which could easily have been patched up if it happened on a Mars flight. The engineers and crew all seemed to work well together, and the Russians were the ones who sorted the problem.
I don't know if the Russian Program Managers got all political against us, but the item, written by a retired NASA manager, sure as hell gets political against the Russians. He's right in one thing - the managers need to stop getting political, and I suggest he starts with himself!
It's just as well he's retired - looks like he's fighting long lost battles against cooperation with the Russians and Europeans.
Re:I hope they don't (Score:3, Insightful)
Re:The REAL reason they failed (Score:3, Insightful)
Wiring corrosion? (Score:5, Insightful)
I'm surprised that connector corrosion would be a problem. Aviation has a long history of wire problems [etsu.edu], but gold-plating connectors seems to be a stable solution to that problem. The ISS uses Kapton wire, which was popular in the 1980s and is lightweight and tough. But that material is hygroscopic and now banned by the USAF, US Navy, Boeing, etc. "Susceptible to aging in that it dries out forming hairline cracks which can lead to micro current leakage (i.e. electrical 'ticking' faults)"
There are ways to do corrosion-resistant contacts without precious metals; the automotive industry has solved this problem. The alloys aren't simple; here's one used for under-hood automotive connectors. [olinbrass.com] Copper, iron, magnesium, and phosphorus, with upper limits on tin, zinc, nickel, lead, and manganese. But avionics connectors are usually gold plated; it doesn't add that much cost. And Russia is a major exporter of gold.
The article doesn't go far enough. OK, the connectors corroded. Why? Wrong alloy? Plating failure? Wear from too many connector insertions? Was the spec wrong, or were the cables not made to spec?
Re:It's not surprising (Score:2, Insightful)
Indeed, how many russion casualties have there bee (Score:5, Insightful)
Tell me, how many casualties have the russians had in the last decade, even last two decades? This was in the days of Mir, when the russians maintained a continues space pressence year after year and the US was out of space for year after year for blowing up space shuttles.
So whose tech is behind whose? The ISS didn't plunge out of the sky when the Space Shuttle was not available, apparently the russian capability is more then enough to operate it.
And finally, who build the de-humidefier that was the fault in the first place?
Life will find a way (Score:2, Insightful)
Re:Urgh. (Score:5, Insightful)
Seriously, all of that political cold war-era cockwaving should stop.
Re:Hmmmm. (Score:4, Insightful)
If they had designed the modules for multiple lift modes, if one was NOT operational, the odds are the other would be. THAT is true redundency - 2 totally different systems, each capable of doing the job
Re:Proper debugging technique (Score:5, Insightful)
Re:Indeed, how many russion casualties have there (Score:1, Insightful)
The one thing you've got to give the Americans is that they're prepared to admit when they've got casualties. I find it hard to believe that Russians didn't attempt to launch people previously and just didn't report the failures.
Re:The REAL reason they failed (Score:5, Insightful)
We don't do honest here. We do technically sound.
Re:Urgh. (Score:3, Insightful)
Re:Urgh. (Score:4, Insightful)
"It is dismaying that after decades of experience with manned space stations, Russian space engineers still couldn't keep unwanted condensation at bay."
That's a bunch of crap. That's like saying it's dismaying that McDonald's has served billions of burgers and still can't figure out how to make them healthy.
Condensation is "still" a problem because it's one of the big and tricky ones. To get rid of the condensation, you have to get rid of the people.
Re:Duct tape saves the day! (Score:3, Insightful)
Two studies related to duct tape have been reported recently. The first was a government study of various commercial products for affixing insulation to HVAC systems, which found that every product performed well over time except duct tape. The second was a study which showed that the folk remedy for warts in which you cover them with duct tape was surprisingly effective.
There you have it: amazingly versatile, as long as you don't ask it to do what it's supposed to.
Re:Proper debugging technique (Score:1, Insightful)
Look, the Americans, as people, are all right. But their management in the space program is obsessed with face. They feel that admitting any faults demeans the American pride and the American people. You can laugh but that's how it is.
REAL (Score:2, Insightful)
I love this, rather than discuss the real issues, /. can't even talk about other computers without bashing MS.
Re:The REAL reason they failed (Score:5, Insightful)
Millions, nay Tens of Millions of people give Microsoft and their products "the time of day." People who have no dogmas or political agendas when it comes to computing. People who just see a computer and its software as a tool to get their desired job done. And not just MBA or Administration types, but also millions of software developers and network administrators and such.
I don't think Windows is perfect, but I also don't think OSX is perfect nor do I think that Linux or any flavor of Unix is perfect. I do think that the O^n usefulness of the Windows install base provides so much opportunity that it ends up offering the most value to businesses and consumers.
And with regard to their "self serving" ways... many on slashdot are anti-business or at least anti-corporation. They adopt the FSF malarkey that all code should be given away free. I put food on my family's table by developing software and the notion that it should be given away free just misses the mark. Market-based economics can bring out the best in innovation, which is why America has some of the highest paid and most productive workers in the world.
Slashdot is full of idealistic college students and 20-somethings (of which I am a part) who think that corporations are "evil" and that we should all wear birkenstocks and eat crunchy granola and spend our days writing software that solves a problem that's already been solved on a Windows platform and then give it away for free just so we can say we fought the good fight. It's naive. Say what you want about Microsoft, but that company, and the efforts of billg have made THOUSANDS of people millionaires and probably a handful of billionaires, too. Many of those people took that money and started their own software companies solving their own unique, novel problems, and on their own hiring employees and fueling the economy and probably making a lot of those people millionaires, too, who perpetuate it.
Business is good for all of us. Economic success and security is good for America.