The Forgotten Huygens Experiment 556
jdray writes "An experiment onboard the Huygens probe didn't run as planned because someone forgot to turn it on. The team lead for the experiment has put eighteen years of his life into the project, just to watch it not happen after a seven year ride to its destination on Titan."
Only one word can be used to describe this... (Score:5, Funny)
Doh (d)
Interj.
a) A Gen-X colloquialism conveying an overall feeling of frustration.
b) Used to express a feeling one has after realizing they have been tricked, misled, scammed, swindled, etc..
c) Used to boast or chide the victim of such tomfoolery
d) Coined by the animated sitcom character Homer Simpson in the mid to late eighties, "Doh" is similar to other one word, one syllable explicatives in that it is a quick and succinct summary of one's aggravation, but differs in that it was an accepted substitute to similarly censored words.
Re:Only one word can be used to describe this... (Score:4, Informative)
D'oh (Score:5, Funny)
Re:D'oh (Score:5, Informative)
Re:D'oh (Score:2)
Re:D'oh (Score:5, Funny)
Try small-town Quebec. That's where real french is spoken.
Uh, wouldn't it be France where real French is spoken?
Re:D'oh (Score:4, Informative)
Re:D'oh (Score:3, Interesting)
I've heard at least three claims to be the "best" French:
Quebec, because they use the fewest anglicismes (thanks in part to the Office quebecois de la langue francaise [gouv.qc.ca]), though their French maintains features that are archaic in France.
Liege, where les Liegeois universally claim they speak the best French.
The Loire Valley, where la Touraine is supposedly the best dialect of the bunch.
Moi? J'sais pas...(Me? Dunno...)
...laura
Re:D'oh (Score:2)
Re:NITPICK Re:D'oh (Score:3, Funny)
Yes, and I'm not sure why we'd have to go to French to get a meaning for D'oh. Homer is English-speaking and "dough" is an English word. Plus it it what doughnuts are made from. I figure since Americans shortened "doughnuts" to "donuts", and Homer is lazy and likes donuts, he just shortened it to "D'oh". (=
It also could be a contraction for "damn, oh!" or preferably "oh damn".
Re:Only one word can be used to describe this... (Score:2)
Re:Only one word can be used to describe this... (Score:2, Interesting)
Let's practice:
Me: "'Duh' and 'doh' are not the same thing, duh!"
You: "Doh!"
Re:Only one word can be used to describe this... (Score:2)
so, he could use 'fuck' just as well.
Re:Only one word can be used to describe this... (Score:4, Informative)
(I was reminded of this by a story on NPR this morning.)
Sad :-( (Score:2, Insightful)
Re:Sad :-( (Score:2)
Then those others split the lists to even smaller lists and pass them to others.
That increases the number of ways things could go wrong. Even if you have list items on some lists to check that other people are doing stuff on their lists, stuff happens...
Re:Sad :-( (Score:3, Insightful)
It's a pity that the comand to activate channel A on the Cassini spacecraft was not sent as data was lost, but one can only hope that future missions do not make the same mistake.
Incidently this event demonstrate w
Re:Sad :-( (Score:3, Informative)
Re:Sad :-( (Score:4, Funny)
Sure they have!
They just realized what was at point 2.
Redundancy... (Score:4, Insightful)
This post is from memory. Please feel free to correct errors and ridicule me for factual inconsistencies.
Re:Redundancy... (Score:4, Informative)
Re:Redundancy... (Score:2, Insightful)
Re:Redundancy... (Score:4, Informative)
This is totally false, as an scientist will tell you. Quantity is a characteristic of quality. In this case, splitting the data stream is actually the best choice, because you get both redudency of the communications infrastructure, but you also get redundency of data. The thing to remember is that there was a limited communications window and increasing bandwidth meant that the quality of the data they were going to get back would be greatly increased. They didn't affect redudency because, while they didn't get exactly the same data, they got a working subset of it.
Re:Redundancy... (Score:4, Informative)
Assume a probe has the bandwidth to send a total of 8 images on two channels, 4 on each channel, on the way down. Each image will be represented by the digits 1 through 8.
With exact redundancy:
A: 1357
B: 1357
With alternating redundancy:
A: 1357
B: 2468
With exact redundency, if you receive both channels, you only get 4 total images in the end. With alternating redundancy, if both channels work, you get up to 8 images in the end. But if one fails, you still get 4, just like under exact redundancy. Thus, it seems like the better choice because you get twice as many images if both channels happen to work, but both techniques still send only 4 images if one channel fails.
Re:Redundancy... (Score:2, Insightful)
Non-substitutable experiments on the other hand...
Re:Shame they were only black and white. (Score:3, Informative)
Re:Shame they were only black and white. (Score:5, Insightful)
Black and white sensors have higher resolution, just as black and white film has higher resolution. Resolution is more than the number of pixels, it's the valuable ability to resolve actual data with those photosensors.
Your little consumer digicam that did not cost a hundred thousand dollars is arranged with cheap little colored filters, cutting out over half of the photons that arrive in the camera, just so you can get the right shade of pink on your girlfriend's tummy. Scientists would rather collect all the photons they can, thanks.
Scientists do use filters now and then. Spirit and Opportunity use black and white cameras, but they can use something like NINE different filters to block out all frequencies except certain bands of interest. They don't just select Red, Green, Blue, but also various bands of near and far Infrared and Ultraviolet too. Those probes were designed later, and were going to be used on a longer mission, where power and available light energy would be greater. Huygens was built earlier, and going to a distant and dark moon where they'd be lucky if the probe lasted a couple of hours.
Is their logic still a mystery to you?
Re:Shame they were only black and white. (Score:4, Informative)
To capture a colour image using CCD arrays, there are the cheap, the expensive and economic ways. The cheap way (consumer cameras) is to place itty-bitty colour filters over the entire CCD array. In this way each cell captures either red,green,blue or white.
The expensive way is to have separate CCD chips for every wavelength of light you want to capture.
However, when an image is captured by a CCD array, there is a very small amount of bleed from one CCD pixel into it's neighbours. You can compensate for this by making use of image process techniques like convolution/sharpening. But these methods are completely useless with the cheap way of capturing colour images (each of RGBW will have blended with its neighbours of a different colour).
This can be done with the expensive way (professional digital cameras), but you are restricted to three wavelengths of light.
Alternatively, you can have one CCD chip, and a series of calibrated colour filters that can be swapped over. In that way you, have a low energy budget of one CCD chip, and the flexbility of analysing a scene in multiple light wavelengths, each of which can be processed separately.
In a strange way, it makes me feel better (Score:5, Funny)
I forgot to turn on my cellphone this morning, and missed a call from someone dear to me. Still, reading this makes me realise that somewhere out there, someone is feeling even worse over forgetting to turn something on.
Re:In a strange way, it makes me feel better (Score:5, Funny)
Sincerely,
Your Father
Re:In a strange way, it makes me feel better (Score:4, Funny)
Re:In a strange way, it makes me feel better (Score:5, Funny)
Re:In a strange way, it makes me feel better (Score:2)
Shit happens. (Score:4, Insightful)
We should really praise the gods that the rest of Huygens mission was a grand success.
Re:Shit happens. (Score:2)
Yeah it's a success. Although you cannot guarantee that nothing will go wrong, making precautions to prevent everything from going wrong will add to the overall cost. The had two channels to transmit data, it's not easy to prevent a human error like this. They simply forgot to turn it on.
Grand success? (Score:2)
Thats not a grand success. Compared to crashing a lander into Mars, losing everything? Yeah, its probably a grand success. But thats a pretty low standard.
Re:Grand success? (Score:2)
You are wrong [slashdot.org]
Re:Shit happens. (Score:5, Insightful)
I may not have got it all right on the first go around, but you can rest assured, i got it right after the testing and before it was deployed...
In my primary field of work, 'shit happens' is just not an acceptable excuse, I'm a pilot. We use checklists precisely for that reason, to make sure that shit doesn't happen. Every flight has a few phases where even one minor screw up can have serious consequences, so we have checks and balances built into the system to make sure that small screw up does NOT happen.
I know the software folks here on /. always want to make excuses about 'its hard' and 'its to complicated', but, it's actually not hard, and not to complicated. complex systems are designed and built every day in the aerospace field, systems that many lives depend on. We take it for granted that they are properly designed with failsafe modes, they can deal with problems on the fly, and they do not puke up and die when things become abnormal. Same goes for our crews, they train extensively to make sure they fully understand all operational modes, and they can deal with them. Once that's all done, we write books full of checklists, to make sure the details do not get missed at a critical time.
'I forgot' or 'shit happens' is just not an excuse. In reality, it's an admission of unprofessional conduct. Billions of euros spent, many many man years of effort, and you want to take 'forgot' or 'shit happens' as an acceptable excuse? there is no acceptable excuse, those are just admissions of shoddy management and operations. Those are terms that are not even in the vocabulary of true professionals.
Every time I read here on /. about how 'professional' programmers seem to think that it's to hard to actually take the time and effort to write failsafe code, and test it as such, I ask myself how many people would die if thier attitudes were used developing the flight management systems in our aircraft.
Thanks to government regulations, i can only fly 9 days a month, that leaves me with a lot of time to operate my other business. We do software development, embedded systems for mission critical applications. We do deploy equipment into life critical situations, so, for our work, 'shit happens' and 'i forgot' just dont exist in the vocabulary. We use checklists to ensure that all testing covers all forseeable abnormal conditions, up to and including partial failure of various hardware. for your typical 'desktop' developer, equivalent testing would be along the lines of making sure programs handle gracefully things like having the hard drive removed from it's computer while the program is still running. They may not function at full capacity anymore, but it's not reason enough to have the thing just puke up and crash, it needs to fall into a failsafe mode that's prepared to deal with the detail of 'no local storage available anymore'. the code to handle this scenario will likely not 'get it right' on the first try, but, it'll surely be right before the product goes into release.
Looking at the money spent, and the multitude of man years spent on developing the lander for this mission, to hear that a significant experiment was lost becase somebody forgot to turn it on, is just beyond comprehension. this goes way beyond unprofessional, and well past the line we would draw for 'incompetent'.
Re:Shit happens. (Score:2, Insightful)
Yeah , theres *never* been any inflight problems in aircraft due to the computers or other systems has there. Though a couple of
Re:Shit happens. (Score:3, Insightful)
>You cannot forsee all abnormal conditions
I think that's why he said all forseeable (sic) abnormal conditions. That subset must by definition be foreseeable :-)
Re:Shit happens. (Score:2)
Oh, so if you don't mind, why exactly do airplanes crash and kill hundreds of people?
Re:Shit happens. (Score:3, Informative)
I'm going from memory here, but I think it's something like 80% pilot/crew error, 15% weather (which could still be considered pilot error), and 5% mechanical failure. So if those pilots/crew had been following the proper procedure and protocol, then the shit *wouldn't* happen (except in those rare cases where mechanical error is to blame, and even then, most of those cases can be traced back to a mechanic cutting corner
Re:Shit happens. (Score:2)
Re:Shit happens. (Score:5, Insightful)
In your field of business, I imagine you cannot easily deploy quick fixes (to embedded systems), and major bugs in life critical situations are obviously not acceptable. So you do rigorous tests and code reviews. In my line of business however, bugs are acceptable. Sometimes a bug makes it into production... users will moan, and we'll have to spend a bit extra on writing and deploying the fix, but the cost is lower than doing a full test on every release.
I agree with you that software developers should realise the importance of testing, and take a critical look at their own testing and coding procedures... often it isn't that hard or expensive to make real improvements.
Re:Shit happens. (Score:3, Insightful)
You pay to do it right, or you pay to do it wrong, pay to clean it up, and THEN pay to do it right.
Test scripts are your friend. If you haven't been introduced to TCL (Tool Command Language) yet, you should seriously think about it.
Re:Shit happens. (Score:4, Insightful)
Well, that's kind of nitpicking. Although "time is money" is just a slogan, it does point to the fact that both timeline and money are constraints that affect test coverage that can be done. And cost/benefit analysis should be done for testing as well as for implementation: proper amount of testing to do is a compromise based on many things (type of system, expertise of implementers, aggressiveness of implementation/release schedule etc. etc.). So I would argue that it's ALWAYS about cost, in broad sense (delaying a release costs money -- that's the main reason to avoid delays).
And finally, there are cases where defects just are cheaper to have, than doing rigorours testing. Like everything in software engineering, impact of defects is relative; there are no absolute guidelines.
Re:Shit happens. (Score:3, Insightful)
Which is precisely why there has never been a software glitch in a plane system. You know, like the TCAS system which saw ghost planes and told pilots to avoid them (noted in IEEE Spectrum [uni-bielefeld.de]), or any of the cases cited here [nasa.gov] or here [ncl.ac.uk]. Nope, aer
Re:Shit happens. (Score:3, Insightful)
Shit does happen. People skip over items on checklists every day. Little things break constantly. Usually it's not enough to cause a catastrophic failure. Now, whoever was in charge of the specific checklist DID screw up, and they screwed up hard, and they need to own up to that. But the potential for failure is part of complex systems and the human element is part of that.
The OPs rant about software is just stupid, though. Software is com
Re:Shit happens. (Score:2)
It's not that it's too complex, it's that it's too costly for most applications. In a system where 1 bug can kill hundreds of people, it's OK to spend huge sums of money on processes, tools, people and time to deliver perfect software, but most people would not be prepared to spend vast amounts more for a version of Word (or whatever) that never cras
Re:Shit happens. (Score:2)
Re:Shit happens. (Score:3, Insightful)
The system of checklists you are using has been finetuned over many decades and probably *millions* of flights. And your operating procedures evolved alongside the hardware.
I'm sure on their millionth flight, the Cassini operation would be just as airtight.
If we were to turn back the clock to the first weeks of commercial airline travel, I imagine things were quite a bit different than the industry you describe.
Re:Shit DOES happen...and HAS happened. (Score:3, Informative)
Charles Perrow has an excellent analysis of those type of accidents in Nuclear Plants, Petrochemical industries, Aircraft & Airways, Dams etc.
(Normal Accidents: Living with High-Risk Technologies, by Charles Perrow, Basic Books, NY, 1984.) http://oak.cats.ohiou.edu/~piccard/entropy/perrow. html [ohiou.edu]
Most of these accidents and failures were not the result of lack of money or due to opera
Re:Shit happens. (Score:4, Insightful)
Admittedly lives don't depend on 90% of the software any of us here writes, but that isn't to say it isn't complex or demanding and requires complex, demanding testing to ensure high standards of reliability.
If those resources aren't allocated, then I'm afraid 'Shit Happens' is very definitely an excuse.
Re:Shit happens. (Score:3, Interesting)
It's always a balance between the probability of a given failure, the concequences of a given failure, and the cost of adapting to that error. Different types of projects have different ways of looking at this balance.
What are the chances of a HDD being removed (or totally failin
Re:Shit happens. (Score:2)
Uh, bullshit. Arrogant presumption on your part. You have no idea as to the total experience he has in that area, AND are apparently overrating length of time as a quality indicator.
Re:Shit happens. (Score:3, Interesting)
Agreed, however I think we're talking primarily about mission critical systems in this case...situations where lives can be lost, or big bucks wasted. Not switching on the system, is pure incompetence...heads should roll (or maybe I need to RTFA).
As a defense contractor, our customers usually have high expectations (and rightfully so), though it may vary dependi
Re:Shit happens. (Score:2)
Just this past week. Actually, it was a little less than 200 lines.
I was kind of surprised it compiled and ran exactly right the very first time.
The only problem was the 9 megabytes of input data contained numerous errors.
Human error (Score:3, Funny)
losing data? (Score:5, Funny)
three words for you... (Score:5, Funny)
women come, women go, but pr0n is forever
Suchetha
Re:three words for you... (Score:2)
There are just so many jokes to make on the women come, women go -
Comment removed (Score:5, Funny)
Nothing was lost, all data is safe (Score:5, Informative)
Two redundant radio channels were used to get data from the lander to the orbiter, which relays the data to earth. The signal for the orbiter to start listening on the high-sensitivity channel, channel A, was never given. The data was transmitted redundantly on both channels, except for images and the output of the Doppler wind speed experiment. Fortunately, all was not lost, as scientists donated radio telescope time around the earth to search directly for the A signal, despite it not being relayed via the orbiter. Thanks to this increase in sensitivity, the data acquired was good enough to fulfill all objectives of all experiments.
So everyone can relax and get one with the analysis...
Re:Nothing was lost, all data is safe (Score:2)
Good point, Zen science is the best kind. Less stressful.
Now, if only I could really become one with my thesis and finish writing the damn thing...
Slashdot post is wrong (Score:5, Interesting)
This is what the slashdot post says...
But I got this out of your linked article...
And I also found this [nwpr.org] article online. Here's an excerpt...The reports are confusing and I can't tell what happened. Was there a measurement device onboard the Huygens probe gathering data and transmitting it (like the Slashdot story suggests), or was the data supposed to come from the measurement of the signal from the Huygens probe in relation to the Cassini orbiter?
If it was the former, is the data not as good because the Earth radio telescopes didn't pick up the entire signal, because there was signal degradation, or because they have to piece all the data together from all the different radio telescopes? If it was the latter, is the data not as precise because of the proximity from the transmitter to the receiver?
Either way, the Slashdot post is wrong. If it was a measurement device solely on the Huygens probe, it was turned on- it was the relay onboard the Cassini orbiter that wasn't turned on. If the data was meant to be gathered from the proximity of the transmitter to the receiver, then the experiment wasn't onboard the Huygens probe but was actually meant to be a collaboration between the probe and the orbiter.
Don't worry (Score:4, Funny)
They send these missions all the time don't they?
Doppler Wind Experiment: "I'm not dead yet!" (Score:2)
Also, the most recent ESA press conference on Huygens stated that they are trying to recover data from the ground telescopes (which they are now referring to as Channel C), although it was unclear if this would be just the signal's Doppler or actua
Re:Doppler Wind Experiment: "I'm not dead yet!" (Score:2)
What about the grad students? (Score:4, Insightful)
Re:What about the grad students? (Score:4, Insightful)
grad students (Score:2)
Jackpot! (Score:2)
Everybody knows the Huygens probe. Nobody knows any of the scientists who worked on the probe.
Now, David Atkinson has become a name.
Granted, he's that-guy-of-the-Huygens-mission-who-was-screwed- out-of-his-experimental-results-due-to-a-stupid- mistake-on-the-part-of-a-programmer, but at least people know him!
He can write a book! He can go on talkshows!
He can even use the fact that he is now a name to get more funding!
After the initial feeling of depression, he should real
No problem... (Score:5, Funny)
Might sound a bit hard, but it's the only way they'll learn.
Latest results from analysis (Score:5, Informative)
The first scientific assessments of Huygens' data were presented during a press conference at ESA head office in Paris on 21 January.
Results include:
Re:Latest results from analysis (Score:2)
I know it's a heck of alot further up the Sun's gravity well, and there might be some harsh radiation near Saturn.
Is it doable?
Re:Latest results from analysis (Score:2, Interesting)
From what I have heard, this is one of the reasons the probe had such a short lifespan, batteries don't last long at these kind of temperatures.
I suspect that this would make even a rover type robot quite a difficult challenge.
Why does the "how" matter? (Score:2)
Isn't it wierd how bad an event is can depend so heavily on how it happened? I mean, I'd be pissed if this happened to me but I think I would be less pissed if it was due to component failure rather than human error (actually, sounds more like process or Q/A error, but I digress). And even if it was component failure, couldn't that (often) be ultimately traced back to human failure somewhere further up the line.
I dunno, we humans are strange.
Recovering lost data.. (Score:3, Insightful)
As someone who has been involved in large coding projects (100,000 lines +) while I understand how easy it is for bugs to creep in, I do think the programming bug that effectively did not switch on the second channel should have been picked up on a project of this size/budget. Sadly, too often, the bigger the bureaucracy, the more mistakes like this you have - small keen teams often do better.
Regarding image quality on Huygens - in hindsight could that have been done better?
I realise there are constraints - 80's hardware, limited batteries, 8k bit channel, etc, but here are my casual observations..
Much higher resolution CCD's were available at the time - Cassini had a 1 megapixel unit. Low res data could have been transmitted during descent, but hi-res data could have been stored & broadcast after landing. As it is, the radio spent a lot of time sending identical images of the landing site. Another idea that gets a lot more out of a video data stream is variable jpg compression & only transmitting the signal difference between certain frames. That way you can use hi res CCDs then compress-until-it-fits the 8K data channel. When there is a lot of data/change in the pictures you compress a lot, but if certain cameras are not returning any or little change in the pictures, or if the picture has no detail, more channel space is available to send either hi-resolution or even pre-recorded data.
Furthermore, why the assumption that the probe will be destroyed on landing? Why not switch off Huygens when Cassini dissapears below the horizon, and switch it on for the next day? (titan's day is 16 days long..) The batteries lasted many hours after the landing, and the craft did cruise in standby mode for 16 days, so this might have been possible.
I think they could have returned all the data we got anyway up to the landing, and designed a 2nd phase with more data being sent, with little change to mission profile/weight/etc..
One thing I dont understand - why are the triplets out of sequence? The early pictures show the landing site! Is this just some artifact of the transmission process?
If I didnt know any better, I would say that final picture of the rocks was just a "joke" by the programmer, a frame to put in when the data/checksum fails for that camera..
What about using the audio? (Score:3, Interesting)
Ouch (Score:2)
Blackadder did it too.. (Score:5, Funny)
Baldrick: You mean the big papery thing tied up with string?
E: Yes, Baldrick -- the manuscript belonging to Dr. Johnson.
B: You mean the baity fellow in the black coat who just left?
E: Yes, Baldrick -- Dr. Johnson.
B: So you're asking where the big papery thing tied up with string belonging
to the baity fellow in the black coat who just left is.
E: Yes, Baldrick, I am, and if you don't answer, then the booted bony thing
with five toes at the end of my leg will soon connect sharply with the
soft dangly collection of objects in your trousers. For the last time,
Baldrick: Where is Dr. Johnson's manuscript?
B: On the fire.
E: (shocked) On the *what*?
B: The hot orangy thing under the stony mantlepiece.
E: You *burned* the Dictionary?
B: Yup.
Some Precedents (Score:2)
"Hey, perfessor, lookit, we got us a high resolution image of the 'REMOVE BEFORE LAUNCH' galaxy!'
Suddenly Heard on Channel A... (Score:2, Funny)
These things keep happening, it seems... (Score:2)
Re:These things keep happening, it seems... (Score:2)
Well (Score:2)
Of all the .....! (Score:2)
Obviously, this would have to be a kamikaze mission. But is that such a bad thing anyway? Throughout history, explorers on Earth have set out knowing they might not return -- and many did not return. Even Christopher Columbus knew there was a risk that he could be wrong about the Earth being round. Of course it's best if you can be a living legend, but if you have to settle for one out of two then it's surely better to
Yeah, bummer (Score:2)
You Think (Score:2)
I guess it'll be there for the next mission...
eighteen years (Score:2)
18 years of his life? (Score:3, Interesting)
Looks like they've recovered the data (Score:3, Informative)
Re:Depression. (Score:2, Informative)
Officials for the European Space Agency said last week they would investigate to learn what happened. They were not available for comment on Thursday, nor did NASA officials immediately respond to telephone messages...
Yeah, both groups know how to shut up and watch each others backs when it all goes f*@$ ways : )
By the way, I thought they ended up using channel B because they did not take the full effect of the Doppler shift for channel A into account. Can anyine correct me on this? ..
Re:Depression. (Score:3, Informative)
But the order to activate the receiver, or oscillator, for Channel A was never sent, so the entire mission operated through Channel B, which is less stable, Atkinson said.
I guess it makes it even worse that it was purely a human error.
Re:Depression. (Score:3, Informative)
See also what I reported on saturday [slashdot.org].
Re:Backup (Score:2)
Re:CRAPNeal (Score:2)
Re:Ouch... (Score:3, Interesting)
You are confusing bad luck with incomptence. it would be bad luck if the system failed when told to activate. It's incomptence if they left the activation codes out of the system. This is a case of the latter, not the former.
Re:Already done it (Score:3, Insightful)
Re:So it Goes (Score:3, Informative)
Huh? Simply because failure is not an option does not mean that failure will not occur. (And note that on three out of three high altitude flights SS1 had significant problems.)
NASA didn't make the mistake... The ESA did. The ESA screwed up the command sequences they gave to to NASA to transmit. (The Huygens reciever onboard