National Virtual Observatory

National Virtual Observatory 66

Posted by michael on Sunday December 01, 2002 @08:22AM from the number-crunching dept.

scubacuda writes "According to this Technology Review article, U.S. astronomers (compliments of a $10M grant from the National Science Foundation) are building a National Virtual Observatory to make accessible terabytes of astrononomical data to a web browser. One interesting challenge is how the scientists are going to query so many *different* distributed databases (which they're leaving in their respective places to avoiding clogging network bandwidth)."

National Virtual Observatory

This discussion has been archived. No new comments can be posted.

Search 66 Comments Log In/Create an Account

Comments Filter:

"Open Source" Knowledge (Score:3, Interesting)

by otisaardvark ( 587437 ) writes: on Sunday December 01, 2002 @08:50AM (#4787188)

IMHO this is an incredible phenomenon. For the first time in history, we have been able to access a huge subset of academic literature and data for the (fairly minimal) cost of an internet connection... Many university lecture course notes are completely available on the WWW. The internet could prove to be the single factor which contributes greatest towards equality of educational opportunity for all around the world. Will education will lead to (economic) salvation?

The problem of data interfaces and the layman (Score:5, Interesting)

by JayBonci ( 92015 ) writes: on Sunday December 01, 2002 @08:57AM (#4787196)

From what the article reads, it seems to be a very ambitious and interesting project. Very rarely do you see people trying to get together to spread information out to the web in such a fashion. The major problem in my (and I can imagine in their) mind is of format? How can you accomodate the mythical layman's and his or her inherent lack of skill, and still have it be available for advanced researchers to make use of.

It seems that there is simply going to be a huge amount of data-cross referenced and collated. From the second page of the article, it seems to include pictoral data. I also hear talk of XML being thrown around, which is a good start, but there's a lot that goes into that transition. Are they looking to set the layman bar at "your novice astronomer", "the third grade science report", or "grad student". Where is this information really being targeted at the sub-obscure level.

While I don't want to trivialize their massive IT effort, it seems that a lot of this is going to come down to the end user of the data. Their sample study [caltech.edu] using this information isn't trivial stuff, and does seem to set the aforementioned bar at somewhere in the undergrad-graduate level. Perhaps that is the nature of the data (I'm not that familiar with it). There's an XML schema, some request examples, and other framework stuff already in place to view by potential client writers.

I'm glad to see XML being done the right way (by collaboration with its end users), and those pictures /numbers being available for public research. Maybe someone will throw together an inverse Terraserver [msn.com] or something with Whiz-bang true-layman appeal. Until then, the geeks bow at the effort, because man, space is BIG.

Anyone closer to the project know of any simplification efforts?

--jaybonci

Seems like $10 million might not be enough (Score:4, Interesting)

by abirdman ( 557790 ) writes: <[abirdman] [at] [maine.rr.com]> on Sunday December 01, 2002 @09:17AM (#4787225) Homepage Journal

"an electronic catalog of images in multiple wavelengths spanning half the northern sky--100 million celestial objects in all, encoded in four databases...and combine it with other, smaller U.S. and international surveys, including some maintained by the United Kingdom, Australia, India, and European Union. "

"...optical telescope images and gamma ray, infrared, radio, ultraviolet, and X-ray snapshots of the heavens"
Now, that's a lot of data, in a lot of formats. Given the economics of software development and data transformation and conversion, I wonder what they'll be able to accomplish with $10M, beyond some XML data format definitions and some shiny new infrastructure (that gigabit network interface isn't cheap).

All in all, though, it seems like a good use for those tax dollars. The "Google" of astronomy research is an attractive idea, and I know we'll get some great new acronyms in the deal.

Virtual astronomy (Score:4, Interesting)

by xdesk ( 550151 ) writes: on Sunday December 01, 2002 @09:51AM (#4787276)

What about a peer-to-peer network of amateur astronomers running highly-computerized telescopes and a special P2P program ? If the program is really good it will be able to discover automatically interesting things - like potential objects that might collide with the Earth !!! A project like this (even one slightly subsidized by public funds) can certainly be VERY cost-effective - and unlike much bigger projects can be started rather quick. And if you think that ever since the Apollo program the budget for space is smaller and smaller this might actually be the only effective way to avoid the same fate as the dinosaurs!

Re:How much will this data get re-analyzed? (Score:2, Interesting)

by niall2 ( 192734 ) writes: on Sunday December 01, 2002 @12:31PM (#4787701) Homepage

Just as an example...each data set from HST gets downloaded and used more than 5 times by different projects. Much of this is to suppliment other observations or to plan for future observations. And with the growth of imaging CCDs on HST, the number of objects in a single frame grows as well, leading to a lot of parallel usage of a single image. In the end, I doubt that every use for every frame within a 7+ terabyte archive gets used. The VO will help with this.

As another example, people still use the plate archives at Harvard. Many of these plates are over 100 years old. Astronomical data gets reused.

Making it accessible to lay people is important (Score:3, Interesting)

by Simon Field ( 563434 ) writes: on Sunday December 01, 2002 @01:52PM (#4788107) Homepage

While the main benefits of the virtual observatory will be to researchers, the $10 million is only the start, and more money will be needed, and the way to get more money is to make it popular with voters.
There are two examples of indexing large databases for the masses that come to mind. One is Google, and the other is Amazon.
Google ranks items by how popular they are, based in large part by how many links there are to the web page. Amazon gives you a list of books other customers bought when they bought the book you found in your search.
For astronomical data and images, something like those approaches could be quite entertaining. I could go to a popularity list to see which images and data everyone else was looking at (a million flies can't be wrong...). But then, like the Internet Movie Database, it would be fun to see other images and data that was most often found in the same papers or web pages as this item. Somewhat like the Science Citation Index (or the Kevin Bacon game).
Users could also rate the images and data. Then we could have lists such as "people who liked this nebula also liked these HST photos". Images could be grouped by popular use -- "Images most often used as wallpaper", "Images most often used by science magazines", "Data most often used by newspapers", etc.

Re:How much will this data get re-analyzed? (Score:4, Interesting)

by KjetilK ( 186133 ) writes: <kjetil@@@kjernsmo...net> on Sunday December 01, 2002 @02:01PM (#4788148) Homepage Journal

Grad students. Doing their thesis on a topic, use archival data to support.

To elaborate on that, at my (old) institute [astro.uio.no] people are discouraged from disembarking on a thesis that requires them to obtain original data, it is too risky.
To get observation time, you would have to write a really good proposal; most major observatories have at least three times as many applications as they have time for. If you're lucky enough to get time, it is maybe half a year into the future, and you're getting three nights to complete everything.
You spend that time preparing everything, just to come down to the observatory, and you're in the fog for three nights! Tough luck, you've spent all that time preparing, and you're now one year behind schedule...
I did three observation runs during my thesis work , two as Observing Astronomer (who is kind of the guy deciding what to look at when and for how long when at the telescope, the PI is the guy who decides what the project is about). My own thesis was purely theoretical, and I was happy about that, because we experienced having a total of ten nights (it is rare to get so many nights, it was a world-wide collaboration), and we got one full night + 3 hours on two other nights worth of observation. It's extremely frustrating to sit there getting nothing because of humidity, I can tell you, and if that had been a part of my thesis, I'd be in deep trouble.

Re:P2P as an alternative (Score:2, Interesting)

by DirtyJ ( 576100 ) writes: on Sunday December 01, 2002 @02:22PM (#4788254)

Actually, another problem with P2P between individual astronomers that I forgot to mention: Data takes up a lot of space. I've done work using mosaic cameras (where multiple CCDs are butted up together to make a large array) where single images are ~250MB. A single night's observing can produce many GB of data. Most people don't have enough disk space to store all of their data indefinitely. Most of the time, you do your work with it, and then write it to DLT or Exabyte and delete it from the hard drive to make space for the next data that you collect. In this sense, P2P wouldn't work well because most of the data which people have would not be easily accessible from remote locations.

Why web browser? (Score:1, Interesting)

by Anonymous Coward writes: on Sunday December 01, 2002 @07:05PM (#4789635)

Why is this database being built to be accessable through a web browser? Surely custom client software would be a vastly more efficient method of manipulating remote databases?

Just because the web exists doesn't mean that it should be used for everything, even if it can, especially since this project isn't going to be accessable to the general public. A small custom cross-platform client application would make much more sense depending on the data being accessed - it would probably allow for more efficient automation of searching and repetitive tasks as well by not having a completely dumb client.

I hope they considered what tasks the end-users will actually be doing with the data and are going to allow them the flexibility to be creative in their manipulation and searches.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

National Virtual Observatory 66

National Virtual Observatory More Login

National Virtual Observatory

"Open Source" Knowledge (Score:3, Interesting)

The problem of data interfaces and the layman (Score:5, Interesting)

Seems like $10 million might not be enough (Score:4, Interesting)

Virtual astronomy (Score:4, Interesting)

Re:How much will this data get re-analyzed? (Score:2, Interesting)

Making it accessible to lay people is important (Score:3, Interesting)

Re:How much will this data get re-analyzed? (Score:4, Interesting)

Re:P2P as an alternative (Score:2, Interesting)

Why web browser? (Score:1, Interesting)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot