In a science such as astronomy, information handling encompasses data collection, analysis and dissemination, as well as the way astronomers publish, interact and communicate, including with other communities, with amateur astronomers and with the public at large.
The main aim of astronomers is to contribute to a better understanding of the universe (as well as of its past and future) and consequently to a better comprehension of the place and role of humans in it.
To this end, together with theoretical investigations, they carry out observations to obtain data that subsequently undergo treatment and studies leading to the publication of results. The whole procedure can include several iterations between the various steps as well as with external fields, non-scientific disciplines, instrumental technologies and information handling methodologies.
In the following, the concept of information will cover the observational material, the more or less reduced data extracted from it, the scientific results and the accessory material used by scientists in their work (bibliographical resources, yellow-page services, and so on), as well as the communications and publications of all kinds.
It should also be noted that, contrary to other scientific disciplines, astronomy has the peculiarity of not being able to interact with the objects it investigates (except for very few bodies of the solar system where in situ studies can be carried out by spacecraft). Consequently all our knowledge of the universe has so far depended almost exclusively on the photons reaching us from outer space.
Astronomy has thus to rely on the ingenuity of instrumentalists to conceive and design a whole range of observing tools exploiting at best the latest technologies and the most sensitive detectors to obtain the most relevant and most varied information allowing progress of astronomical knowledge. The current trend is also towards panchromatic astronomy, i.e. combining information from the various wavelength ranges of the electromagnetic spectrum (radio, infrared, visible, ultraviolet, X, Gamma, ...), instead of restricting oneself to specific ranges as used to be the case in the past.
Compared with the past too, ever larger amounts of data are being collected, and the rate will probably continue to accelerate. Instruments such as NASA/ESA’s Hubble Space Telescope (HST) or ESO’s Very Large Telescope (VLT) are generating or will generate annually a volume of information of the order of terabytes of data. The rate of increase of observations is also matched by the diversification.
Until not so long ago, an astronomer could also work individually from the conception of a project through to the collection and analysis of data. Nowadays, as instrumentation has become more complex, teams of researchers, most often international ones, have become necessary and they quite naturally include technologists or instrumentalists.
At the other end of the chain, the teams are more and more including methodologists, i.e. specialists of information handling. If image processing is a natural consequence of a sophisticated technology, information handling rather qualifies whatever happens to already well-reduced data.
Astronomy and related space sciences have always been at the leading edge of new information technologies, often testing, contributing to and pushing for new developments, be it for the access to databases, the usage of networks or the initial explosion of world-wide web (WWW) sites, as well as for remote observing (spacecraft and ground-based telescopes remotely operated).
The recent dramatic evolution of communication and information technologies had a deep impact on the way the astronomy community interacts and works, in other words on its dynamics, and one can certainly expect more evolution in the future as the corresponding technologies will bring in new potentialities.
Information handling in astronomy thus reflects the way astronomers work and ensures progress of the astronomical knowledge which is then shared with colleague scientists as well as, on a less specialized level, with the significant community of amateur astronomers round the world (also a phenomenon proper to astronomy) and with the public at large since cosmic perceptions have always been a fundamental component of human culture and philosophy. Astronomers are of course also deeply involved in education at all levels.
As described in the following and as illustrated by figure 1, the information flow in astronomy is far from being a simple linear one. More and more of the processes and corresponding exchanges are performed electronically.
Figure 1
|
|
Figure 2
|
In the early 1970s, a number of European institutions decided indeed to collaborate and to create a data center at Strasbourg Observatory with the daunting task of establishing an enormous table of synonyms between all catalog identifications. This was definitely aimed at avoiding the repetition of a couple of situations where two qualified astronomers studied and published papers on the same object, but under different identifications and without realizing it.
However, more interestingly, this was allowing, with a single object name, access to all other identifiers, plus to the data listed in the various catalogs. As CDS also set up a comprehensive and very successful bibliographical object-oriented database, the references of the papers dealing with that object were immediately available by the same token. The popular Simbad database was born.
Simbad holds today more than 2 200 000 objects under about 5 500 000 identifiers, together with more than 105 000 bibliographical references including about 3 000 000 object citations. Anyone starting studying seriously an astronomical object must pay a visit to Simbad first.
Figure 2 shows other components of CDS’s information hub. VizieR is a search-and-shop individual-catalog service, with also access to large tables published in professional journals. Aladin is an interactive digitized sky atlas. Besides the references of Simbad, the CDS bibliographical service provides also access to abstracts of several major journals (see also ADS hereafter). The dictionary of nomenclature gives references and details on usage for more than 4000 different catalog acronyms. AstroGlu is a discovery tool helping to locate database servers providing relevant information.
Last, but not least, the StarPages are a set of permanently updated yellow-page resources providing detailed information on astronomy-related organizations (StarWorlds: about 6300 entries, more than 5500 WWW links), as well as on individual astronomers and related scientists (StarHeads: more than 5200 WWW home pages). They offer also access to an enormous dictionary (StarBits: about 140 000 entries) of abbreviations and acronyms related to astronomy and associated space sciences. The master files for StarWorlds have been used to produce figure 4 (see below).
Figure 4
|
A few other multipurpose resources deserve also a mention in this section.
The NASA/IPAC Extragalactic Database (NED) has been built around a master list of extragalactic objects for which cross-identifications of names have been established, accurate positions and redshifts entered to the extent possible, and some basic data collected. Bibliographic references relevant to individual objects have also been compiled.
The Astrophysics Data System (ADS) is a NASA-funded project whose main resource is an abstract service (about half a million abstracts for astronomy and astrophysics only), together with links to scanned images of over 40 000 journal articles. ADS provides also access to astronomical data catalogs and data archives, thereby making data collected by NASA space missions available to astronomers. It offers also access to the StarPages.
The National Space Science Data Center (NSSDC) provides access to a wide variety of astrophysics, space physics, solar physics, lunar and planetary data from NASA missions, together with some complementary data.
The Canadian Astronomy Data Centre (CADC) and the Astronomical Data Analysis Center (ADAC) at the National Astronomical Observatory of Japan (NAOJ) are national resources offering access to a number of catalogs, databases and archives.
Data processing
Until not so long ago, data processing consisted essentially in photographic-plate scanning and in reduction of raw photometric data from punched papertapes.
Nowadays, with the omnipresent digitization and multidimensionality of collected data, one rather speaks of image processing and a whole treatise could be devoted to it.
Indeed the time seems definitely gone, at least with the big instruments, when astronomers were coming themselves to the telescope (or ground observatory for a spacecraft) and later returning home with unprocessed data. The advanced experiments have developed their own specific image processing which is now part of the projects themselves since the development phase.
Several comprehensive image handling systems have however been developed such as ESO’s Munich Image Data Analysis System (MIDAS) and NOAO’s Image Reduction and Analysis Facility (IRAF). They can be considered as general-purpose software systems for the reduction and analysis of astronomical data and are regularly upgraded.
Complementary resources would include specific software packages and libraries (such as statistical ones), spectral line compilations, abundance libraries, opacity tables, and so on. A network such as Starlink helps UK-based astronomers to reduce and analyze their observations.
It is certainly worthwhile to mention here that astronomers have introduced the Flexible Image Transport System (FITS) which is a general way to encode both definitions of data and the data themselves and which is machine independent. The FITS format has been quickly recommended for interchange of image data between all observatories—and is now in use even outside astronomy.
Publishing and information sharing
Publishing is not only motivated by the noble aims of educating and information sharing but also strongly conditioned by career constraints involving recognition, a necessity that should not be underestimated. Recognition is sought for getting positions (i.e. grants and salaries), for obtaining acceptance of proposals (e.g. leading to data collection) and for achieving funding of projects (allowing materialization of ideas).
The pressure for recognition has contributed to the strong increase of professional publications (see figure 3), together with other factors such as the expansion of the astronomy community itself (especially after the beginning of the space age), the multiplication of large instruments and spacecraft equipped with always faster, more diversified and more efficient detectors, and so on. Commercial publishers have also put on the market more journals which were as many additional communication outlets.
Figure 3
|
The major professional journals use the peer-review procedure (‘refereeing’) for accepting, amending or rejecting submitted contributions. Albeit a matter of regular debates (on the principle itself as well as on the way it is conducted), the refereeing process has been so far the best one (or the less questionable one) to publish contributions with validated content, i.e. an assurance of good-quality, novel results obtained by reproducible experiments, calculations or analyses on which enough details are provided.
The most important general professional journals include the Astrophysical Journal and the Astronomical Journal published by the American Astronomical Society, the Monthly Notices of the Royal Astronomical Society, and Astronomy and Astrophysics resulting from the merging in 1969 of a number of European professional journals.
Astronomers communicate also via a whole spectrum of publications ranging from informal newsletters to books gathering together review papers by the best specialists on specific topics. Conferences, colloquiums, workshops and meetings of all kinds provide also efficient ways of exposing oneself to both excellent review talks and presentations of works in progress. The corresponding proceedings are published by commercial publishers, by learned societies, by research institutions or even by individuals, reasonably soon after the events.
As described in the following section, publishing is nowadays increasingly done electronically, or, better said, there is more and more of diversified publishing, i.e. of information available on different media (paper, CD-ROM, web sites, and so on). These media are not excluding, but complementing, each other. Several journals have an electronic counterpart.
As mentioned earlier, professional astronomers are also contributing substantially to less specialized publications, mainly directed towards amateur astronomers and the public at large. Many countries have their own such national journal, but Sky & Telescope is probably the magazine with the largest audience world-wide. Electronic astronomy
As mentioned already, more and more of the information exchanges in astronomy are done electronically, both dynamically (e-mail) and passively (web sites). Electronic handling is particularly well adapted to the fluid and living nature of today’s information material, to the digitized nature of most data and to the efficiency of contacts between collaborators spread over the world, as well as to the retrieval of information from the information hubs. It also makes easy the various interactions upstream and downstream, and allows diversified publishing.
Figure 4 gives an idea of the distribution, at the time of writing this article, of the astronomy-related organizations using e-mail and/or having a web site. Together with the general world distribution, more detailed views are displayed for North America and Western Europe.
The largest concentrations are located in Europe and in the USA (Northeast and California), with a few nuclei in Australia, India, Japan, New Zealand, as well as a few spots in South America. A striking feature of the central map is the desperate emptiness of the African continent. A similar comment is also applicable to quite a number of the so-called third-world countries.
In Western Europe, it is striking how France, Portugal and Spain have significantly much lower densities than their European neighbors, obviously lagging behind as to the penetration of the internet and the WWW, a few years after the electronic medium started spreading quickly over the world.
The ‘electronization’ of astronomical information handling is, however, not a bed of roses. New facilities and new possibilities bring in naturally new questions, new challenges and new problems that have to be faced, especially at the ethical level (proper credits of downloaded material, for instance), as well as at the legal (financial and copyright policies, electronic signatures, for instance) and educational (training of young and not-so-young people) ones, without forgetting the security and the fragility of the material delivered via the electronic medium—a very worrying factor for many people.
The most demanding challenge, however, arises probably from the enormous quantity of information easily reachable (and providable) by everybody nowadays, as discussed in the following section.
The information retrieval challenge
At ACM97, the conference celebrating the 50th anniversary of the Association for Computing Machinery, the Nobel Prize laureate Murray Gell-Mann called attention to the fact that, with the digital age producing an ‘immense sea of data that threatens to drown humanity’, people needed to adapt how they think so that true knowledge can be distilled from the deluge.
‘We hear, in this dawn of the so-called information age, a great deal of talk about the explosion of information and new methods for its dissemination. It is important to realize, however, that most of what is disseminated is misinformation, badly organized information or irrelevant information. How can we establish a reward system such that many competing but skillful processors of information, acting as intermediaries, will arise to interpret for us this mass of unorganized, partially false material?’
That challenge is probably the most critical one currently in the field of information handling—and astronomy is facing it too.
We shall have to rely, indeed, on the wisdom of providers and users of electronic information, as well as on the various intermediaries (compilers, information hubs,…) and on the learned societies, committee experts and so on to take this into account, to put, more than ever, the emphasis on validated and authenticated electronic information, and to reward appropriately the scientists who are or will dedicate a full-time job to such activities.
The quality of resources as well as their maintenance must be continuously improved from lessons learned with time and using the most adequate tools. Generally speaking, information has to be collected, verified, de-biased, homogenized and made available not only in an efficient way but also through reliable channels. Sophisticated techniques cannot save the extensive background, unrewarding and very careful work which is indispensable for the compilation of a valuable resource. One could never stress enough the importance of this obscure daily work consisting of patiently collecting data, checking information and updating it. This has also to be carried out by knowledgable scientists or documentalists and cannot be delegated to inexperienced clerks.
Efficient search engines working on validated and authenticated material must enable finding information looked for, if available at all, and whether it relates to celestial objects, to data, to bibliography or to the vast coverage by yellow-page services. That information must be of good quality, relevant and on target. The best search engines available today and the corresponding organizations have been mentioned in this article.
As technology leaders agree that changes are the only sure thing about computing and communications in the next decades, we can certainly expect further modifications on the information handling in astronomy and consequently on the sociodynamics of the astronomy community itself.
Useful URLs
For WWW access, here are—in alphabetical order—the URLs of the most relevant organizations and resources mentioned in the article. Most of them are collaborating with each other and are hosting mirror pages of their partners. Complementary resources are also generally available from the sites.
• |
American Astronomical Society (http://www.aas.org/) |
• |
Association for Computing Machinery (http://www.acm.org/) |
• |
Astrophysics Data System (http://adsabs.harvard.edu/) |
• |
Canadian Astronomy Data Centre (http://cadcwww.dao.nrc.ca/CADC-homepage.html) |
• |
Central Bureau of Astronomical Telegrams (http://cfa-www.harvard.edu/cfa/ps/cbat.html) |
• |
International Astronomical Union (http://www.iau.org/) |
• |
Minor Planet Center (http://cfa-www.harvard.edu/iau/mpc.html) |
• |
NAOJ Astronomical Data Analysis Center (http://adac.mtk.nao.ac.jp/) |
• |
NASA/IPAC Extragalactic Database (http://nedwww.ipac.caltech.edu/) |
• |
National Space Science Data Center (http://nssdc.gsfc.nasa.gov/) |
• |
Royal Astronomical Society (http://www.ras.org.uk/ras/) |
• |
Simbad (http://simbad.u-strasbg.fr/Simbad) |
• |
Sky & Telescope (http://www.skypub.com/) |
• |
Starlink (http://star-www.rl.ac.uk/) |
• |
Strasbourg Data Center (http://cdsweb.u-strasbg.fr/CDS.html) |
Bibliography
Because of the quick evolution of information technologies nowadays, and because of the impact they have on the dynamics of the astronomy community, some aspects of the most recent compilations and reviews could become rather quickly outdated. The basic principles will largerly remain unchanged, however, and the following books could be recommended as possible further, more technical, reading:
Series of specialized conferences provide also sources for advanced material. See for instance the following as well as the numerous references quoted therein: