Документ взят из кэша поисковой машины. Адрес
оригинального документа
: http://www.adass.org/adass/proceedings/adass96/accomazzia.html
Дата изменения: Tue Jun 23 21:08:36 1998 Дата индексирования: Tue Oct 2 03:37:28 2012 Кодировка: Поисковые слова: astro-ph |
Next: The Sociology of Astronomical Publication Using ADS and ADAMS
Previous: QDB: An IDL-Based Interface to LASCO Databases
Up: Database Applications
Table of Contents - Index - PS reprint - PDF reprint
A. Accomazzi, G. Eichhorn, M. J. Kurtz, C. S. Grant, S. S. Murray
Smithsonian Astrophysical Observatory, 60 Garden Street,
Cambridge, MA 02138
The design behind the Astrophysics Data System (ADS) bibliographic databases was mainly dictated by the desire for a powerful and discipline-oriented system featuring sophisticated search capabilities. The main considerations which shaped the final outcome of the system were: the advantages and disadvantages of using a commercial or publicly available RDBMS system versus a custom-build one; the quality and quantity of the data at hand versus the resources available to the project; and the tradeoff between search speed and simplicity on one hand and sophistication on the other.
General-purpose search engines and relational databases were used as part of the abstract service in the first implementation of the search engine, but they were eventually dropped in favour of a home-grown system as the desire for better performance and custom features grew with time (Accomazzi et al. 1995).
The heterogeneous nature of the bibliographic data that had to be entered into our database, and the need to effectively deal with the imprecision in it, lead us to design a system where a large set of discipline-specific interpretations are made. For instance, to cope with the different use of abstract keywords by the publishers, and to correct possible spelling errors and typos in text, sets of words have been grouped together as synonyms for the purpose of searching the databases. Also, many astronomical object names are translated in a uniform fashion when indexing and searching the database.
Because of the large number of features that we have been adding to the abstract service in the last few years, we had to strike a balance between simplicity of the user interface and the creeping featurism syndrome so commonly found in many user interfaces. To avoid overwhelming users with complex search pages, we have devised a design where the main search parameters are always visible within the top part of the screen, with more options to follow. Because of the very nature of the WWW, we have been able to create simpler HTML forms that have much of the additional functionality hidden from the user, and we now even allow users to create and customize their own search form according to their preferences.
In order to provide transparent access to our system from other WWW-based systems, we have provided access interfaces that use bibliographic codes (Schmitz et al. 1995)-or bibcodes, as referred to in the rest of this paper-as unique identifiers for references in our databases. Direct HTTP access to our CGI interface programs, and a high-level programming interface implemented as a library of Perl routines, are provided as hooks into our bibliographic search engine.
The ADS CGI interfaces implement a variety of possibly complex searches of the bibliographic databases, but searches can generally be divided in two classes: reference searches and concept searches.
This type of interface allows users to lookup a particular publication or to browse a set of references published in a journal. Access to the program that implements this interface is available by retrieving the URL:
http://adsabs.harvard.edu/cgi-bin/abs_connect?bibcode= bibcodewhere bibcode is either a fully qualified, 19-digit bibliographic code, a partial bibcode, or a bibcode pattern possibly containing metacharacters. Consider, for instance, the cases where bibcode is one of the following:
Other similar programs and HTML forms extend these capabilities by allowing selections based on publication date ranges and journals (see, for instance, the ADS Table of Contents Query Form).
Searches based on the identification of a set of references which are relevant to a particular topic or ``concept'' are implemented in a similar fashion. Because references are structured entities having several attributes (or ``fields''), a fielded search is one in which one or more fields are to be searched and one or more terms to be searched for are specified for each field.
Currently the ADS Astronomy database allows users to search by author name, astronomical object name, keywords, words in the title, and words in the abstract text. The general URL syntax for searching for terms in a particular field is
http://adsabs.harvard.edu/cgi-bin/abs_connect?field=wordswhere field is the name of the field to be searched and words represents the expression to be searched for. For instance, to find the list of all papers published by the Author in the ADS Astronomical Database, one would access the URL
http://adsabs.harvard.edu/cgi-bin/abs_connect?author=accomazzi.
When specifying more than a single word to be searched in a particular field, the interface allows the user to select whether the resulting list is to include references which contain a subset of the search terms, which search terms must be present, and which should be excluded. When specifying words to be searched in separate fields, the user may choose how the lists of references resulting from the individual field searches should be combined, using a logic similar to the one applied for combining references generated from individual words within a field.
The ADS abstract service search form has many more features and settings that can be set customized, including restricting the search to be performed only on a particular journal or body of literature (e.g., searching on refereed journals only). One immediate application of this is that it provides users with several up-to-date indexes into subsets of the astronomical literature. For instance, to search for all the publications appearing in the ADASS conference series that mention ADS in their abstract, one would simply call the abs_connect script with the arguments: text=ADS&jou_pick=YES&ref_stems=adass.
One of the most successful features of the design behind the current WWW software agents is that they allow users to transparently browse information available on the Internet via the selection of hyperlinks. In particular, this has created a de-facto standard interface and protocol for accessing network resources available from different institutions, thus becoming the glue between the services provided by different astronomical data centers.
The ADS databases currently maintain for each bibliography a set of links to both local and network-accessible resources. The following hyperlinks provide interconnectivity between the ADS and other institutions:
The relationship between the ADS system and the data centers mentioned above is reciprocal, in the sense that they, in turn, provide hyperlinks from their databases to bibliographic resources available in the ADS, when appropriate.
The popularity and usefulness of the ADS bibliographical services is due, in large part, to its discipline-specific features and to the synergy created by several data centers adopting a common language and protocol to link their resources. This cooperation provides astronomers an ever-growing wealth of information and resources that are transforming the way they perform their research.
Because of the size and completeness of its databases, the NASA Astrophysics Data System has become a clearinghouse for astronomical bibliographic resources, and the ADS abstract service has become the bridge between networked resources available from different institutions and societies.
This work is funded by the NASA Astrophysics Program under grant NCCW-0024.
Accomazzi, A., Grant, C. S., Eichhorn, G., Kurtz, M. J., & Murray, S. S. 1995, in Astronomical Data Analysis Software and Systems IV, ASP Conf. Ser., Vol. 77, eds. R. A. Shaw, H. E. Payne & J. J. E. Hayes (San Francisco, ASP), 36
Schmitz, M., Helou, G., Dubois, P., LaGue, C., Madore, B., Corwin Jr., H. G., & Lesteven, S. 1995, in Information & On-line Data in Astronomy, ed. D. Egret & M. A. Albrecht (Dordrecht: Kluwer Acad. Publ.), 271
Next: The Sociology of Astronomical Publication Using ADS and ADAMS
Previous: QDB: An IDL-Based Interface to LASCO Databases
Up: Database Applications
Table of Contents - Index - PS reprint - PDF reprint