doi:10.1016/S1532-0464(03)00011-X
Copyright © 2003 Published by Elsevier Science (USA).
The structure of science information
Received 1 March 2002.
Available online 10 March 2003.
References and further reading may be available for this article. To view references and further reading you must
purchase this article.
Abstract
The organization of information within science can be investigated in a principled way through analysis of science language. The restricted use of language in science enables description of the informational structure of science and of particular subfields, with strong similarities to structures in mathematics and programming languages. This result rests on decades of research into the relation between form and content in language, based on an information-theoretic approach to the structure of information. Examples are provided from immunology and the social sciences. Practical applications include storage of science information in databases, indexing the literature, and identification and resolution of controversy.
Table 1. Formulaic representation of sentences

The middle column is a grammatical transform of the left column. Brackets enclose elementary sublanguage sentences. Material between brackets is the sublanguage conjunction marked by colon. Material before a bracket is a general conjunction (not shown in formulas) to the preceding sentence; WH indicates a secondary sentence which has become relative clause or modifier. Vertical bars inside brackets separate the subject, verb, and object. Parentheses indicate zeroed material. ← indicates that the preceding material is to be read in English in reverse order; forward-readable transforms exist but are more complex. The right column gives the formulaic representation of the middle column, obtained directly by writing a sublanguage symbol for each segment between bars or brackets. Superscript w on a host letter indicates that the host is carrying a modifier which appears as a secondary sentence, below, introduced by WH. Other superscripts indicate a modifier that is written together with the host. Subscripts indicate subclasses of the class marked by the host letter. The sentences are from E.H. Leduc, A.H. Coons, J.M. Connolly, J. Exp. Med. 102, 66 Par. 4 sentences 1–3 (1955) and the analysis is given on pp. 360–361 of Harris, Gottfried, Ryckman, et al. op. cit.