I am assuming at the start that we will agree that some type of encoding standard and a prototype implementation of that standard are the two major products that will advance our objective. That is, we need to produce a structure and a working demonstration system and not simply theoretical or descriptive papers on the concept.
In order to develop such a product, we need to address a number of issues dealing with the existing environment for archival authority information and its relation to other archival descriptive products; the state of acceptance, knowledge, and experience in the archival profession of the concept; and the likely path to adoption of the approach as a standard and as a practice. My comments here draw heavily on both the MARC-AMC and EAD experiences and the roads they traveled on their way to acceptance by the community. I am not convinced that there are substantive differences in this project from any other standards development project, other than the degree of consensus in the community about the inherent value of the approach. So, in many ways, this is simply an effort to articulate that process for this particular issue and try to define specific implications for our work.
The comments also are based primarily on a US perspective and the experiences
and state of the archival community in other countries may differ significantly
and suggest modifications.
Define and confirm the concept and show it working
While the idea of archival authority information, as defined in the first discussion paper, has been discussed by archivists for over 10 years, recognized in an international standard (ISAAR (CPF)), and implemented in a small number of existing systems, it is not clear that it is recognized as an integral part of archival description by more than a small number of institutions and archivists. Most archivists will readily subscribe to a descriptive practice that includes the recording of biographical or agency history information in traditional ways - as narrative text in a catalog record or finding aid - but relatively few would understand the benefits of recording it as a separate but linked entity or as information that can have its own structure and content designation. One of the major tasks we face in having this concept of archival authority information adopted as a standard and a practice is to convince a critical mass of archivists that it is beneficial, feasible, and economic to treat authority information in this way and to demonstrate that the tools are available to make it happen.
Most of the discussion in the profession thus far has focused on administrative histories of organizational units, where a variety of rationales specific to the mission and responsibilities of the archival repository (such as citizens' access to information about their government) and to the method of organizing records (multiple series, each with its own descriptive record, originating from the same organization) make the value of structured authority information self-evident. For materials, such as personal papers, however, the value of "authority copy" that can be used in multiple places is not as obvious. Nor is there the same mission on the part of the repository to record and disseminate information on a person's life and activities as there is for an archives to do so for a governmental agency. {Bright Sparcs is an exception since it focuses on individuals and their characteristics.) We need to consider carefully the arguments and benefits we want to advance for extending the approach beyond organizational entities.
We should also consider the application of the concept to other non-records creating entities, such as functions, activities, and artifacts. The importance of architectural research on the Yale campus, for example, strongly suggests that an authority record standard that encompasses structured and content designated descriptions of buildings would be extremely valuable and, in fact, such information already exists in book form and is heavily used. Similar cases can be made for functional, geographic, and other entities or concepts that are documented or influence the creation and use of records. One of the goals then is to determine whether it is practical and useful to establish one authority encoding standard for all types of entities or to develop separate ones for each specific type.
The existence of working prototype systems that demonstrated how effectively
catalog-level information and finding aids could be searched, retrieved, and
exchanged was key to their acceptance by the archival community. In both cases,
there was considerable skepticism about the value of the undertakings and the
amount of effort needed to reorganize thought processes and workflows to
accommodate these new standards (and some still remains). Having a working model
with live information that can be reviewed and tested will be essential to
acceptance and involvement by the profession.
Accommodate existing universe
Since adoption of this practice will involve a change of methodology, at least for many in the US, the benefits must outweigh the costs of reorientation and restructuring of information systems. As with MARC-AMC and EAD, the proposed standard cannot be overly prescriptive in terms of required content designation and structure. It needs to accommodate the existing universe of authority information so that it can easily be migrated into a more robust system that treats it as independent, linked data without requiring extensive analysis and recoding.
In addition to the technical accommodation of existing practice, the proposed
standard must also be careful to identify and address the full range of content
and structure that are already being used. While there are few instances of
highly structured archival authority information in place, the content that
could be specifically designated within existing information products is
extensive. Since much of this information is embedded in narrative texts, it
will require analysis of these texts to identify content that is commonly
recorded and that may warrant either structure or specific content designation.
This parallels the work that Elaine Engst did prior to the development of
MARC-AMC and that the Berkeley team did to "distill the essence" of finding aids
for the document type definition.
Encourage better practice
At the same time that it accommodates existing information products, however,
the standard also needs to be sufficiently rigorous to encourage better practice
in analyzing, structuring, and content designating authority information. It can
best do this by a combination of a detailed standard that reflects a complete
set of relevant data elements and clear examples of how they are used. To the
extent that such structured and encoded examples can be compared to the looser
narrative structures with which most archivists work, and the benefits of such
structuring for retrieval and comprehension purposes shown, the standard may
gain more adherents.
Prototype that demonstrates linkage and navigation
Also critical to the success of this effort will be a working prototype of authority information linked to both standard catalog records and encoded finding aids. At a minimum, this prototype needs to show the feasibility of the concept such that one can easily and intuitively navigate amongst all three types of archival descriptive information. The prototype needs to provide access to catalog records and finding aids in existing systems from authority information rather than requiring a migration of the former into a custom-built system. In this way, it is similar to the capability of linking MARC catalog records that already exist in national utilities or local systems to EAD-encoded finding aids.
Again, to the extent that such a system can be compared to existing
architectures and methodologies and shown to be more intuitive and effective,
there will be a better chance of convincing archivists to adopt this approach.
If user studies are built into the project, then the prototype should be built
in ways that the data needed for such studies is readily available. The design
of user studies should take place concurrently with the specification of the
prototype system with a good sense of what will be investigated.
Review and standards process
The encoding standard and prototype should be constructed in such a way that they will be readily available for review and comment by the international archival community at appropriate stages. The process by which the FINDAID document type definition became EAD is a model for the type of review and consultation that is needed. A call for the contribution of existing archival authority information, coupled with direct invitations to repositories who are known to have caches of such information or represent certain archival sub-universes that may have different needs, can publicize and educate the community to the approach (as well as assembling the needed information).
Similarly, a structured process of review and comment with a plan for turning
the process over to an established professional standards group for final
development, promulgation, and maintenance may be a goal we wish to articulate
and aim for from the start. Presentations at professional meetings, articles,
and similar consciousness-raising and education activities should also be
explicitly planned for.
Summary of the characteristics of the project product:
Convincing demonstration of the utility of the approach.Demonstration of tools to make implementation practical and cost-efficient.
Ability to link technically to related descriptive products in existing systems.
Ability to accommodate existing practice (within reason).
Encouragement of the development of better practice.
Support user studies.
Educate the community and elicit involvement throughout the stages of the project.