NAME: Changes to the USMARC Classification Format for Multilingual Classification Schemes
SOURCE: Decimal Classification Division, Library of Congress
SUMMARY: This proposal suggests changes to the USMARC classification format to accommodate multilingual editions of a classification scheme. Three additions are suggested: 1) add subfield $n for a textual note in field 084 (Classification Scheme and Edition) to specify a source edition, which may vary within a translation; 2) add subfield $f to field 084 to specify whether the edition is authorized or unauthorized; 3) add field 686 (Relationship to Source Note) to show the variations in different editions and how they relate to the source edition.
KEYWORDS: Field 084 (Classification); Field 686 (Classification); Classification Scheme and Edition; Relationship to Source Note
RELATED:
STATUS/COMMENTS:
12/15/96 - Forwarded to USMARC Advisory Group for discussion at the 1997 Midwinter MARBI meetings.
2/17/97 - Results of USMARC Advisory Group discussion - Approved.
It was suggested that a code be used in subfield $f Authorization (e.g. "u" for unauthorized). Some concern was expressed that the information in field 084 is repeated in every record, but it was pointed out that records from different translations could be mixed in the same database.
2/26/97 - Results of final LC review - Approved.
PROPOSAL NO. 97-5: Changes to the USMARC Classification Format
for Multilingual Classification Schemes
1. BACKGROUND
The Dewey Decimal Classification is used by over 200,000
libraries in 135 countries and has been translated into over 30
languages. The current edition, Edition 20, has been translated
into Italian, Spanish, and Turkish. These translations are very
close in structure and content to the English-language standard
edition with minor cultural adaptations. There is also an
intermediate French edition based on Abridged Edition 12 with
excerpts from Edition 20. Abridged Edition 12 has been translated
or is in the process of being translated into Arabic, French,
Greek, Hebrew, Italian, and Persian. Work is already underway on a
translation of Edition 21 into Russian. It is expected that
translations of Edition 21 will also appear in Arabic, Chinese,
French, Italian, and Turkish, plus excerpts (the major revisions)
in Spanish.
Classification schemes are unique among authority control
systems in that they may retain the same controlled vocabulary
(notation) and meaning when translated into other languages or
linked with other thesauri. Over a decade ago, Elaine Svenonius
(1983) noted Dewey's potential as a switching language in
multilingual databases. In order for this potential to be
realized, there must be explicit links between the English-language
standard editions of Dewey and each translation, and documentation
on the nature of those links. It is likely that the USMARC Format
for Classification Data will be used for the development of
explicit links between different translations.
An IFLA working group has reviewed the MARC format for
adequacy for international classification systems and compatibility
with UNIMARC, and made preliminary recommendations for extensions
to the MARC format. A paper will be presented at a later MARBI
meeting addressing changes needed for accommodating the Universal
Decimal Classification (UDC). This paper discusses additional data
elements needed to link records from a translation to a standard
edition.
(http://www.nlc-bnc.ca/ifla/VII/s29/projects/rep0796.htm)
2. DISCUSSION
Ability to specify source edition. There is a need to specify the
source edition for a translation.
Source edition may vary within a translation; for example, the new
Spanish edition is a translation of Edition 20, but contains parts
of Edition 21 such as the revised area table for the former Soviet
Union and the Table 6 expansions for North and South American
native languages.
Field 084 is used for Classification Scheme and Edition and is
defined as follows:
Indicators
First Type of edition
0 Full
1 Abridged
8 Other
Second Undefined
# Undefined
Subfield Codes
$a Classification scheme code (from USMARC Code List for
Relators, Sources, Description Conventions)
$b Edition title (NR)
$c Edition identifier (NR)
$e Language code (NR)
In the case of the Spanish translation mentioned above, it is
important to note the source on which it was based and the
variations incorporated. A new subfield $n for a variations note
could be used to show the variation. This information would appear
in every classification record that is part of this edition. In
addition, a new subfield $d could be added for Source edition
identifier.
Examples:
084 8 $addc $bSistema de Clasificación Decimal $c20 $d21 $n
contains parts of edition 21 in revised area table for
former Soviet Union and Table 6 expansions for North and
South American native languages
084 8 $addc $bClassification décimale de Dewey $cintermédiaire
$d12 $n based on Abridged Edition 12 with extensions from
DDC 20
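The subfield notation used in these examples can be read mechanically. The following Python sketch splits a field written in this proposal's "$"-delimited display convention into (code, value) pairs. The function name parse_subfields is hypothetical, and the sketch assumes that a literal "$" never occurs inside a subfield value; it illustrates the notation only and is not part of the USMARC specification.

```python
def parse_subfields(text):
    """Return the subfields of a displayed field as (code, value) pairs.

    Assumption: '$' only ever introduces a subfield code, and the
    character right after it is the one-character subfield code.
    """
    pairs = []
    # Text before the first '$' (tag and indicators) is ignored here.
    for chunk in text.split("$")[1:]:
        if not chunk:
            continue  # skip empty pieces, e.g. from a doubled '$'
        code, value = chunk[0], chunk[1:]
        # Collapse whitespace introduced by line wrapping in the display.
        pairs.append((code, " ".join(value.split())))
    return pairs


example = (
    "084 8 $addc $bClassification décimale de Dewey $cintermédiaire "
    "$d12 $n based on Abridged Edition 12 with extensions from DDC 20"
)
fields = parse_subfields(example)
for code, value in fields:
    print(code, value)
```

Returning a list of pairs rather than a dictionary keeps repeated subfields and their order intact.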
Authorized/unauthorized. It is also necessary to specify whether
a translation is authorized or unauthorized. This information is
useful from the perspective of quality (the assistance the
translators received in preparing the translation) and to
distinguish it from an authorized translation in the same
language. This information could be added as a new subfield for
authorization in field 084.
Examples:
084 8 $addc $bSistema de Clasificación Decimal $c20
$funauthorized
Relationship of number to source edition number.
Another change needed in the format is the ability to show the
relationship of the number to the source edition. There are three
types of relationships possible between the number and the source
edition: expansion (link to base number); authorized option (link
to option); adaptation. Even if based on a standard edition,
there may be variations in the meanings of some numbers or
expansions. For example, Table 2 in Edition 20 contains area
notation 4541 for the province of Bologna. In the Italian edition,
area notation 4541 has a 26-number expansion for the parts of the
province of Bologna. It would be useful to know that this is an
expansion, and the base number in the standard edition on which the
expansion is based.
Each edition of the DDC contains options to address cultural
differences or to provide a method for emphasizing topics of local
importance. If a scheme employs a standard option, it would be
useful to have a link to that instruction or number in the standard
edition.
Sometimes, translators adapt parts of the Classification to meet
local needs. For example, the religion schedule in the Persian
edition reflects the needs of an Islamic majority. It would be
useful to document an adaptation as such while retaining the link
to the source edition.
Because there is a need to provide a linking mechanism to other
numbers and no existing field contains the same type of
information, a new field could be added to the classification
format for Relationship to Source. It might be defined as follows:
686 Relationship to Source Note
Indicator 1 Type of relationship
0 Number from other source edition
1 Expansion
2 Option
3 Adaptation, other
$a Number in edition described in field 084--single number or
beginning number of span (R)
$b Number in primary source edition--single number or beginning
number of span (R)
$c Number in edition described in field 084, number in primary
source edition, or number where instructions are found--ending
number of span (R)
$i Explanatory text (R)
$o Number where instructions are found--single number or
beginning number of span (R)
$t Topic (R)
$z Table identification (R)
$2 Edition identifier (R)
$5 Institution to which field applies (R)
$8 Link and sequence no. (NR)
EXAMPLES:
Number from other source edition
084 8# $addc$bSistema de Clasificación$c20$ncontains parts of
edition 21 in revised Table 2 notation for former
Soviet Union and Table 6 expansions for North and South
American Languages$espa
153 ## $z2$a4771$hEuropa Europa Occidental$hEuropa oriental
Rusia$hUcrania$jProvincia de Crimea
686 0# $221
Expansion
084 8# $addc$bClassificazione Decimale Dewey$c20$eita
153 ## $z2$a454126$hEuropa Europa occidentale$hPenisola
italiana e isole adiacenti Italia $hRegione dell'Emilia-
Romagna e San Marino$hProvincia di Bologna$hNordovest
della provincia di Bologna$jCrevalcore
686 1# $z2$b4541
Option
084 8# $addc$bClassificazione Decimale Dewey$c20$eita
153 ## $a222.86$hReligione$hBibbia$hLibri storici dell'Antico
Testamento$hNeemia (Esdra 2)$jTobia
683 2# $i(Opzione: Classificare in$a229.22)$p253
686 2# $o229.22
084 8# $addc$bDewey Onlu Sınıflama ve Bağıntılı Dizin$c20$etur
153 ## $a412$hDil ve dilbilim$hBelirli diller$hTürk
dili$jStandart Türkçe'nin kökenbilimi (etimolojisi)
686 2# $b494.352$o410
Adaptation, other
084 8# $addc$bDewey Onlu Sınıflama ve Bağıntılı Dizin$c20$etur
153 ## $z2$a56226$hTablo: Coğrafi Alanlar, Tarihi Dönemler,
Kişiler$hAsya Doğu (Orient) Uzakdoğu$hOrta Doğu (Yakin
Doğu)$hEge Bölgesi (Batı Anadolu) ve Marmara Bölgesi
$hMarmara Bölgesi$jİstanbul
686 3# $tComprehensive works and European portion of Istanbul
province$z2$b49618
686 3# $tAsian portion of Istanbul province$z2$b563
084 8# $addc$bClassificazione Decimale Dewey$c20$eita
153 ## $a641.815$hTecnologia (Scienze applicate)$hEconomia
domestica e vita familiare$hCibi e bevande
(Alimenti)$hConservazione, immagazzinamento, cucina degli
alimenti$hCucina di specifici tipi di piatti$hPiatti
preliminari e di accompagnamento$jPane e affini
680 1# $iEsempî: cialde, crackers, crêpes, focacce, panini,
pizze, schiacciate
686 3# $tPizza$b641.824
084 8# $addc$bClassificazione Decimale Dewey$c20$eita
153 ## $a641.824$hTecnologia (Scienze applicate)$hEconomia
domestica e vita familiare$hCibi e bevande
(Alimenti)$hConservazione, immagazzinamento, cucina degli
alimenti$hCucina di specifici tipi di piatti$hPiatti
principali$jSformati di carne e torte di formaggio
686 3# $tPizza$a641.815
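The linking role of field 686 can be sketched as a lookup from a number in a translation to the corresponding number in the primary source edition. The Python sketch below hand-transcribes three of the examples above into tuples; the tuple layout and the function name source_numbers are hypothetical conveniences for illustration, not part of the proposed format.

```python
# Each entry: (153 $a, 686 first indicator, 686 $t or None, 686 $b),
# transcribed by hand from the examples in this section.
EXAMPLES = [
    ("454126", "1", None, "4541"),
    ("56226", "3",
     "Comprehensive works and European portion of Istanbul province",
     "49618"),
    ("56226", "3", "Asian portion of Istanbul province", "563"),
    ("641.815", "3", "Pizza", "641.824"),
]


def source_numbers(records):
    """Map each translated number to its (topic, source number) links."""
    table = {}
    for number, _indicator, topic, source in records:
        # One number may carry several 686 fields, so collect a list.
        table.setdefault(number, []).append((topic, source))
    return table


links = source_numbers(EXAMPLES)
print(links["56226"])
```

A list per number is used because field 686 is repeatable, as the Istanbul example shows.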
Script. In order for the classification format to be used
internationally, it is necessary to provide details about the
translation itself, such as script or romanization system. DDC is
already published in other scripts (Arabic, Russian). This issue
will be explored in a later discussion paper concerning script and
romanization in the authority format. In addition, some editions
contain text in more than one language. Thus, it is desirable to
make field 084 subfield $e (Language code) repeatable.
3. PROPOSED CHANGES
The following is presented for consideration:
- In the USMARC Classification Format, define the following in
field 084 (Classification Scheme and Edition):
$d Source edition identifier
$f Authorization
$n Variations
Make the following subfield in field 084 repeatable:
$e Language code
See Attachment A for a description of this field if this proposal
is approved.
- In the USMARC Classification Format, define a new field 686
for Relationship to Source Note.
See Attachment B for a description of this field if this proposal
is approved.
------------------------------------------------------------------
ATTACHMENT A
< > indicates addition; [ ] indicates deletion
084 Classification Scheme and Edition (NR)
Indicators
First Type of edition
0 Full
1 Abridged
8 Other
Second Undefined
# Undefined
Subfield Codes
$a Classification scheme code (NR)
$b Edition title (NR)
$c Edition identifier (NR)
<$d Source edition identifier (NR)>
$e Language code [(NR)] <(R)>
<$f Authorization (NR)>
<$n Variations (R)>
FIELD DEFINITION AND SCOPE
This field contains information about the authoritative
classification scheme and edition that contains the classification
number(s) and term(s) in the record. It also may indicate the
edition title, date, and language of a particular version of the
classification scheme. If a library creates its own record for a
classification number maintained by another classification source,
the classification scheme on which it is based is specified in
field 084 and the library creating the record is identified in
field 040 (Record Source).
GUIDELINES FOR APPLYING CONTENT DESIGNATORS
INDICATORS
First Indicator - Type of edition
The first indicator position contains a value that specifies
the type of edition containing the classification data.
0 - Full
Value 0 indicates that the classification data is contained
in the full edition of the classification scheme. This
value is also used for classification schemes not issued in
an abridged edition.
084 0#$addc$c20
153 ##$a616.9792$hTechnology (Applied
sciences)$hMedical sciences.
Medicine$hDiseases$kSpecific diseases$hOther
diseases$hDiseases of the immune
system$hImmune deficiency diseases$jAcquired
immune deficiency syndrome (AIDS)
084 0#$alcc
153 ##$aN6370$cN6494$hVisual arts$hHistory$hModern
art$jBy century
1 - Abridged
Value 1 indicates that the classification data is from an
abridged edition of the classification scheme.
084 1#$addc$c11
153 ##$a323.3$hSocial sciences$hPolitical science
(Politics and government)$hRelation of state
to its residents$hRelation of state to social
aggregates$jOther social aggregates
8 - Other
Value 8 indicates that the classification data is contained
in an edition other than those specified by the other
values. The edition is specified in subfield $b (Edition
title) or subfield $c (Edition identifier).
084 8#$audc$cInternational medium edition
153 ##$a512.5$hMathematics and natural
sciences$hAlgebra$jGeneral algebra
Second Indicator - Undefined
The second indicator position is undefined and contains a
blank (#).
SUBFIELD CODES
$a - Classification scheme code
Subfield $a contains a variable-length alphabetic USMARC code
that identifies the classification scheme used to formulate
the classification number and caption in field 153
(Classification Number). The code is based on the general
classification scheme used without regard to the particular
edition or adaptation of the scheme. A classification number
or span that has been adapted in some way from the information
in the authoritative classification scheme is coded for the
scheme in this subfield and the NUC symbol or name of the
library that made the adaptation is contained in field 040
(Record Source). The source of the classification scheme code
is USMARC Code List for Relators, Sources, Description
Conventions that is maintained by the Library of Congress.
084 0#$addc$c20
153 ##$a323.32$hSocial sciences$hPolitical science
(Politics and government)$hCivil and political
rights$hCivil and political rights of other social
aggregates $jSocioeconomic classes
084 0#$alcc
153 ##$aHE381$hTransportation and communications$hWater
transportation$hWaterways$jGeneral works
040 ##$aDNLM$cDNLM
084 0#$alcc
153 ##$aSF887$hAnimal culture$hVeterinary
medicine$hVeterinary medicine of special organs,
regions, and systems$hUrinary and reproductive
organs$jObstetrics
753 ##$aAbortion, Veterinary
[This record is created by NLM for use in the NLM index
to refer users to an LCC number. The basic
classification scheme is identified in field 084 and
agency that created the record is in field 040 (Record
Source).]
084 8#$audc$cInternational medium edition
153 ##$a642.12$hHousekeeping. Home economics. Domestic
science$hFood. Cooking. Dishes.
Meals$hMeals and mealtimes. Tableware$jMorning meal.
Breakfast
$b - Edition title
Subfield $b contains the title of the edition when a USMARC
code has not been assigned to the scheme or further
information needs to be given about the edition.
084 8#$addc$bSistema de Clasificación Decimal$c1980$espa
153 ##$a331.012$hCiencias sociales$hEconomía$hEconomía
laboral$hFilosofía y
teoría$jSatisfacciones del trabajo
[Data is from the Spanish edition of the Dewey Decimal
Classification.]
$c - Edition identifier
Subfield $c contains the edition number, date, or other
textual designation of the classification scheme edition
contained in the classification record.
084 0#$addc$c20
153 ##$a401.3$hLanguage$hPhilosophy and
theory$jInternational languages
084 0#$anlm$c4th ed., rev.
153 ##$aWQ160$hObstetrics$jMidwifery
<$d - Source edition identifier
Subfield $d contains the edition number, date, or other
textual designation of the classification scheme edition used
as the primary source for the edition identified in subfield
$c. Subfield $d is not used if it would be the same as
subfield $c. Subfield $d contains the edition on which the
current edition is based.
084 8# $addc $bSistema de Clasificación Decimal $c20
$d21 $n contains parts of edition 21 in revised
area table for former Soviet Union and Table 6
expansions for North and South American native
languages>
$e - Language code
Subfield $e contains the USMARC code for the language of the
classification scheme edition when the language is other than
English. The source of the codes is USMARC Code List for
Languages that is maintained by the Library of Congress.
<$f - Authorization
Subfield $f contains an indication of whether the translation
has been authorized, i.e., done with the approval of the
producer of the source edition. If this subfield is not used,
it is assumed to be authorized.
084 8# $addc$bSistema de Clasificación
Decimal$c20$funauthorized>
<$n - Variations
Subfield $n contains general information about variations in
this edition from the primary source edition. Field 686
Relationship to Source Note contains specific information
about the relationship of a particular number to the source
edition.
084 8# $addc$bSistema de Clasificación$c20$ncontains
parts of edition 21 in revised Table 2 notation
for former Soviet Union and Table 6 expansions
for North and South American Languages$espa>
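The default described for subfield $f (an absent subfield means the translation is assumed to be authorized) can be expressed as a small check. The Python sketch below is illustrative only: the function name is_authorized is hypothetical, and the accepted values are assumptions based on the word "unauthorized" used in the example and the code "u" suggested in the advisory group discussion.

```python
def is_authorized(subfields):
    """Decide authorization from field 084 subfields.

    subfields: list of (code, value) pairs, e.g. [("a", "ddc"), ("c", "20")].
    Assumption: "unauthorized" or the code "u" marks an unauthorized edition.
    """
    for code, value in subfields:
        if code == "f":
            return value.strip().lower() not in ("unauthorized", "u")
    # No $f present: the edition is assumed to be authorized.
    return True


print(is_authorized([("a", "ddc"), ("c", "20")]))
print(is_authorized([("a", "ddc"), ("c", "20"), ("f", "unauthorized")]))
```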
SCHEME-SPECIFIC CONVENTIONS
DEWEY DECIMAL CLASSIFICATION
Only the standard abridged edition uses value 1 in the first
indicator position.
RELATED USMARC FIELD/DOCUMENT
040 Record Source
USMARC Code List for Languages
USMARC Code List for Relators, Sources, Description
Conventions
-------------------------------------------------------------------
ATTACHMENT B
686 Relationship to Source Note (R)
Indicators
First Type of relationship
0 Number from other source edition
1 Expansion
2 Option
3 Adaptation, other
Second Undefined
# Undefined
Subfield Codes
$a Number in edition described in field 084--single number or
beginning number of span (R)
$b Number in primary source edition--single number or beginning
number of span (R)
$c Number in edition described in field 084, number in primary
source edition, or number where instructions are found--
ending number of span (R)
$i Explanatory text (R)
$o Number where instructions are found--single number or
beginning number of span (R)
$t Topic (R)
$z Table identification (R)
$2 Edition identifier (R)
$5 Institution to which field applies (R)
$8 Link and sequence no. (NR)
FIELD DEFINITION AND SCOPE
This field contains information about the relationship of a number
to the source edition when the number is different from the
standard number for the same topic in the primary source edition.
This field is used for numbers based on a source other than the
primary source, expansions, implemented options, and adaptations.
The information in this field is intended primarily for computer
processing or to guide classifiers and is often not written in a
form adequate for public user display.
GUIDELINES FOR APPLYING CONTENT DESIGNATORS
INDICATORS
First Indicator - Type of relationship
The first indicator position contains a value that indicates the
type of relationship between the number in the 153 field and the
standard number for the same topic in the source edition.
0 - Number from other source edition
Value 0 indicates that the classification number in field 153 is
based on a source other than the primary source. If the
classification number in field 153 is the implementation of an
option described in the other source edition, use indicator
value 2.
1 - Expansion
Value 1 indicates that the classification number in field 153
represents a more specific number in the same hierarchy as the
standard number in the primary source edition for the topic
identified in subfield $t. If this number is based on another
source edition, use indicator value 0.
2 - Option
Value 2 indicates that the classification number in field 153
represents the implementation of an option described in the
primary source or other source edition.
3 - Adaptation, other
Value 3 indicates that the classification number in field 153 is
different from the number in the primary source edition for the
topic identified in subfield $t, and none of the types of
relationships described with indicator values 0-2 is applicable.
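The four first-indicator values described above lend themselves to a simple lookup. The Python sketch below reads the first indicator from a field 686 written in the display form used in this proposal (tag, indicators, subfields). The names RELATIONSHIP_TYPES and relationship_type are hypothetical, and the sketch assumes that the tag and the indicators are separated by whitespace.

```python
# First indicator values of field 686 as described in the guidelines.
RELATIONSHIP_TYPES = {
    "0": "Number from other source edition",
    "1": "Expansion",
    "2": "Option",
    "3": "Adaptation, other",
}


def relationship_type(field_line):
    """Return the relationship type named by the first indicator."""
    parts = field_line.split()
    if len(parts) < 2 or parts[0] != "686":
        raise ValueError("not a field 686 in display form")
    first_indicator = parts[1][0]
    if first_indicator not in RELATIONSHIP_TYPES:
        raise ValueError("undefined first indicator: " + first_indicator)
    return RELATIONSHIP_TYPES[first_indicator]


print(relationship_type("686 1# $z2$b4541"))
```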
SUBFIELD CODES
$a - Number in edition described in field 084--single number or
beginning number of span
Subfield $a contains the number in the edition described in
field 084 for the topic identified in subfield $t. Subfield
$a is not used if it would be the same as the number in field
153.
$b - Number in primary source edition--single number or beginning
number of span
Subfield $b contains the standard number in the primary
source edition for the topic identified in subfield $t, or if
there is no subfield $t, then in subfield $j of field 153.
Subfield $b is not used if it would be the same as subfield
$o (Number where instructions are found).
$c - Number in edition described in field 084, number in primary
source edition, or number where instructions are found--
ending number of span
Subfield $c contains the ending number of a classification
number span cited in field 686. The beginning number of the
span is recorded in subfields $a, $b or $o.
$o - Number where instructions are found--single number or
beginning number of span
Subfield $o contains the number in the source edition where
instructions are given for the option that is being
implemented in the number in field 153. This subfield is
used only for an option described in the primary source
edition or another source edition (indicator value 2).
$t - Topic
Subfield $t contains the topic that is being added to or
subtracted from the meaning of the number in field 153.
Subfield $t is not used if it would be the same as subfield
$j (Caption) in field 153.
$i - Explanatory text
Subfield $i contains the explanatory text in field 686.
$z - Table identification
Subfield $z contains the identification of the table to which
a classification number recorded in field 686 belongs, if the
classification number is part of a table. For a
classification number span, subfield $z is given only once,
before the first number.
$2 - Edition identifier
Subfield $2 contains the edition number, date, or other
textual designation of the classification scheme edition
used as the source for the classification number in field
153 when that source is not the primary source. This
subfield is used with indicator value 0, and with indicator
value 2 when the option is described in the other source
edition. This subfield is not used to record the edition
identifier of the primary source edition; that edition
identifier is recorded in subfield $c or $d of the 084
field.
$5 - Institution to which field applies
Subfield $5 contains the USMARC code of the organization to
which the Relationship to Source Note applies. The source
of this code is USMARC Code List for Organizations that is
maintained by the Library of Congress.
$8 - Link and sequence number
Subfield $8 contains data that is used to sequence a 686
field with other related 6XX or 76X fields. The subfield
is structured as follows:
<linking number>.<sequence number>
The linking number is a variable length whole number. The
linking number is the same for each 6XX or 76X field being
linked to this field.
A variable length sequence number is added to control the
display sequencing of fields with identical linking
numbers. A sequence number is separated from a linking
number by a decimal point. A sequence number may itself be
a decimal number.
For examples of the use of this subfield see field 763
(Internal Subarrangement or Add Table Entry).
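The structure of subfield $8 described above (a whole-number linking number, optionally followed by a decimal point and a sequence number that may itself be a decimal number) can be separated at the first decimal point. The Python sketch below illustrates this; the function name parse_link_and_sequence is hypothetical, and returning the sequence number as a string is a design assumption made so that a value such as "1.5" is preserved exactly.

```python
def parse_link_and_sequence(value):
    """Split a $8 value into (linking number, sequence number or None)."""
    # Only the first decimal point separates the two parts; any later
    # point belongs to the sequence number itself.
    linking, separator, sequence = value.strip().partition(".")
    if not (linking.isascii() and linking.isdigit()):
        raise ValueError("linking number must be a whole number")
    if separator and not sequence:
        raise ValueError("decimal point without a sequence number")
    return int(linking), (sequence if separator else None)


print(parse_link_and_sequence("3.1.5"))
```

Fields that share a linking number can then be grouped on the first element and ordered by the second.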
SCHEME-SPECIFIC CONVENTIONS
DEWEY DECIMAL CLASSIFICATION
Only longer numbers in the same hierarchy are expansions.
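This convention can be approximated by a prefix test on the notation. The Python sketch below treats a number as an expansion when, with decimal points ignored, it is longer than the base number and begins with it. Equating "same hierarchy" with a shared notational prefix is an assumption made for illustration, since hierarchy in the DDC is not always expressed by notation; the function name is_expansion is hypothetical.

```python
def is_expansion(number, base):
    """Return True if number looks like an expansion of base.

    Assumption: a longer number whose digits start with the digits of
    the base number lies in the same hierarchy.
    """
    digits = number.replace(".", "")
    base_digits = base.replace(".", "")
    return len(digits) > len(base_digits) and digits.startswith(base_digits)


# Table 2 notation from the Italian example: 454126 expands 4541.
print(is_expansion("454126", "4541"))
```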