1 Application of the TEI customization

The LAUDATIO-Repository provides open access to historical text corpora. With the help of metadata such as

the corpus title,
version,
documents in the corpus,
formats,
annotations,
checking methods
and the responsible editors, annotators and projects,

you will be able to choose the corpus of interest for your analysis or further preparation steps.

The TEI customization provides a metadata scheme for the LAUDATIO-Repository that enables a structured, uniform display as well as faceted and free-text search of such corpus metadata. The customization is based on a metadata model.

For each corpus in the repository, the metadata are stored in TEI XML. There are three ODDs which specify the metadata for a corpus, the documents of the corpus and the annotations in the documents. This customization refers to the class 'corpus', see figure 1 below.

The TEI ODD is developed in the context of the LAUDATIO-Project in cooperation with the working groups of the Search and Visualization System ANNIS and the converter framework SaltNPepper.

1.1 Metadata model

The metadata model refers to all kinds of historical text corpora.

The metadata model defines a corpus as the sum of its documents, which in turn are the sum of their annotations. For each concept, there is a list of attributes which represent the metadata relevant for data retrieval in the repository. For example, the attribute 'title' is assigned to each concept, but there is a difference between the title of the corpus, the title of a historical document within the corpus and the title of an annotation layer. With the help of the model, it is clear which title is meant in which situation.
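The nesting described above can be pictured with a small Python sketch. The class and function names (`Corpus`, `Document`, `Annotation`, `titles_by_concept`) are hypothetical illustrations and not part of the LAUDATIO schema; the sketch only shows how the same attribute 'title' stays unambiguous when it is attached to a concept.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Annotation:
    title: str  # title of an annotation layer


@dataclass
class Document:
    title: str  # title of a historical document within the corpus
    annotations: List[Annotation] = field(default_factory=list)


@dataclass
class Corpus:
    title: str  # title of the whole corpus
    documents: List[Document] = field(default_factory=list)


def titles_by_concept(corpus: Corpus) -> List[Tuple[str, str]]:
    """Return (concept, title) pairs so each title carries its concept."""
    pairs = [("Corpus", corpus.title)]
    for doc in corpus.documents:
        pairs.append(("Document", doc.title))
        for ann in doc.annotations:
            pairs.append(("Annotation", ann.title))
    return pairs


if __name__ == "__main__":
    # Illustrative values only.
    example = Corpus("Example corpus", [Document("Example document", [Annotation("pos")])])
    for concept, title in titles_by_concept(example):
        print(concept, title)
```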

For detailed information, motivation and discussion see

1.2 Implementation of the metadata model

The TEI customization creates a TEI scheme for the LAUDATIO metadata concepts 'Corpus', 'Document' and 'PreparationStep' by mapping the metadata model onto the TEI p5 header structure. The metadata model is realized with TEI p5 by using a subset of TEI p5 and a TEI header for each concept:

Each TEI XML file contains fileDesc, encodingDesc and revisionDesc. The concepts 'Corpus' and 'Document' contain profileDesc because they refer to the language used in the data.
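As a rough illustration of this rule, the following Python sketch reports which of the header parts named above are missing from a given teiHeader. The function names and the way the concept is passed in are assumptions made for the example; the sketch is not a replacement for validating against the ODD-generated schema.

```python
import xml.etree.ElementTree as ET

# Header parts named in the text as present in every TEI XML file.
COMMON_PARTS = {"fileDesc", "encodingDesc", "revisionDesc"}
# Concepts whose header additionally contains profileDesc.
CONCEPTS_WITH_PROFILE = {"Corpus", "Document"}


def required_parts(concept: str) -> set:
    """Return the header parts expected for a LAUDATIO concept."""
    parts = set(COMMON_PARTS)
    if concept in CONCEPTS_WITH_PROFILE:
        parts.add("profileDesc")
    return parts


def missing_parts(header_xml: str, concept: str) -> list:
    """List expected header parts that are not direct children of the header."""
    header = ET.fromstring(header_xml)
    # Strip a namespace prefix such as {http://www.tei-c.org/ns/1.0} if present.
    present = {child.tag.split("}")[-1] for child in header}
    return sorted(required_parts(concept) - present)


if __name__ == "__main__":
    sample = "<teiHeader><fileDesc/><encodingDesc/><revisionDesc/></teiHeader>"
    print(missing_parts(sample, "Corpus"))
    print(missing_parts(sample, "PreparationStep"))
```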

The headers are related to each other as follows: The 'Corpus Header' provides a list of documents corresponding to the 'Document Header' and a list of annotations corresponding to the 'PreparationStep Header'. Each 'Document Header' contains a list of annotations for each document in the corpus corresponding to the 'PreparationStep Header'.

2 TEI p5 customization for the concept 'Corpus'

The ODD combines the four basic modules (<textstructure>, <core>, <header> and <namesdates>), but removes most of the available elements and changes several attribute concepts, e.g. by adding a TEI attribute to an element. Thus, the customization is very restrictive and allows little variation in selecting elements and attributes.

2.1 Core structure

The <core> module is very restricted and only contains the elements corresponding to bibliographic metadata such as author, editor, date and title and more general structural elements such as list, item, label, p and ref.

author in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.

n	(number) gives a number (or other label) for an element, which is not necessarily unique within the document.
role	may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.

editor contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc.

n	(number) gives a number (or other label) for an element, which is not necessarily unique within the document.
role	may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.

date contains a date in any format.
when supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
list contains any sequence of items organized as a list.
type List of documents in the corpus with corresponding references.
item contains one component of a list.
n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
p (paragraph) marks paragraphs in prose.
ref (reference) defines a reference to another location, possibly modified by additional text or comment.
target [att.pointing] specifies the destination of the reference by supplying one or more URI References
title contains a title for any kind of work.
type (You must specify the exact reference to LAUDATIO metadata module schema 'Corpus'.) classifies the title according to some convenient typology.

No class or general module modifications were made. The TEI scheme is only locally restricted. That means that no inheritance relations or module relations were changed, deleted or added. The customization is documented in the section 'Formal declaration'.

All encoded metadata refer to information addressing the whole corpus, e.g. the editors and annotators of the corpus, the preliminary annotation guidelines used for all formats of the corpus, the revision history, and the available formats and their licenses.

2.2 Basic TEI header structure

The LAUDATIO metadata scheme for the object 'Corpus' contains many more elements and modules than a standard TEI header, which is usually used for the purpose of text encoding only. For validity, fileDesc, profileDesc, encodingDesc and revisionDesc are required.

The fileDesc contains the following elements, whose basic meanings are left untouched but which refer to the object 'Corpus': for example, the element author refers to the annotator of the corpus, and the element title refers to the title of the entire corpus. Elements used within teiHeader are:

titleStmt (title statement) groups information about the title of a work and those responsible for its content.
extent describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
type (You must indicate what type of extent you refer to) characterizes the element in some sense, using any convenient classification scheme or typology.
publicationStmt (publication statement) groups information concerning the publication or distribution of an electronic or other text.
sourceDesc (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence.
profileDesc (text-profile description) The langUsage defines which language the document has.
encodingDesc (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. For LAUDATIO: The element encodingDesc refers to the different formats of the corpus. For each format, a single encodingDesc contains information about what kind of annotation, including keys and values, and what kind of formats were used.
n [att.global] (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
revisionDesc (revision description) summarizes the revision history for a file.

The titleStmt contains the title, editors, annotators and technical staff of the corpus, each declared with forename, surname and affiliation.

title contains a title for any kind of work.
persName (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc.
forename contains a forename, given or baptismal name.
surname contains a family (inherited) name, as opposed to a given, baptismal, or nick name.
affiliation contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor.

The extent provides information about the size of the corpus. The attribute type defines the counting unit, 'Tokens' or 'Words'.

extent describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
type (You must indicate what type of extent you refer to) characterizes the element in some sense, using any convenient classification scheme or typology.
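The restriction on the counting unit can be sketched in Python as follows. The helper name `make_extent` and the sample size are hypothetical; the two legal values are taken from the formal declaration of extent in this customization.

```python
import xml.etree.ElementTree as ET

# Legal values of @type on extent in this customization.
EXTENT_TYPES = ("Tokens", "Words")


def make_extent(size: int, unit: str) -> ET.Element:
    """Build an extent element, rejecting counting units the schema does not allow."""
    if unit not in EXTENT_TYPES:
        raise ValueError(f"type must be one of {EXTENT_TYPES}, got {unit!r}")
    extent = ET.Element("extent", {"type": unit})
    extent.text = str(size)
    return extent


if __name__ == "__main__":
    # The size is an illustrative value, not a real corpus size.
    print(ET.tostring(make_extent(58000, "Tokens"), encoding="unicode"))
```

Note that the values are case-sensitive in the declaration, so a lowercase 'tokens' would be rejected by this check.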

The publicationStmt covers the availability, responsibility and publication dates.

authority (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor.
idno (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way.
availability supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc.
status supplies a code identifying the current availability of the text.
date contains a date in any format.
when supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.

sourceDesc contains a list of documents of the corpus which are given with technical short terms.

list contains any sequence of items organized as a list.
type List of documents in the corpus with corresponding references.
item contains one component of a list.
corresp
n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.

The profileDesc only contains metadata concerning the language of the corpus data, including remarks on language type and language area.

langUsage (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text.

language characterizes a single language or sublanguage used within a text.

style	contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text
ident	(identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global xml:lang attribute.

The encodingDesc contains descriptions of the corpus annotation guidelines depending on the format, segmentation and normalization. Each annotation key and value of each format is listed with a short description similar to annotation guidelines. Furthermore, the annotations are roughly divided according to their functions. Additionally, the research project and its references are provided. Elements used within encodingDesc are:

appInfo (application information) records information about an application which has edited the TEI file.
application provides information about an application which has acted upon the document.
ident supplies an identifier for the application, independent of its version number or display name.
version supplies a version number for the application, independent of its identifier or display name.
label contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary.
projectDesc (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected.
ref (reference) defines a reference to another location, possibly modified by additional text or comment.
target [att.pointing] specifies the destination of the reference by supplying one or more URI References
editorialDecl (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text.
segmentation describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc.
normalization indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form.
tagsDecl (tagging declaration) provides detailed information about the tagging applied to a document.
namespace supplies the formal name of the namespace to which the elements documented by its children belong.
name specifies the full formal name of the namespace concerned.
rend (rendition) indicates how the element in question was rendered or presented in the source text.
tagUsage supplies information about the usage of a specific element within a text.
gi (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent namespace element.

revisionDesc refers to the version history of the whole corpus. Corpora in the LAUDATIO-Repository may have several versions. The changes between these versions will be provided here.

revisionDesc (revision description) summarizes the revision history for a file.

change documents a change or set of changes made during the production of a source document, or during the revision of an electronic file.

n [att.global]	(number) gives a number (or other label) for an element, which is not necessarily unique within the document.
when [att.datable.w3c]	supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
who [att.ascribed]	indicates the person, or group of people, to whom the element content is ascribed.
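A version history of this kind could be assembled as in the following Python sketch. The function name `build_revision_desc`, the version labels and the `#ed1` pointer are hypothetical; the sketch assumes each corpus version is recorded as one change element with n, when and who, and it lists the most recent change first, as the note on change in the formal declaration recommends.

```python
import xml.etree.ElementTree as ET
from datetime import date


def build_revision_desc(changes):
    """Build a revisionDesc from (version, date, who, note) tuples, newest first."""
    revision_desc = ET.Element("revisionDesc")
    newest_first = sorted(changes, key=lambda change: change[1], reverse=True)
    for version, when, who, note in newest_first:
        change = ET.SubElement(
            revision_desc,
            "change",
            {"n": version, "when": when.isoformat(), "who": who},
        )
        change.text = note
    return revision_desc


if __name__ == "__main__":
    # Illustrative history; versions, dates and the pointer are made up.
    history = [
        ("1.0", date(2012, 5, 1), "#ed1", "First published version."),
        ("2.0", date(2013, 2, 15), "#ed1", "Added a normalization layer."),
    ]
    print(ET.tostring(build_revision_desc(history), encoding="unicode"))
```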

2.3 Detailed TEI header structure

A full TEI structure with all required elements is given below:

2.4 Formal declaration

Schema teiODD_LAUDATIOCorpus_S7: Elements

<affiliation>

<affiliation> contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor. [15.2.2. The Participant Description]
Module	namesdates
Member of	model.addressLike
Contained by	core: author date editor item label p ref title header: authority change extent language tagUsage namesdates: affiliation forename orgName persName surname
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element affiliation { macro.phraseSeq }
Example	<affiliation>Junior project officer for the US <name type="org">National Endowment for the Humanities</name> </affiliation> <affiliation notAfter="1960-01-01" notBefore="1957-02-28">Paid up member of the <orgName>Australian Journalists Association</orgName> </affiliation>
Note	If included, the name of an organization may be tagged using either the <name> element as above, or the more specific orgName element.

<appInfo>

<appInfo> (application information) records information about an application which has edited the TEI file. [2.3.10. The Application Information Element]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	header: application
Declaration	element appInfo { att.global.attributes, model.applicationLike+ }
Example	<appInfo> <application ident="Xaira" version="1.24"> <label>XAIRA Indexer</label> <ptr target="#P1"/> </application> </appInfo>

<application>

<application> provides information about an application which has acted upon the document. [2.3.10. The Application Information Element]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) att.datable (@calendar, @period) (att.datable.w3c (@when, @notBefore, @notAfter, @from, @to)) (att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso)) (att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod))

ident

supplies an identifier for the application, independent of its version number or display name.

Status	Required
Datatype	`data.name`

version

supplies a version number for the application, independent of its identifier or display name.

Status	Required
Datatype	`data.versionNumber`

Member of

model.applicationLike

Contained by

header: appInfo

May contain

core: label p ref

Declaration

element application
{
   att.global.attributes,
   att.typed.attributes,
   att.datable.attributes,
   attribute ident { data.name },
   attribute version { data.versionNumber },
   ( model.labelLike+, ( model.ptrLike* | model.pLike* ) )
}

Example

<appInfo>
<application ident="ImageMarkupTool1"
 notAfter="2006-06-01" version="1.5">
 <label>Image Markup Tool</label>
 <ptr target="#P1"/>
 <ptr target="#P2"/>
</application>
</appInfo>

This example shows an appInfo element documenting the fact that version 1.5 of the Image Markup Tool1 application has an interest in two parts of a document which was last saved on June 1 2006. The parts concerned are accessible at the URLs given as target for the two <ptr> elements.

<author>

<author> in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement]

Module core

Attributes

Attributes att.global.responsibility (@cert, @resp)

n

(number) gives a number (or other label) for an element, which is not necessarily unique within the document.

Derived from	att.global
Status	Recommended
Datatype	`data.text`

role

may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.

Derived from	att.naming
Status	Recommended
Datatype	1–∞ occurrences of `data.enumerated` separated by whitespace
Legal values are:	Annotator Infrastructure Transcription

Member of

model.respLike

Contained by

header: titleStmt

May contain

core: date ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element author
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute n { data.text }?,
   attribute role
   {
      list { ( "Annotator" | "Infrastructure" | "Transcription" )+ }
   }?,
   macro.phraseSeq
}
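The constraint on role in this declaration (one or more whitespace-separated values from a closed list) can be checked with a few lines of Python. This is a minimal sketch with a hypothetical helper name, `invalid_roles`; the three legal values are copied from the declaration above.

```python
# Legal values of @role on author in this customization.
ALLOWED_AUTHOR_ROLES = {"Annotator", "Infrastructure", "Transcription"}


def invalid_roles(role_value: str) -> list:
    """Return the tokens of a role attribute value that the declaration does not allow."""
    tokens = role_value.split()
    if not tokens:
        # The declaration requires at least one value when the attribute is present.
        raise ValueError("role must contain at least one value")
    return [token for token in tokens if token not in ALLOWED_AUTHOR_ROLES]


if __name__ == "__main__":
    print(invalid_roles("Annotator Transcription"))
    print(invalid_roles("Annotator Editor"))
```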

Example

<author>British Broadcasting Corporation</author>
<author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author>
<author>Anonymous</author>
<author>Bill and Melinda Gates Foundation</author>
<author>
<persName>Beaumont, Francis</persName> and
<persName>John Fletcher</persName>
</author>
<author>
<orgName key="BBC">British Broadcasting
Corporation</orgName>: Radio 3 Network
</author>

Note

Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use a generally recognized name authority file to supply the content for this element. The attributes key or ref may also be used to reference canonical information about the author(s) intended from any appropriate authority, such as a library catalogue or online resource.

In the case of a broadcast, use this element for the name of the company or network responsible for making the broadcast.

Where an author is unknown or unspecified, this element may contain text such as Unknown or Anonymous. When the appropriate TEI modules are in use, it may also contain detailed tagging of the names used for people, organizations or places, in particular where multiple names are given.

<authority>

<authority> (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. Publication, Distribution, Licensing, etc.]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Member of	model.publicationStmtPart.agency
Contained by	header: publicationStmt
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element authority { att.global.attributes, macro.phraseSeq.limited }
Example	<authority>John Smith</authority>

<availability>

<availability> supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.]

Module header

Attributes

Attributes att.global.responsibility (@cert, @resp)

status

supplies a code identifying the current availability of the text.

Status	Required
Datatype	`data.enumerated`
Legal values are:	free (the text is freely available.) unknown (the status of the text is unknown.) [Default] restricted (the text is not freely available.)

Member of

model.publicationStmtPart.detail

Contained by

header: publicationStmt

May contain

core: p

Declaration

element availability
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute status { "free" | "unknown" | "restricted" },
   ( model.availabilityPart | model.pLike )+
}

Example

<availability status="restricted">
Available for academic research purposes only.
</availability>
<availability status="free">
In the public domain
</availability>
<availability status="restricted">
Available under licence from the publishers.
</availability>

Example

<availability>
<licence target="http://opensource.org/licenses/MIT">
 The MIT License
 applies to this document.
 Copyright (C) 2011 by The University of Victoria
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
 in the Software without restriction, including without limitation the rights
 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 copies of the Software, and to permit persons to whom the Software is
 furnished to do so, subject to the following conditions:
 The above copyright notice and this permission notice shall be included in
 all copies or substantial portions of the Software.
 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
 THE SOFTWARE.
</licence>
</availability>

Note

A consistent format should be adopted.

<change>

<change> documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.5. The Revision Description 2.4.1. Creation 11.7. Changes]

Module header

Attributes

Attributes att.ascribed (@who) att.datable (@calendar, @period) (att.datable.w3c (@when, @notBefore, @notAfter, @from, @to)) (att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso)) (att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)) att.docStatus (@status) att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype)

target

points to one or more elements that belong to this change.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace

Contained by

header: revisionDesc

May contain

core: date label list p ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element change
{
   att.ascribed.attributes,
   att.datable.attributes,
   att.docStatus.attributes,
   att.global.attributes,
   att.typed.attributes,
   attribute target { list { data.pointer+ } }?,
   macro.specialPara
}

Example

<titleStmt>
<title> ... </title>
<editor xml:id="LDB">Lou Burnard</editor>
<respStmt xml:id="BZ">
 <resp>copy editing</resp>
 <name>Brett Zamir</name>
</respStmt>
</titleStmt>

<revisionDesc status="published">
<change status="public" when="2008-02-02"
 who="#BZ">Finished chapter 23</change>
<change status="draft" when="2008-01-02"
 who="#BZ">Finished chapter 2</change>
<change n="P2.2" when="1991-12-21"
 who="#LDB">Added examples to section 3</change>
<change when="1991-11-11" who="#MSM">Deleted chapter 10</change>
</revisionDesc>

Example

<profileDesc>
<creation>
 <listChange>
 <change xml:id="DRAFT1">First draft in pencil</change>
 <change notBefore="1880-12-09"
 xml:id="DRAFT2">First revision, mostly
 using green ink</change>
 <change notBefore="1881-02-13"
 xml:id="DRAFT3">Final corrections as
 supplied to printer.</change>
 </listChange>
</creation>
</profileDesc>

Note

The who attribute may be used to point to any other element, but will typically specify a <respStmt> or <person> element elsewhere in the header, identifying the person responsible for the change and their role in making it.

It is recommended that changes be recorded with the most recent first. The status attribute may be used to indicate the status of a document following the change documented.

<date>

<date> contains a date in any format. [3.5.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.5. The Revision Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 15.2.3. The Setting Description 13.3.6. Dates and Times]

Module core

Attributes

Attributes att.typed (subtype, @type)

when

supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.

Derived from	att.datable.w3c
Status	Required
Datatype	`data.temporal.iso`

Member of

model.dateLike model.publicationStmtPart.detail

Contained by

core: author date editor item label p ref title

header: authority change extent language publicationStmt tagUsage

namesdates: affiliation forename orgName persName surname

May contain

core: date ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element date
{
   att.typed.attribute.type,
   attribute when { data.temporal.iso },
   ( text | model.gLike | model.phrase | model.global )*
}

Example

<date when="1980-02">early February 1980</date>

Example

Given on the <date when="1977-06-12">Twelfth Day
of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the Republic
the Two Hundredth and first and of the University the Eighty-Sixth.</date>

Example

<date when="1990-09">September 1990</date>

<editor>

<editor> contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.11.2.2. Titles, Authors, and Editors]

Module core

Attributes

Attributes att.global.responsibility (@cert, @resp)

n

(number) gives a number (or other label) for an element, which is not necessarily unique within the document.

Derived from	att.global
Status	Recommended
Datatype	`data.text`

role

Derived from	att.naming
Status	Recommended
Datatype	1–∞ occurrences of `data.enumerated` separated by whitespace
Legal values are:	CorpusEditor

Member of

model.respLike

Contained by

header: titleStmt

May contain

core: date ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element editor
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute n { data.text }?,
   attribute role { list { "CorpusEditor"+ } }?,
   macro.phraseSeq
}

Example

<editor>Eric Johnson</editor>
<editor role="illustrator">John Tenniel</editor>

Note

A consistent format should be adopted.

Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use generally recognized authority lists for the exact form of personal names.

<editorialDecl>

<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements]
Module	header
Attributes	Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	core: p header: normalization segmentation
Declaration	element editorialDecl { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( model.pLike | model.editorialDeclPart )+ }
Example	<editorialDecl> <normalization> <p>All words converted to Modern American spelling using Websters 9th Collegiate dictionary </p> </normalization> <quotation marks="all"> <p>All opening quotation marks converted to “ all closing quotation marks converted to ”.</p> </quotation> </editorialDecl>

<encodingDesc>

<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. For LAUDATIO: The element encodingDesc refers to the different formats of the corpus. For each format, a single encodingDesc contains information about what kind of annotation, including keys and values, and what kind of formats were used. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Member of	model.teiHeaderPart
Contained by	header: teiHeader
May contain	core: p header: appInfo editorialDecl projectDesc tagsDecl
Declaration	element encodingDesc { att.global.attributes, ( ( model.encodingDescPart | model.pLike )+ ) }
Example	A corpus is available in EXMARaLDA format. The annotations were applied with the help of the EXMARaLDA Partitur Editor in a certain version, see appInfo. Information about the minimal segmentation in the format and the normalizations of the primary data is given in segmentation and normalization. The tagsDecl lists the annotation keys and values which are available in EXMARaLDA. Each namespace has @rend with a fixed set of values which classify the annotations according to their basic functions, such as 'Transcription', 'Syntactical' and 'Morphological'. This enables searching for certain annotations in more than one corpus in the LAUDATIO-Repository.

<encodingDesc n="1">
<appInfo>
 <application ident="EXMARaLDA" version="1.5.1">
  <label>EXMARaLDA Partitur Editor</label>
 </application>
</appInfo>
<projectDesc>
 <p>
  <ref target="http://korpling.german.hu-berlin.de/ridges/"/>The RIDGES project (Register in Diachronic German Science) is an investigation into the development of the German scientific language in the early modern and modern periods, ranging from the mid 16th to the late 19th century. </p>
</projectDesc>
<editorialDecl>
 <segmentation>
  <p>Multiple segmentations on the basis of "dipl", "clean" and "norm".</p>
 </segmentation>
 <normalization>
  <p>Two normalizations are provided by the layer "clean" containing normalizations of special characters and by the layer "norm" containing normalizations of orthography.</p>
 </normalization>
</editorialDecl>
<tagsDecl>
 <namespace name="dipl" rend="Transcription" xml:id="dipl">
  <tagUsage gi="NA">The diplomatic transcription of the word form as found on the manuscript. Line-breaks are marked as in the text, usually as 'U+2E17'.</tagUsage>
 </namespace>
 <namespace name="pos" rend="Lexical" xml:id="pos">
  <tagUsage gi="ADJA">Attributives Adjektiv.</tagUsage>
  <tagUsage gi="ADJD">Adverbiales oder prädikatives Adjektiv.</tagUsage>
  <tagUsage gi="ADV">Adverb.</tagUsage>
 </namespace>
</tagsDecl>
</encodingDesc>
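To illustrate how the @rend classification on namespace could support searching across corpora, the following Python sketch groups the annotation values of a tagsDecl by function. The function name `tags_by_function` and the shortened sample are assumptions for the example; the sketch does not describe how the LAUDATIO-Repository itself implements its search.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Shortened tagsDecl modelled on the example above (descriptions omitted).
SAMPLE_TAGS_DECL = """
<tagsDecl>
  <namespace name="dipl" rend="Transcription">
    <tagUsage gi="NA"/>
  </namespace>
  <namespace name="pos" rend="Lexical">
    <tagUsage gi="ADJA"/>
    <tagUsage gi="ADJD"/>
  </namespace>
</tagsDecl>
"""


def tags_by_function(tags_decl_xml: str) -> dict:
    """Map each @rend function to the (annotation key, value) pairs declared under it."""
    root = ET.fromstring(tags_decl_xml)
    grouped = defaultdict(list)
    for namespace in root.iter("namespace"):
        for usage in namespace.findall("tagUsage"):
            grouped[namespace.get("rend")].append((namespace.get("name"), usage.get("gi")))
    return dict(grouped)


if __name__ == "__main__":
    for function, pairs in tags_by_function(SAMPLE_TAGS_DECL).items():
        print(function, pairs)
```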

<extent>

<extent> describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 10.7.1. Object Description]

Module header

Attributes

Attributes att.global.responsibility (@cert, @resp) att.typed (type, @subtype)

type

(You must indicate which type of extent you refer to) characterizes the element in some sense, using any convenient classification scheme or typology.

Derived from	att.typed
Status	Required
Datatype	`data.enumerated`
Legal values are:	Tokens (The size of the corpus is given in tokens.) Words (The size of the corpus is given in words.)

Contained by

header: fileDesc

May contain

core: date ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element extent
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   att.typed.attribute.subtype,
   attribute type { "Tokens" | "Words" },
   macro.phraseSeq
}

Example

<extent>3200 sentences</extent>
<extent>between 10 and 20 Mb</extent>
<extent>ten 3.5 inch high density diskettes</extent>

Example

The <measure> element may be used to supply normalised or machine-tractable versions of the size or sizes concerned.

<extent>
<measure quantity="4.2" unit="MiB">About four megabytes</measure>
<measure quantity="245" unit="pages">245 pages of source
material</measure>
</extent>
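
Example

The examples above are taken from the general TEI Guidelines and carry no type attribute, which this customization declares as required with the values "Tokens" or "Words". A minimal sketch of an <extent> that follows the declaration above, reusing the token count given in the <fileDesc> example of this documentation:

```xml
<!-- Corpus size in tokens; type is restricted to "Tokens" or "Words" -->
<extent type="Tokens">60720</extent>
```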

<fileDesc>

<fileDesc> (file description) [2.2. The File Description 2.1.1. The TEI Header and Its Components]
Module	header
Attributes	Attributes att.global.responsibility (@cert, @resp)
Contained by	header: teiHeader
May contain	header: extent publicationStmt sourceDesc titleStmt
Declaration	element fileDesc { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( ( titleStmt, editionStmt?, extent?, publicationStmt, seriesStmt?, notesStmt? ), sourceDesc+ ) }
Example	The <fileDesc> refers to the whole corpus. Each corpus has one or more editors and annotators and may have some technical supporters, see editor and author. All project and institution references can be made in affiliation for a person or in publicationStmt in general. All documents of a corpus are listed in sourceDesc with a technical short name. <fileDesc> <titleStmt> <title type="Corpus">RIDGES Herbology Version 2.0</title> <editor n="1" role="CorpusEditor"> <persName> <forename>Anke</forename> <surname>Lüdeling</surname> </persName> <affiliation> <orgName type="Department">Institut für deutsche Sprache und Linguistik</orgName> <orgName type="Institution">Humboldt-Universität zu Berlin</orgName> </affiliation> </editor> <author n="1" role="Infrastructure"> <persName> <forename>Carolin</forename> <surname>Odebrecht</surname> </persName> <affiliation> <orgName type="Department">Institut für deutsche Sprache und Linguistik</orgName> <orgName type="Institution">Humboldt-Universität zu Berlin</orgName> </affiliation> </author> </titleStmt> <extent type="Tokens">60720</extent> <publicationStmt> <authority>Humboldt-Universität zu Berlin</authority> <idno>RIDGES Herbology Project.</idno> <availability status="free"> <p>Open Source Project. All corpus data generated by the RIDGES project is licensed under a Creative Commons Attribution 3.0 Unported License.</p> </availability> <date type="CorpusRelease" when="2011">First corpus release.</date> <date type="CorpusRelease" when="2012-05-31">Second corpus release.</date> </publicationStmt> <sourceDesc> <list type="CorpusDocument"> <item corresp="deutsche.pflanzennamen.1870" n="1"/> <item corresp="noch.einige.worte.1840" n="2"/> <item corresp="flora.7.1821" n="3"/> </list> </sourceDesc> </fileDesc>
Note	The major source of information for those seeking to create a catalogue entry or bibliographic citation for an electronic file. As such, it provides a title and statements of responsibility together with details of the publication or distribution of the file, of any series to which it belongs, and detailed bibliographic notes for matters not addressed elsewhere in the header. It also contains a full bibliographic description for the source or sources from which the electronic text was derived.

<forename>

<forename> contains a forename, given or baptismal name. [13.2.1. Personal Names]
Module	namesdates
Attributes	Attributes att.global.responsibility (@cert, @resp)
Member of	model.persNamePart
Contained by	core: author date editor item label p ref title header: authority change extent language tagUsage namesdates: affiliation forename orgName persName surname
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element forename { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, macro.phraseSeq }
Example	<persName> <roleName>Ex-President</roleName> <forename>George</forename> <surname>Bush</surname> </persName>

<idno>

<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.11.2.4. Imprint, Size of a Document, and Reprint Information]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.sortable (@sortKey)

type

categorizes the identifier, for example as an ISBN, Social Security number, etc.

Status	Optional
Datatype	`data.enumerated`

Member of

model.nameLike model.publicationStmtPart.detail

Contained by

core: author date editor item label p ref title

header: authority change extent idno language publicationStmt tagUsage

namesdates: affiliation forename orgName persName surname

May contain

header: idno

Declaration

element idno
{
   att.global.attributes,
   att.sortable.attributes,
   attribute type { data.enumerated }?,
   ( text | model.gLike | idno )*
}

Example

<idno type="ISBN">978-1-906964-22-1</idno>
<idno type="ISSN">0143-3385</idno>
<idno type="DOI">10.1000/123</idno>
<idno type="URI">http://www.worldcat.org/oclc/185922478</idno>
<idno type="URI">http://authority.nzetc.org/463/</idno>
<idno type="LT">Thomason Tract E.537(17)</idno>
<idno type="Wing">C695</idno>
<idno type="oldCat">
<g ref="#sym"/>345
</idno>

In the last case, the identifier includes a non-Unicode character which is defined elsewhere by means of a <glyph> or <char> element referenced here as #sym.

Note

idno should be used for labels which identify an object or concept in a formal cataloguing system such as a database or an RDF store, or in a distributed system such as the World Wide Web. Some suggested values for type on idno are ISBN, ISSN, DOI, and URI.

<item>

<item> contains one component of a list. [3.7. Lists 2.5. The Revision Description]

Module core

Attributes

Attributes att.global.responsibility (@cert, @resp)

n

(number) gives a number (or other label) for an element, which is not necessarily unique within the document.

Derived from	att.global
Status	Required
Datatype	`data.text`

corresp

Status

Required

Contained by

core: list

May contain

core: date label list p ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element item
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute n { data.text },
   attribute corresp { text },
   macro.specialPara
}

Example

<list rend="numbered">
<head>Here begin the chapter headings of Book IV</head>
<item n="4.1">The death of Queen Clotild.</item>
<item n="4.2">How King Lothar wanted to appropriate one third of the Church revenues.</item>
<item n="4.3">The wives and children of Lothar.</item>
<item n="4.4">The Counts of the Bretons.</item>
<item n="4.5">Saint Gall the Bishop.</item>
<item n="4.6">The priest Cato.</item>
<item> ...</item>
</list>
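
Example

The example above comes from the general TEI Guidelines; its items omit corresp, and its list has no type, both of which the declarations in this customization require. A sketch of the corpus-document usage, taken from the <sourceDesc> of the <fileDesc> example in this documentation, where each item carries a running number in n and the technical short name of a document in corresp:

```xml
<list type="CorpusDocument">
  <!-- n: running number; corresp: technical short name of the document -->
  <item corresp="deutsche.pflanzennamen.1870" n="1"/>
  <item corresp="noch.einige.worte.1840" n="2"/>
  <item corresp="flora.7.1821" n="3"/>
</list>
```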

Note

May contain simple prose or a sequence of chunks.

Whatever string of characters is used to label a list item in the copy text may be used as the value of the global n attribute, but it is not required that numbering be recorded explicitly. In ordered lists, the n attribute on the item element is by definition synonymous with the use of the label element to record the enumerator of the list item. In glossary lists, however, the term being defined should be given with the label element, not n.

<label>

<label> contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.7. Lists]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) att.placement (@place)
Member of	model.labelLike
Contained by	core: item list p ref title header: application change tagUsage
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element label { att.global.attributes, att.typed.attributes, att.placement.attributes, macro.phraseSeq }
Example	Labels are commonly used for the headwords in glossary lists; note the use of the global xml:lang attribute to set the default language of the glossary list to Middle English, and identify the glosses and headings as modern English or Latin: <list type="gloss" xml:lang="enm"> <head xml:lang="en">Vocabulary</head> <headLabel xml:lang="en">Middle English</headLabel> <headItem xml:lang="en">New English</headItem> <label>nu</label> <item xml:lang="en">now</item> <label>lhude</label> <item xml:lang="en">loudly</item> <label>bloweth</label> <item xml:lang="en">blooms</item> <label>med</label> <item xml:lang="en">meadow</item> <label>wude</label> <item xml:lang="en">wood</item> <label>awe</label> <item xml:lang="en">ewe</item> <label>lhouth</label> <item xml:lang="en">lows</item> <label>sterteth</label> <item xml:lang="en">bounds, frisks (cf. <cit> <ref>Chaucer, K.T.644</ref> <quote>a courser, <term>sterting</term>as the fyr</quote> </cit> </item> <label>verteth</label> <item xml:lang="la">pedit</item> <label>murie</label> <item xml:lang="en">merrily</item> <label>swik</label> <item xml:lang="en">cease</item> <label>naver</label> <item xml:lang="en">never</item> </list>
Example	Labels may also be used to record explicitly the numbers or letters which mark list items in ordered lists, as in this extract from Gibbon's Autobiography. In this usage the label element is synonymous with the n attribute on the item element: I will add two facts, which have seldom occurred in the composition of six, or at least of five quartos. <list rend="runon" type="ordered"> <label>(1)</label> <item>My first rough manuscript, without any intermediate copy, has been sent to the press.</item> <label>(2) </label> <item>Not a sheet has been seen by any human eyes, excepting those of the author and the printer: the faults and the merits are exclusively my own.</item> </list>
Example	Labels may also be used for other structured list items, as in this extract from the journal of Edward Gibbon: <list type="gloss"> <label>March 1757.</label> <item>I wrote some critical observations upon Plautus.</item> <label>March 8th.</label> <item>I wrote a long dissertation upon some lines of Virgil.</item> <label>June.</label> <item>I saw Mademoiselle Curchod — <quote xml:lang="la">Omnia vincit amor, et nos cedamus amori.</quote> </item> <label>August.</label> <item>I went to Crassy, and staid two days.</item> </list> Note that the label might also appear within the item rather than as its sibling. Though syntactically valid, this usage is not recommended TEI practice.
Example	Labels may also be used to represent a label or heading attached to a paragraph or sequence of paragraphs not treated as a structural division, or to a group of verse lines. Note that, in this case, the label element appears within the p or <lg> element, rather than as a preceding sibling of it. <p>[...] <lb/>& n’entrer en mauuais & mal-heu- <lb/>ré meſnage. Or des que le conſente- <lb/>ment des parties y eſt le mariage eſt <lb/> arreſté, quoy que de faict il ne ſoit <label place="margin">Puiſſance maritale entre les Romains.</label> <lb/> conſommé. Depuis la conſomma- <lb/>tion du mariage la femme eſt ſoubs <lb/> la puiſſance du mary, s’il n’eſt eſcla- <lb/>ue ou enfant de famille : car en ce <lb/> cas, la femme, qui a eſpouſé vn en- <lb/>fant de famille, eſt ſous la puiſſance [...]</p> In this example the text of the label appears in the right hand margin of the original source, next to the paragraph it describes, but approximately in the middle of it. If so desired the type attribute may be used to distinguish different categories of label.

<language>

<language> characterizes a single language or sublanguage used within a text. [2.4.2. Language Usage]

Module header

Attributes

Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)

style

contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text

Derived from	att.global.rendition
Status	Required
Datatype	`data.text`
Legal values are:	Language LanguageType LanguageArea

ident

(identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global xml:lang attribute.

Status	Required
Datatype	`data.language`

Contained by

header: langUsage

May contain

core: date ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element language
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute style { "Language" | "LanguageType" | "LanguageArea" },
   attribute ident { data.language },
   macro.phraseSeq.limited
}

Example

<langUsage>
<language ident="en-US" usage="75">modern American English</language>
<language ident="i-az-Arab" usage="20">Azerbaijani in Arabic script</language>
<language ident="x-lap" usage="05">Pig Latin</language>
</langUsage>
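
Example

The example above is the stock one from the TEI Guidelines; it uses the usage attribute and omits style, whereas the declaration above requires style and ident and does not list usage. A sketch that follows this customization, taken from the <profileDesc> example in this documentation:

```xml
<langUsage>
  <!-- style names the kind of information: Language, LanguageType or LanguageArea -->
  <language ident="de" style="Language">Early New High German, Modern German</language>
  <language ident="de" style="LanguageType">UpperGerman</language>
  <language ident="de" style="LanguageArea">Bavarian</language>
</langUsage>
```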

Note

Particularly for sublanguages, an informal prose characterization should be supplied as content for the element.

<langUsage>

<langUsage> (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. Language Usage 2.4. The Profile Description 15.3.2. Declarable Elements]
Module	header
Attributes	Attributes att.global.responsibility (@cert, @resp)
Member of	model.profileDescPart
Contained by	header: profileDesc
May contain	header: language
Declaration	element langUsage { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, language+ }
Example	<langUsage> <language ident="fr-CA" usage="60">Québecois</language> <language ident="en-CA" usage="20">Canadian business English</language> <language ident="en-GB" usage="20">British English</language> </langUsage>

<list>

<list> contains any sequence of items organized as a list. [3.7. Lists]

Module core

Attributes

Attributes att.global.responsibility (@cert, @resp) att.typed (type, @subtype)

type

List of documents in the corpus with corresponding references.

Derived from	att.typed
Status	Required
Datatype	`data.enumerated`
Legal values are:	CorpusDocument
Note	Previous versions of these Guidelines recommended the use of type on list to encode the rendering or appearance of a list (whether it was bulleted, numbered, etc.). The current recommendation is to use the rend or style attributes for these aspects of a list, while using type for the more appropriate task of characterizing the nature of the content of a list.
Note	The formal syntax of the element declarations allows label tags to be omitted from lists tagged <list type="gloss">; this is however a semantic error.

Member of

model.listLike

Contained by

core: item p ref title

header: change revisionDesc sourceDesc tagUsage

May contain

core: item label

Declaration

element list
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   att.typed.attribute.subtype,
   attribute type { "CorpusDocument" },
   (
      ( ( model.divTop ) | ( model.global ) )*,
      (
         ( item, model.global* )+
       | (
            headLabel?,
            headItem?,
            ( label, model.global*, item, model.global* )+
         )
      ),
      ( ( model.divBottom ), model.global* )*
   )
}

Schematron

<sch:rule context="tei:list[@type='gloss']">
<sch:assert test="tei:label">The content of a "gloss" list should include a sequence of one or more pairs of a label element followed by an item element</sch:assert></sch:rule>

Example

<list rend="numbered">
<item>a butcher</item>
<item>a baker</item>
<item>a candlestick maker, with <list rend="bulleted">
 <item>rings on his fingers</item>
 <item>bells on his toes</item>
 </list>
</item>
</list>

Example

<list rend="bulleted" type="syllogism">
<item>All Cretans are liars.</item>
<item>Epimenides is a Cretan.</item>
<item>ERGO Epimenides is a liar.</item>
</list>

Example

<list rend="simple" type="litany">
<item>God save us from drought.</item>
<item>God save us from pestilence.</item>
<item>God save us from wickedness in high places.</item>
<item>Praise be to God.</item>
</list>

Example

The following example treats the short numbered clauses of Anglo-Saxon legal codes as lists of items. The text is from an ordinance of King Athelstan (924–939):

<div1 type="section">
<head>Athelstan's Ordinance</head>
<list rend="numbered">
 <item n="1">Concerning thieves. First, that no thief is to be spared who is caught with
 the stolen goods, [if he is] over twelve years and [if the value of the goods is] over
 eightpence. <list rend="numbered">
 <item n="1.1">And if anyone does spare one, he is to pay for the thief with his
 wergild — and the thief is to be no nearer a settlement on that account — or to
 clear himself by an oath of that amount.</item>
 <item n="1.2">If, however, he [the thief] wishes to defend himself or to escape, he is
 not to be spared [whether younger or older than twelve].</item>
 <item n="1.3">If a thief is put into prison, he is to be in prison 40 days, and he may
 then be redeemed with 120 shillings; and the kindred are to stand surety for him
 that he will desist for ever.</item>
 <item n="1.4">And if he steals after that, they are to pay for him with his wergild,
 or to bring him back there.</item>
 <item n="1.5">And if he steals after that, they are to pay for him with his wergild,
 whether to the king or to him to whom it rightly belongs; and everyone of those who
 supported him is to pay 120 shillings to the king as a fine.</item>
 </list>
 </item>
 <item n="2">Concerning lordless men. And we pronounced about these lordless men, from whom
 no justice can be obtained, that one should order their kindred to fetch back such a
 person to justice and to find him a lord in public meeting. <list rend="numbered">
 <item n="2.1">And if they then will not, or cannot, produce him on that appointed day,
 he is then to be a fugitive afterwards, and he who encounters him is to strike him
 down as a thief.</item>
 <item n="2.2">And he who harbours him after that, is to pay for him with his wergild
 or to clear himself by an oath of that amount.</item>
 </list>
 </item>
 <item n="3">Concerning the refusal of justice. The lord who refuses justice and upholds
 his guilty man, so that the king is appealed to, is to repay the value of the goods and
 120 shillings to the king; and he who appeals to the king before he demands justice as
 often as he ought, is to pay the same fine as the other would have done, if he had
 refused him justice. <list rend="numbered">
 <item n="3.1">And the lord who is an accessory to a theft by his slave, and it becomes
 known about him, is to forfeit the slave and be liable to his wergild on the first
 occasion; if he does it more often, he is to be liable to pay all that he owns.</item>
 <item n="3.2">And likewise any of the king's treasurers or of our reeves, who has been
 an accessory of thieves who have committed theft, is to be liable to the same.</item>
 </list>
 </item>
 <item n="4">Concerning treachery to a lord. And we have pronounced concerning treachery to
 a lord, that he [who is accused] is to forfeit his life if he cannot deny it or is
 afterwards convicted at the three-fold ordeal.</item>
</list>
</div1>

Note that nested lists have been used so the tagging mirrors the structure indicated by the two-level numbering of the clauses. The clauses could have been treated as a one-level list with irregular numbering, if desired.

Example

These decrees, most blessed Pope Hadrian, we propounded in the public council ... and they
confirmed them in our hand in your stead with the sign of the Holy Cross, and afterwards
inscribed with a careful pen on the paper of this page, affixing thus the sign of the Holy
Cross. <list rend="simple">
 <item>I, Eanbald, by the grace of God archbishop of the holy church of York, have
 subscribed to the pious and catholic validity of this document with the sign of the Holy
 Cross.</item>
 <item>I, Ælfwold, king of the people across the Humber, consenting have subscribed with
 the sign of the Holy Cross.</item>
 <item>I, Tilberht, prelate of the church of Hexham, rejoicing have subscribed with the
 sign of the Holy Cross.</item>
 <item>I, Higbald, bishop of the church of Lindisfarne, obeying have subscribed with the
 sign of the Holy Cross.</item>
 <item>I, Ethelbert, bishop of Candida Casa, suppliant, have subscribed with the sign of
 the Holy Cross.</item>
 <item>I, Ealdwulf, bishop of the church of Mayo, have subscribed with devout will.</item>
 <item>I, Æthelwine, bishop, have subscribed through delegates.</item>
 <item>I, Sicga, patrician, have subscribed with serene mind with the sign of the Holy
 Cross.</item>
</list>

Note

May contain an optional heading followed by a series of items, or a series of label and item pairs, the latter being optionally preceded by one or two specialized headings.

<namespace>

<namespace> supplies the formal name of the namespace to which the elements documented by its children belong. [2.3.4. The Tagging Declaration]

Module header

Attributes

Attributes att.global (n, xml:lang, xml:base, xml:space, @xml:id) att.global.responsibility (@cert, @resp)

rend

(rendition) indicates how the element in question was rendered or presented in the source text.

Derived from	att.global.rendition
Status	Required
Datatype	1–∞ occurrences of `data.word` separated by whitespace
Legal values are:	Lexical Syntactical Morphological MarkUp Graphical Transcription Other Meta

corresp

Status

Recommended when applicable

name

specifies the full formal name of the namespace concerned.

Status	Required
Datatype	`data.text`

Contained by

header: tagsDecl

May contain

header: tagUsage

Declaration

element namespace
{
   att.global.attribute.xmlid,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute rend
   {
      list
      {
         (
            "Lexical"
          | "Syntactical"
          | "Morphological"
          | "MarkUp"
          | "Graphical"
          | "Transcription"
          | "Other"
          | "Meta"
         )+
      }
   },
   attribute corresp { text }?,
   attribute name { data.text },
   tagUsage+
}

Example

<namespace name="http://www.tei-c.org/ns/1.0">
<tagUsage gi="hi" occurs="28" render="#it"
withId="2"> Used only to mark English words
italicized in the copy text </tagUsage>
</namespace>
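
Example

The example above is the stock one from the TEI Guidelines and lacks the rend attribute that this customization requires. A sketch of the LAUDATIO usage, taken from the <encodingDesc> example in this documentation: name gives the annotation key, rend classifies it with one or more of the legal values listed above, and each <tagUsage> documents one annotation value.

```xml
<tagsDecl>
  <!-- rend must use the fixed vocabulary, e.g. "Lexical" or "Transcription" -->
  <namespace name="pos" rend="Lexical" xml:id="pos">
    <tagUsage gi="ADJA">Attributives Adjektiv.</tagUsage>
    <tagUsage gi="ADJD">Adverbiales oder prädikatives Adjektiv.</tagUsage>
    <tagUsage gi="ADV">Adverb.</tagUsage>
  </namespace>
</tagsDecl>
```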

<normalization>

<normalization> indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
Module	header
Attributes	Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
Member of	model.editorialDeclPart
Contained by	header: editorialDecl
May contain	core: p
Declaration	element normalization { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, model.pLike+ }
Example	<editorialDecl> <normalization method="markup"> <p>Where both upper- and lower-case i, j, u, v, and vv have been normalized, to modern 20th century typographical practice, the <gi>choice</gi> element has been used to enclose <gi>orig</gi> and <gi>reg</gi> elements giving the original and new values respectively. ... </p> </normalization> <normalization method="silent"> <p>Spacing between words and following punctuation has been regularized to zero spaces; spacing between words has been regularized to one space.</p> </normalization> <normalization source="http://www.dict.sztaki.hu/webster"> <p>Spelling converted throughout to Modern American usage, based on Websters 9th Collegiate dictionary.</p> </normalization> </editorialDecl>

<orgName>

<orgName> (organization name) contains an organizational name. [13.2.2. Organizational Names]
Module	namesdates
Attributes	Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.typed (subtype, @type)
Member of	model.nameLike.agent
Contained by	core: author date editor item label p ref title header: authority change extent language tagUsage namesdates: affiliation forename orgName persName surname
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element orgName { att.global.attribute.n, att.typed.attribute.type, macro.phraseSeq }
Example	About a year back, a question of considerable interest was agitated in the <orgName key="PAS1" type="voluntary"> <placeName key="PEN">Pennsyla.</placeName> Abolition Society </orgName>....

<p>

<p> (paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declaring (@decls) att.fragmentable (@part)
Member of	model.pLike
Contained by	core: item header: application availability change editorialDecl encodingDesc normalization projectDesc publicationStmt segmentation sourceDesc
May contain	core: date label list ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element p { att.global.attributes, att.declaring.attributes, att.fragmentable.attributes, macro.paraContent }
Example	<p>Hallgerd was outside. <q>There is blood on your axe,</q> she said. <q>What have you done?</q> </p> <p> <q>I have now arranged that you can be married a second time,</q> replied Thjostolf. </p> <p> <q>Then you must mean that Thorvald is dead,</q> she said. </p> <p> <q>Yes,</q> said Thjostolf. <q>And now you must think up some plan for me.</q> </p>

<persName>

<persName> (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc. [13.2.1. Personal Names]
Module	namesdates
Member of	model.nameLike.agent
Contained by	core: author date editor item label p ref title header: authority change extent language tagUsage namesdates: affiliation forename orgName persName surname
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element persName { macro.phraseSeq }
Example	<persName> <forename>Edward</forename> <forename>George</forename> <surname type="linked">Bulwer-Lytton</surname>, <roleName>Baron Lytton of <placeName>Knebworth</placeName> </roleName> </persName>

<profileDesc>

<profileDesc> (text-profile description) The langUsage element records the language or languages of the document. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components]
Module	header
Attributes	Attributes att.global.responsibility (@cert, @resp)
Member of	model.teiHeaderPart
Contained by	header: teiHeader
May contain	header: langUsage
Declaration	element profileDesc { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( model.profileDescPart* ) }
Example	<profileDesc> <langUsage> <language ident="de" style="Language">Early New High German, Modern German</language> <language ident="de" style="LanguageType">UpperGerman</language> <language ident="de" style="LanguageArea">Bavarian</language> </langUsage> </profileDesc>
Note	Although the content model permits it, it is rarely meaningful to supply multiple occurrences for any of the child elements of profileDesc unless these are documenting multiple texts. In earlier versions of these Guidelines, it was required that the <creation> element appear first.

<projectDesc>

<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 15.3.2. Declarable Elements]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declarable (@default)
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	core: p
Declaration	element projectDesc { att.global.attributes, att.declarable.attributes, model.pLike+ }
Example	<projectDesc> <p>Texts collected for use in the Claremont Shakespeare Clinic, June 1990</p> </projectDesc>

<publicationStmt>

<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Contained by	header: fileDesc
May contain	core: date p header: authority availability idno
Declaration	element publicationStmt { att.global.attributes, ( ( ( model.publicationStmtPart.agency ), model.publicationStmtPart.detail* )+ | model.pLike+ ) }
Example	<publicationStmt> <publisher>C. Muquardt </publisher> <pubPlace>Bruxelles & Leipzig</pubPlace> <date when="1846"/> </publicationStmt>
Example	<publicationStmt> <publisher>Chadwyck Healey</publisher> <pubPlace>Cambridge</pubPlace> <availability> <p>Available under licence only</p> </availability> <date when="1992">1992</date> </publicationStmt>
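Example	The two examples above are the stock ones from the TEI Guidelines; they use <publisher> and <pubPlace>, which the "May contain" row above does not list for this customization. A sketch that stays within the listed elements, shortened from the <fileDesc> example in this documentation, with the authority first and the identifier, availability and date following it:

```xml
<publicationStmt>
  <!-- The responsible authority comes first; details follow it -->
  <authority>Humboldt-Universität zu Berlin</authority>
  <idno>RIDGES Herbology Project.</idno>
  <availability status="free">
    <p>All corpus data generated by the RIDGES project is licensed under a Creative Commons Attribution 3.0 Unported License.</p>
  </availability>
  <date type="CorpusRelease" when="2012-05-31">Second corpus release.</date>
</publicationStmt>
```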
Note	Where a publication statement contains several members of the `model.publicationStmtPart` classes rather than one or more paragraphs or anonymous blocks, care should be taken to ensure that the repeated elements are presented in a meaningful order. It is a conformance requirement that elements supplying information about publication place, address, identifier, availability, and date be given following the name of the publisher, distributor, or authority concerned, and preferably in that order.

<ref>

<ref> (reference) defines a reference to another location, possibly modified by additional text or comment. [3.6. Simple Links and Cross-References 16.1. Links]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.pointing (@targetLang, @target, @evaluate) att.internetMedia (@mimeType) att.typed (@type, @subtype) att.declaring (@decls) att.cReferencing (@cRef)
Member of	model.ptrLike
Contained by	core: author date editor item label p ref title header: application authority change extent language tagUsage namesdates: affiliation forename orgName persName surname
May contain	core: date label list ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element ref { att.global.attributes, att.pointing.attributes, att.internetMedia.attributes, att.typed.attributes, att.declaring.attributes, att.cReferencing.attributes, macro.paraContent }
Schematron	<s:report test="@target and @cRef">Only one of the attributes @target and @cRef may be supplied on <s:name/></s:report>
Example	See especially <ref target="http://www.natcorp.ox.ac.uk/Texts/A02.xml#s2">the second sentence</ref>
Example	See also <ref target="#locution">s.v. <term>locution</term> </ref>.
Note	The target and cRef attributes are mutually exclusive.
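
The mutual-exclusion rule for target and cRef can also be checked outside Schematron. The following is a minimal sketch in Python using only the standard library; the function name `refs_with_both_pointers` and the sample fragment are illustrative assumptions, not part of the LAUDATIO schema or tooling.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

def refs_with_both_pointers(xml_text):
    """Return every TEI <ref> that carries both @target and @cRef."""
    root = ET.fromstring(xml_text)
    return [
        ref for ref in root.iter(f"{{{TEI_NS}}}ref")
        if "target" in ref.attrib and "cRef" in ref.attrib
    ]

# Hypothetical fragment: the second <ref> violates the rule.
sample = f"""<p xmlns="{TEI_NS}">
  <ref target="#locution">ok</ref>
  <ref target="#locution" cRef="A 1.2">conflict</ref>
</p>"""

offenders = refs_with_both_pointers(sample)
print([ref.text for ref in offenders])  # ['conflict']
```

A check like this only mirrors the Schematron report; the schema itself remains the authoritative definition.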

<revisionDesc>

<revisionDesc> (revision description) summarizes the revision history for a file. [2.5. The Revision Description 2.1.1. The TEI Header and Its Components]
Module	header
Attributes	Attributes att.global (xml:id, xml:lang, @n, @xml:base, @xml:space) att.global.rendition (style, rendition, @rend) att.global.responsibility (@cert, @resp)
Contained by	header: teiHeader
May contain	core: list header: change
Declaration	element revisionDesc { att.global.attribute.n, att.global.attribute.xmlbase, att.global.attribute.xmlspace, att.global.rendition.attribute.rend, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( list | listChange | change+ ) }
Example	<revisionDesc status="embargoed"> <change when="1991-11-11" who="#LB"> deleted chapter 10 </change> </revisionDesc>
Note	If present on this element, the status attribute should indicate the current status of the document. The same attribute may appear on any change to record the status at the time of that change. Conventionally change elements should be given in reverse date order, with the most recent change at the start of the list.
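
The convention that change elements appear in reverse date order can be verified mechanically. The sketch below is one possible check in Python; it assumes that every when value uses the same ISO-style format (for example YYYY-MM-DD), so that plain string comparison matches chronological order. The function name and the second change entry in the sample are hypothetical.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

def changes_in_reverse_date_order(revision_desc):
    """True if the direct <change> children are listed newest first.

    Assumes all @when values share one ISO-style format, so that
    string comparison agrees with chronological comparison.
    """
    dates = [c.get("when") for c in revision_desc.findall(f"{{{TEI_NS}}}change")]
    dates = [d for d in dates if d is not None]
    return all(a >= b for a, b in zip(dates, dates[1:]))

# Hypothetical revision history, most recent change first.
sample = ET.fromstring(f"""<revisionDesc xmlns="{TEI_NS}">
  <change when="1991-11-11" who="#LB">deleted chapter 10</change>
  <change when="1991-11-02" who="#LB">completed first draft</change>
</revisionDesc>""")

print(changes_in_reverse_date_order(sample))  # True
```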

<segmentation>

<segmentation> describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
Module	header
Attributes	Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
Member of	model.editorialDeclPart
Contained by	header: editorialDecl
May contain	core: p
Declaration	element segmentation { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, model.pLike+ }
Example	<segmentation> <p> <gi>s</gi> elements mark orthographic sentences and are numbered sequentially within their parent <gi>div</gi> element </p> </segmentation>
Example	<p> <gi>seg</gi> elements are used to mark functional constituents of various types within each <gi>s</gi>; the typology used is defined by a <gi>taxonomy</gi> element in the corpus header <gi>classDecl</gi> </p>

<sourceDesc>

<sourceDesc> (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. The Source Description]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declarable (@default)
Contained by	header: fileDesc
May contain	core: list p
Declaration	element sourceDesc { att.global.attributes, att.declarable.attributes, ( model.pLike+ | ( model.biblLike | model.sourceDescPart | model.listLike )+ ) }
Example	<sourceDesc> <bibl> <title level="a">The Interesting story of the Children in the Wood</title>. In <author>Victor E Neuberg</author>, <title>The Penny Histories</title>. <publisher>OUP</publisher> <date>1968</date>. </bibl> </sourceDesc>
Example	<sourceDesc> <p>Born digital: no previous source exists.</p> </sourceDesc>

<surname>

<surname> contains a family (inherited) name, as opposed to a given, baptismal, or nick name. [13.2.1. Personal Names]
Module	namesdates
Attributes	Attributes att.global.responsibility (@cert, @resp)
Member of	model.persNamePart
Contained by	core: author date editor item label p ref title header: authority change extent language tagUsage namesdates: affiliation forename orgName persName surname
May contain	core: date ref title header: idno namesdates: affiliation forename orgName persName surname
Declaration	element surname { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, macro.phraseSeq }
Example	<surname type="combine">St John Stevas</surname>

<tagsDecl>

<tagsDecl> (tagging declaration) provides detailed information about the tagging applied to a document. [2.3.4. The Tagging Declaration 2.3. The Encoding Description]

Module header

Attributes

Attributes att.global.responsibility (@cert, @resp)

partial

indicates whether the element types listed exhaustively include all those found within text, or represent only a subset.

Status	Recommended
Datatype	`data.truthValue`
Note	TEI recommended practice is to specify this attribute. When the tagUsage elements inside tagsDecl are used to list each of the element types in the associated text, the value should be given as false. When the tagUsage elements inside tagsDecl are used to provide usage information or default renditions for only a subset of the elements types within the associated text, the value should be true.

Member of

model.encodingDescPart

Contained by

header: encodingDesc

May contain

header: namespace

Declaration

element tagsDecl
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute partial { data.truthValue }?,
   ( rendition*, namespace* )
}

Example

<tagsDecl>
<rendition xml:id="rend-it">to be rendered in italic font</rendition>
<namespace name="http://www.tei-c.org/ns/1.0">
 <tagUsage gi="hi" occurs="467"
 render="#rend-it"/>
 <tagUsage gi="title" occurs="45"
 render="#rend-it"/>
</namespace>
<namespace name="http://docbook.org/ns/docbook">
 <tagUsage gi="para" occurs="10"/>
</namespace>
</tagsDecl>

<tagUsage>

<tagUsage> supplies information about the usage of a specific element within a text. [2.3.4. The Tagging Declaration]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))

gi

(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent namespace element.

Status	Required
Datatype	`data.text`

occurs

specifies the number of occurrences of this element within the text.

Status	Recommended
Datatype	`data.count`

withId

(with unique identifier) specifies the number of occurrences of this element within the text which bear a distinct value for the global xml:id attribute.

Status	Recommended
Datatype	`data.count`

render

specifies the identifier of a <rendition> element which defines how this element was rendered in the source text.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace

Contained by

header: namespace

May contain

core: date label list ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element tagUsage
{
   att.global.attributes,
   attribute gi { data.text },
   attribute occurs { data.count }?,
   attribute withId { data.count }?,
   attribute render { list { data.pointer+ } }?,
   macro.limitedContent
}

Example

<tagsDecl>
<rendition xml:id="it">Render using a slant or italic variant on the current font</rendition>

<namespace name="http://www.tei-c.org/ns/1.0">
 <tagUsage gi="hi" occurs="28" render="#it"
 withId="2"> Used to mark English words
 italicized in the copy text.</tagUsage>
 <tagUsage gi="foreign" render="#it">Used to mark non-English words in the copy text.</tagUsage>

</namespace>
</tagsDecl>
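
The values of occurs and withId are plain counts, so they can be derived from the encoded text rather than maintained by hand. The following Python sketch shows one way to compute them with the standard library; the function name `count_tag_usage` and the sample text are assumptions made for illustration, and the root element passed in is itself included in the count.

```python
import xml.etree.ElementTree as ET
from collections import Counter

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

def count_tag_usage(text_element):
    """Map each TEI element name to (occurs, withId) within text_element."""
    occurs, with_id = Counter(), Counter()
    prefix = f"{{{TEI_NS}}}"
    for el in text_element.iter():
        # Skip anything outside the TEI namespace (and non-element nodes).
        if not isinstance(el.tag, str) or not el.tag.startswith(prefix):
            continue
        gi = el.tag[len(prefix):]
        occurs[gi] += 1
        if XML_ID in el.attrib:
            with_id[gi] += 1
    return {gi: (occurs[gi], with_id[gi]) for gi in occurs}

# Hypothetical text: two <hi> elements, one of which bears an xml:id.
sample = ET.fromstring(f"""<text xmlns="{TEI_NS}"><body>
  <p><hi xml:id="h1">one</hi> and <hi>two</hi></p>
</body></text>""")

usage = count_tag_usage(sample)
print(usage["hi"])  # (2, 1)
```

Each resulting pair corresponds to the occurs and withId attributes of one tagUsage element for that generic identifier.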

<TEI>

<TEI> (TEI document) contains a single TEI-conformant document, containing a single TEI header, a single text, one or more members of the model.resourceLike class, or a combination of these. A series of TEI elements may be combined together to form a <teiCorpus> element. [4. Default Text Structure 15.1. Varieties of Composite Text]

Module textstructure

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))

version

specifies the major version number of the TEI Guidelines against which this document is valid.

Status	Optional
Datatype	`data.version`
Note	The major version number is historically prefixed by a P (for Proposal), and is distinct from the version number used for individual releases of the Guidelines, as used by (for example) the source of the <schemaSpec> element. The current version is P5.

Contained by

—

May contain

header: teiHeader

textstructure: text

Declaration

element TEI
{
   att.global.attributes,
   attribute version { data.version }?,
   ( teiHeader, ( ( model.resourceLike+, text? ) | text ) )
}

Schematron

<s:ns prefix="tei"
uri="http://www.tei-c.org/ns/1.0"/>
<s:ns prefix="xs"
uri="http://www.w3.org/2001/XMLSchema"/>

Schematron

<s:ns prefix="rng"
uri="http://relaxng.org/ns/structure/1.0"/>

Example

<TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0">
<teiHeader>
 <fileDesc>
 <titleStmt>
 <title>The shortest TEI Document Imaginable</title>
 </titleStmt>
 <publicationStmt>
 <p>First published as part of TEI P2, this is the P5
 version using a name space.</p>
 </publicationStmt>
 <sourceDesc>
 <p>No source: this is an original work.</p>
 </sourceDesc>
 </fileDesc>
</teiHeader>
<text>
 <body>
 <p>This is about the shortest TEI document imaginable.</p>
 </body>
</text>
</TEI>

Example

<TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0">
<teiHeader>
 <fileDesc>
 <titleStmt>
 <title>A TEI Document containing four page images </title>
 </titleStmt>
 <publicationStmt>
 <p>Unpublished demonstration file.</p>
 </publicationStmt>
 <sourceDesc>
 <p>No source: this is an original work.</p>
 </sourceDesc>
 </fileDesc>
</teiHeader>
<facsimile>
 <graphic url="page1.png"/>
 <graphic url="page2.png"/>
 <graphic url="page3.png"/>
 <graphic url="page4.png"/>
</facsimile>
</TEI>

Note

This element is required. It is customary to specify the TEI namespace http://www.tei-c.org/ns/1.0 on it, using the xmlns attribute.
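
Whether a file follows these basic requirements (a TEI root element in the TEI namespace, with teiHeader as its first child) can be tested before full schema validation. The Python sketch below is a rough pre-check under that assumption; the function name `tei_root_problems` is hypothetical, and the check does not replace validation against the schema.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

def tei_root_problems(xml_text):
    """Return a list of problems with the root element; empty if none."""
    root = ET.fromstring(xml_text)
    problems = []
    if root.tag != f"{{{TEI_NS}}}TEI":
        problems.append(f"root element is {root.tag}, not TEI in the TEI namespace")
    children = list(root)
    if not children or children[0].tag != f"{{{TEI_NS}}}teiHeader":
        problems.append("first child of the root is not a TEI-namespace teiHeader")
    return problems

good = f'<TEI xmlns="{TEI_NS}"><teiHeader/><text/></TEI>'
no_namespace = "<TEI><teiHeader/><text/></TEI>"

print(tei_root_problems(good))               # []
print(len(tei_root_problems(no_namespace)))  # 2
```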

<teiHeader>

<teiHeader> (TEI header) supplies the descriptive and declarative information making up an electronic title page for every TEI-conformant document. [2.1.1. The TEI Header and Its Components 15.1. Varieties of Composite Text]

Module header

Attributes

Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)

type

(You must identify the module of the LAUDATIO metadata schema, e.g. CorpusHeader, DocumentHeader, PreparationHeader) specifies the kind of document to which the header is attached, for example whether it is a corpus or individual text.

Status	Required
Datatype	`data.enumerated`
Legal values are:	CorpusHeader DocumentHeader PreparationHeader

Contained by

textstructure: TEI

May contain

header: encodingDesc fileDesc profileDesc revisionDesc

Declaration

element teiHeader
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute type { "CorpusHeader" | "DocumentHeader" | "PreparationHeader" },
   ( fileDesc, model.teiHeaderPart*, revisionDesc? )
}

Example

<teiHeader>
<fileDesc>
 <titleStmt>
 <title>Shakespeare: the first folio (1623) in electronic form</title>
 <author>Shakespeare, William (1564–1616)</author>
 <respStmt>
 <resp>Originally prepared by</resp>
 <name>Trevor Howard-Hill</name>
 </respStmt>
 <respStmt>
 <resp>Revised and edited by</resp>
 <name>Christine Avern-Carr</name>
 </respStmt>
 </titleStmt>
 <publicationStmt>
 <distributor>Oxford Text Archive</distributor>
 <address>
 <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>
 </address>
 <idno type="OTA">119</idno>
 <availability>
 <p>Freely available on a non-commercial basis.</p>
 </availability>
 <date when="1968">1968</date>
 </publicationStmt>
 <sourceDesc>
 <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,
 1968)</bibl>
 </sourceDesc>
</fileDesc>
<encodingDesc>
 <projectDesc>
 <p>Originally prepared for use in the production of a series of old-spelling
 concordances in 1968, this text was extensively checked and revised for use during the
 editing of the new Oxford Shakespeare (Wells and Taylor, 1989).</p>
 </projectDesc>
 <editorialDecl>
 <correction>
 <p>Turned letters are silently corrected.</p>
 </correction>
 <normalization>
 <p>Original spelling and typography is retained, except that long s and ligatured
 forms are not encoded.</p>
 </normalization>
 </editorialDecl>
 <refsDecl xml:id="ASLREF">
 <cRefPattern matchPattern="(\S+) ([^.]+)\.(.*)"
 replacementPattern="#xpath(//div1[@n='$1']/div2/[@n='$2']//lb[@n='$3'])">
 <p>A reference is created by assembling the following, in the reverse order as that
 listed here: <list>
 <item>the <att>n</att> value of the preceding <gi>lb</gi>
 </item>
 <item>a period</item>
 <item>the <att>n</att> value of the ancestor <gi>div2</gi>
 </item>
 <item>a space</item>
 <item>the <att>n</att> value of the parent <gi>div1</gi>
 </item>
 </list>
 </p>
 </cRefPattern>
 </refsDecl>
</encodingDesc>
<revisionDesc>
 <list>
 <item>
 <date when="1989-04-12">12 Apr 89</date> Last checked by CAC</item>
 <item>
 <date when="1989-03-01">1 Mar 89</date> LB made new file</item>
 </list>
</revisionDesc>
</teiHeader>

Note

One of the few elements unconditionally required in any TEI document.
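
In this customization the type attribute on teiHeader is required and restricted to three values. The following Python sketch illustrates how that constraint could be checked in a processing script; the function name `header_type_problem` and the sample documents are assumptions, and the legal values are copied from the declaration above.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
LEGAL_HEADER_TYPES = {"CorpusHeader", "DocumentHeader", "PreparationHeader"}

def header_type_problem(tei_root):
    """Return None if teiHeader/@type is legal, else a short message."""
    header = tei_root.find(f"{{{TEI_NS}}}teiHeader")
    if header is None:
        return "missing teiHeader"
    value = header.get("type")
    if value is None:
        return "teiHeader has no type attribute"
    if value not in LEGAL_HEADER_TYPES:
        return f"illegal teiHeader type: {value!r}"
    return None

corpus_doc = ET.fromstring(
    f'<TEI xmlns="{TEI_NS}"><teiHeader type="CorpusHeader"/></TEI>')
untyped_doc = ET.fromstring(
    f'<TEI xmlns="{TEI_NS}"><teiHeader/></TEI>')

print(header_type_problem(corpus_doc))   # None
print(header_type_problem(untyped_doc))  # teiHeader has no type attribute
```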

<text>

<text> contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 15.1. Varieties of Composite Text]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declaring (@decls) att.typed (@type, @subtype)
Contained by	textstructure: TEI
May contain	Empty element
Declaration	element text { att.global.attributes, att.declaring.attributes, att.typed.attributes, ( model.global, ( front, model.global )?, ( body | group ), model.global, ( back, model.global )? ) }
Example	<text> <front> <docTitle> <titlePart>Autumn Haze</titlePart> </docTitle> </front> <body> <l>Is it a dragonfly or a maple leaf</l> <l>That settles softly down upon the water?</l> </body> </text>
Example	The body of a text may be replaced by a group of nested texts, as in the following schematic: <text> <front> <!-- front matter for the whole group --> </front> <group> <text> <!-- first text --> </text> <text> <!-- second text --> </text> </group> </text>
Note	This element should not be used to represent a text which is inserted at an arbitrary point within the structure of another, for example as in an embedded or quoted narrative; the <floatingText> is provided for this purpose.

<title>

<title> contains a title for any kind of work. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement]

Module core

Attributes

Attributes att.global (xml:id, n, xml:lang, @xml:base, @xml:space) att.global.rendition (style, rendition, @rend) att.global.responsibility (@cert, @resp)

type

(You must specify the exact reference to LAUDATIO metadata module schema 'Corpus'.) classifies the title according to some convenient typology.

Derived from	att.typed
Status	Recommended
Datatype	`data.enumerated`
Legal values are:	Corpus (The title only refers to the LAUDATIO metadata object 'Corpus')
Note	This attribute is provided for convenience in analysing titles and processing them according to their type; where such specialized processing is not necessary, there is no need for such analysis, and the entire title, including subtitles and any parallel titles, may be enclosed within a single title element.

Member of

model.emphLike

Contained by

core: author date editor item label p ref title

header: authority change extent language tagUsage titleStmt

namesdates: affiliation forename orgName persName surname

May contain

core: date label list ref title

header: idno

namesdates: affiliation forename orgName persName surname

Declaration

element title
{
   att.global.attribute.xmlbase,
   att.global.attribute.xmlspace,
   att.global.rendition.attribute.rend,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute type { "Corpus" }?,
   macro.paraContent
}

Example

<title>Information Technology and the Research Process: Proceedings of
a conference held at Cranfield Institute of Technology, UK,
18–21 July 1989</title>

Example

<title>Hardy's Tess of the D'Urbervilles: a machine readable
edition</title>

Example

<title type="full">
<title type="main">Synthèse</title>
<title type="sub">an international journal for
epistemology, methodology and history of
science</title>
</title>

Note

The attributes key and ref, inherited from the class att.canonical may be used to indicate the canonical form for the title; the former, by supplying (for example) the identifier of a record in some external library system; the latter by pointing to an XML element somewhere containing the canonical form of the title.

<titleStmt>

<titleStmt> (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description]
Module	header
Attributes	Attributes att.global.responsibility (@cert, @resp)
Contained by	header: fileDesc
May contain	core: author editor title
Declaration	element titleStmt { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( title+, model.respLike* ) }
Example	<titleStmt> <title>Capgrave's Life of St. John Norbert: a machine-readable transcription</title> <respStmt> <resp>compiled by</resp> <name>P.J. Lucas</name> </respStmt> </titleStmt>

Schema teiODD_LAUDATIOCorpus_S7: Model classes

model.addressLike

model.addressLike groups elements used to represent a postal or email address. [1. The TEI Infrastructure]
Module	tei
Used by	model.pPart.data
Members	affiliation

model.applicationLike

model.applicationLike groups elements used to record application-specific information about a document in its header.
Module	tei
Used by	appInfo
Members	application

model.dateLike

model.dateLike groups elements containing temporal expressions. [3.5.4. Dates and Times 13.3.6. Dates and Times]
Module	tei
Used by	model.pPart.data
Members	date

model.divBottom

model.divBottom groups elements appearing at the end of a text division. [4.2. Elements Common to All Divisions]
Module	tei
Used by	list
Members	model.divBottomPart model.divWrapper

model.divPart

model.divPart groups paragraph-level elements appearing directly within divisions. [1.3. The TEI Class System]
Module	tei
Used by	macro.specialPara
Members	model.lLike model.pLike [p]
Note	Note that this element class does not include members of the model.inter class, which can appear either within or between paragraph-level items.

model.divTop

model.divTop groups elements appearing at the beginning of a text division. [4.2. Elements Common to All Divisions]
Module	tei
Used by	list
Members	model.divTopPart [model.headLike] model.divWrapper

model.divTopPart

model.divTopPart groups elements which can occur only at the beginning of a text division. [4.6. Title Pages]
Module	tei
Used by	model.divTop
Members	model.headLike

model.editorialDeclPart

model.editorialDeclPart groups elements which may be used inside editorialDecl and appear multiple times.
Module	tei
Used by	editorialDecl
Members	normalization segmentation

model.emphLike

model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. [3.3. Highlighting and Quotation]
Module	tei
Used by	model.highlighted model.limitedPhrase
Members	title

model.encodingDescPart

model.encodingDescPart groups elements which may be used inside encodingDesc and appear multiple times.
Module	tei
Used by	encodingDesc
Members	appInfo editorialDecl projectDesc tagsDecl

model.global

model.global groups elements which may appear at any point within a TEI text. [1.3. The TEI Class System]
Module	tei
Used by	date list macro.paraContent macro.phraseSeq macro.phraseSeq.limited macro.specialPara text
Members	model.global.edit model.global.meta model.milestoneLike model.noteLike

model.highlighted

model.highlighted groups phrase-level elements which are typographically distinct. [3.3. Highlighting and Quotation]
Module	tei
Used by	model.phrase
Members	model.emphLike [title] model.hiLike

model.inter

model.inter groups elements which can appear either within or between paragraph-like elements. [1.3. The TEI Class System]
Module	tei
Used by	macro.limitedContent macro.paraContent macro.specialPara
Members	model.biblLike model.egLike model.labelLike [label] model.listLike [list] model.oddDecl model.qLike [model.quoteLike] model.stageLike

model.labelLike

model.labelLike groups elements used to gloss or explain other parts of a document.
Module	tei
Used by	application model.inter
Members	label

model.limitedPhrase

model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. [1.3. The TEI Class System]
Module	tei
Used by	macro.limitedContent macro.phraseSeq.limited
Members	model.emphLike [title] model.hiLike model.pPart.data [model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno] ] model.pPart.editorial model.pPart.msdesc model.phrase.xml model.ptrLike [ref]

model.listLike

model.listLike groups list-like elements. [3.7. Lists]
Module	tei
Used by	model.inter sourceDesc
Members	list

model.nameLike

model.nameLike groups elements which name or refer to a person, place, or organization.
Module	tei
Used by	model.pPart.data
Members	model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno
Note	A superset of the naming elements that may appear in datelines, addresses, statements of responsibility, etc.

model.nameLike.agent

model.nameLike.agent groups elements which contain names of individuals or corporate bodies. [3.5. Names, Numbers, Dates, Abbreviations, and Addresses]
Module	tei
Used by	model.nameLike
Members	orgName persName
Note	This class is used in the content model of elements which reference names of people or organizations.

model.persNamePart

model.persNamePart groups elements which form part of a personal name. [13.2.1. Personal Names]
Module	namesdates
Used by	model.nameLike
Members	forename surname

model.phrase

model.phrase groups elements which can occur at the level of individual words or phrases. [1.3. The TEI Class System]
Module	tei
Used by	date macro.paraContent macro.phraseSeq macro.specialPara
Members	model.graphicLike model.highlighted [model.emphLike [title] model.hiLike] model.lPart model.pPart.data [model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno] ] model.pPart.edit [model.pPart.editorial model.pPart.transcriptional] model.pPart.msdesc model.phrase.xml model.ptrLike [ref] model.segLike model.specDescLike
Note	This class of elements can occur within paragraphs, list items, lines of verse, etc.

model.placeStateLike

model.placeStateLike groups elements which describe changing states of a place.
Module	tei
Used by	model.nameLike
Members	model.placeNamePart

model.pLike

model.pLike groups paragraph-like elements.
Module	tei
Used by	application availability editorialDecl encodingDesc model.divPart normalization projectDesc publicationStmt segmentation sourceDesc
Members	p

model.pPart.data

model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data. [3.5. Names, Numbers, Dates, Abbreviations, and Addresses]
Module	tei
Used by	model.limitedPhrase model.phrase
Members	model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno]

model.pPart.edit

model.pPart.edit groups phrase-level elements for simple editorial correction and transcription. [3.4. Simple Editorial Changes]
Module	tei
Used by	model.phrase
Members	model.pPart.editorial model.pPart.transcriptional

model.profileDescPart

model.profileDescPart groups elements which may be used inside profileDesc and appear multiple times.
Module	tei
Used by	profileDesc
Members	langUsage

model.ptrLike

model.ptrLike groups elements used for purposes of location and reference. [3.6. Simple Links and Cross-References]
Module	tei
Used by	application model.limitedPhrase model.phrase
Members	ref

model.publicationStmtPart.agency

model.publicationStmtPart.agency groups the child elements of a publicationStmt element of the TEI header that indicate an authorising agent. [2.2.4. Publication, Distribution, Licensing, etc.]
Module	tei
Used by	publicationStmt
Members	authority
Note	The ‘agency’ child elements, while not required, are required if one of the ‘detail’ child elements is to be used. It is not valid to have a ‘detail’ child element without a preceding ‘agency’ child element. See also model.publicationStmtPart.detail.

model.publicationStmtPart.detail

model.publicationStmtPart.detail groups the agency-specific child elements of the publicationStmt element of the TEI header. [2.2.4. Publication, Distribution, Licensing, etc.]
Module	tei
Used by	publicationStmt
Members	availability date idno
Note	A ‘detail’ child element may not occur unless an ‘agency’ child element precedes it. See also model.publicationStmtPart.agency.
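
The ordering rule for the two publicationStmtPart classes (no 'detail' child without a preceding 'agency' child) can be expressed as a short scan over the children of publicationStmt. The Python sketch below assumes the class memberships listed in this schema (authority as the only agency element; availability, date and idno as detail elements); the function name `detail_without_agency` is hypothetical.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
AGENCY = {"authority"}                    # model.publicationStmtPart.agency
DETAIL = {"availability", "date", "idno"} # model.publicationStmtPart.detail

def detail_without_agency(publication_stmt):
    """Return local names of 'detail' children that no 'agency' child precedes."""
    seen_agency = False
    orphans = []
    for child in publication_stmt:
        name = child.tag.split("}", 1)[-1]  # strip the namespace part
        if name in AGENCY:
            seen_agency = True
        elif name in DETAIL and not seen_agency:
            orphans.append(name)
    return orphans

ordered = ET.fromstring(f"""<publicationStmt xmlns="{TEI_NS}">
  <authority>Hypothetical Archive</authority>
  <idno>119</idno>
  <date when="1968"/>
</publicationStmt>""")
misordered = ET.fromstring(f"""<publicationStmt xmlns="{TEI_NS}">
  <date when="1968"/>
  <authority>Hypothetical Archive</authority>
</publicationStmt>""")

print(detail_without_agency(ordered))     # []
print(detail_without_agency(misordered))  # ['date']
```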

model.qLike

model.qLike groups elements related to highlighting which can appear either within or between chunk-level elements. [3.3. Highlighting and Quotation]
Module	tei
Used by	model.inter
Members	model.quoteLike

model.respLike

model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element.
Module	tei
Used by	titleStmt
Members	author editor

model.teiHeaderPart

model.teiHeaderPart groups high level elements which may appear more than once in a TEI header.
Module	tei
Used by	teiHeader
Members	encodingDesc profileDesc

Schema teiODD_LAUDATIOCorpus_S7: Attribute classes

att.ascribed

att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual. [3.3.3. Quotation 8.3. Elements Unique to Spoken Texts]

Module tei

Members change

Attributes

who

indicates the person, or group of people, to whom the element content is ascribed.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace
In the following example from Hamlet, speeches (<sp>) in the body of the play are linked to <castItem> elements in the <castList> using the who attribute. <castItem type="role"> <role xml:id="Barnardo">Bernardo</role> </castItem> <castItem type="role"> <role xml:id="Francisco">Francisco</role> <roleDesc>a soldier</roleDesc> </castItem> <!-- ... --> <sp who="#Barnardo"> <speaker>Bernardo</speaker> <l n="1">Who's there?</l> </sp> <sp who="#Francisco"> <speaker>Francisco</speaker> <l n="2">Nay, answer me: stand, and unfold yourself.</l> </sp>
Note	For transcribed speech, this will typically identify a participant or participant group; in other contexts, it will point to any identified <person> element.
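
In this schema the only member of att.ascribed is change, so who typically points from a revision entry to the person responsible. The Python sketch below shows one way to find who pointers that do not resolve to any xml:id in the same file. It is an illustration under stated assumptions: only local pointers of the form #id are checked, the function name `unresolved_who_pointers` is hypothetical, and the sample header is a well-formed fragment rather than a schema-valid document.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

def unresolved_who_pointers(root):
    """Return local @who pointers (#id) that match no xml:id under root."""
    ids = {el.get(XML_ID) for el in root.iter() if el.get(XML_ID)}
    missing = []
    for el in root.iter():
        # @who holds one or more pointers separated by whitespace.
        for pointer in el.get("who", "").split():
            if pointer.startswith("#") and pointer[1:] not in ids:
                missing.append(pointer)
    return missing

# Hypothetical header fragment: "#LB" resolves, "#XY" does not.
sample = ET.fromstring(f"""<teiHeader xmlns="{TEI_NS}" type="CorpusHeader">
  <fileDesc><titleStmt>
    <title>Sample</title>
    <editor><persName xml:id="LB">L. B.</persName></editor>
  </titleStmt></fileDesc>
  <revisionDesc>
    <change when="1991-11-11" who="#LB #XY">deleted chapter 10</change>
  </revisionDesc>
</teiHeader>""")

print(unresolved_who_pointers(sample))  # ['#XY']
```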

att.canonical

att.canonical provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced.

Module tei

Members att.naming [att.personal]

Attributes

key

provides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind.

Status	Optional
Datatype	`data.text`
<author> <name key="name 427308" type="organisation">[New Zealand Parliament, Legislative Council]</name> </author>
<author> <name key="Hugo, Victor (1802-1885)" ref="http://www.idref.fr/026927608">Victor Hugo</name> </author>
Note	The value may be a unique identifier from a database, or any other externally-defined string identifying the referent. No particular syntax is proposed for the values of the key attribute, since its form will depend entirely on practice within a given project. For the same reason, this attribute is not recommended in data interchange, since there is no way of ensuring that the values used by one project are distinct from those used by another. In such a situation, a preferable approach for magic tokens which follows standard practice on the Web is to use a ref attribute whose value is a tag URI as defined in RFC 4151.

ref

(reference) provides an explicit means of locating a full definition for the entity being named by means of one or more URIs.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace
<name ref="http://viaf.org/viaf/109557338" type="person">Seamus Heaney</name>
Note	The value must point directly to one or more XML elements or other resources by means of one or more URIs, separated by whitespace. If more than one is supplied the implication is that the name identifies several distinct entities.

att.cReferencing

att.cReferencing provides an attribute which may be used to supply a canonical reference as a means of identifying the target of a pointer.

Module tei

Members ref

Attributes

cRef

(canonical reference) specifies the destination of the pointer by supplying a canonical reference expressed using the scheme defined in a <refsDecl> element in the TEI header.

Status Optional

Datatype data.text

Note

The value of cRef should be constructed so that when the algorithm for the resolution of canonical references (described in section 16.2.5. Canonical References) is applied to it, the result is a valid URI reference to the intended target.

The <refsDecl> to use may be indicated with the decls attribute.

Currently these Guidelines only provide for a single canonical reference to be encoded on any given <ptr> element.

att.datable

att.datable provides attributes for normalization of elements that contain dates, times, or datable events. [3.5.4. Dates and Times 13.3.6. Dates and Times]

Module tei

Members application change

Attributes

Attributes att.datable.w3c (@when, @notBefore, @notAfter, @from, @to) att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso) att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)

calendar

indicates the system or calendar to which the date represented by the content of this element belongs.

Status	Optional
Datatype	`data.pointer`
Schematron	<sch:rule context="tei:*[@calendar]"> <sch:assert test="string-length(.) gt 0">@calendar indicates the system or calendar to which the date represented by the content of this element belongs, but this <sch:name/> element has no textual content.</sch:assert></sch:rule>
He was born on <date calendar="#Gregorian">Feb. 22, 1732</date> (<date calendar="#Julian" when="1732-02-22"> Feb. 11, 1731/32, O.S.</date>).

period

supplies a pointer to some location defining a named period of time within which the datable item is understood to have occurred.

Status	Optional
Datatype	`data.pointer`

Note

This ‘superclass’ provides attributes that can be used to provide normalized values of temporal information. By default, the attributes from the att.datable.w3c class are provided. If the module for names & dates is loaded, this class also provides attributes from the att.datable.iso and att.datable.custom classes. In general, the possible values of attributes restricted to the W3C datatypes form a subset of those values available via the ISO 8601 standard. However, the greater expressiveness of the ISO datatypes may not be needed, and there exists much greater software support for the W3C datatypes.

att.datable.custom

att.datable.custom provides attributes for normalization of elements that contain datable events to a custom dating system (i.e. other than the Gregorian used by W3C and ISO). [13.3.6. Dates and Times]

Module namesdates

Members att.datable [application change]

Attributes

when-custom

supplies the value of a date or time in some custom standard form.

Status Optional

Datatype 1–∞ occurrences of data.word separated by whitespace

The following are examples of custom date or time formats that are not valid ISO or W3C format normalizations, normalized to a different dating system:

Alhazen died in Cairo on the
<date when="1040-03-06"
when-custom="431-06-12"> 12th day of Jumada t-Tania, 430 AH
</date>.
The current world will end at the
<date when="2012-12-21"
when-custom="13.0.0.0.0">end of B'ak'tun 13</date>.
The Battle of Meggidu
(<date when-custom="Thutmose_III:23">23rd year of reign of Thutmose III</date>).
Esidorus bixit in pace annos LXX plus minus sub
<date when-custom="Ind:4-10-11">die XI mensis Octobris indictione IIII</date>

Not all custom date formulations will have Gregorian equivalents.

The when-custom attribute and the other custom dating attributes are not constrained to a datatype by the TEI, but individual projects are recommended to regularize and document their dating formats.

notBefore-custom

specifies the earliest possible date for the event in some custom standard form.

Status	Optional
Datatype	1–∞ occurrences of `data.word` separated by whitespace

notAfter-custom

specifies the latest possible date for the event in some custom standard form.

Status	Optional
Datatype	1–∞ occurrences of `data.word` separated by whitespace

from-custom

indicates the starting point of the period in some custom standard form.

Status	Optional
Datatype	1–∞ occurrences of `data.word` separated by whitespace
<event datingMethod="#julian" from-custom="1666-09-02" to-custom="1666-09-05" xml:id="FIRE1"> <head>The Great Fire of London</head> <p>The Great Fire of London burned through a large part of the city of London.</p> </event>

to-custom

indicates the ending point of the period in some custom standard form.

Status	Optional
Datatype	1–∞ occurrences of `data.word` separated by whitespace

datingPoint

supplies a pointer to some location defining a named point in time with reference to which the datable item is understood to have occurred

Status	Optional
Datatype	`data.pointer`

datingMethod

supplies a pointer to a <calendar> element or other means of interpreting the values of the custom dating attributes.

Status	Optional
Datatype	`data.pointer`
Contayning the Originall, Antiquity, Increaſe, Moderne eſtate, and deſcription of that Citie, written in the yeare <date calendar="#julian" datingMethod="#julian" when-custom="1598">1598</date>. by Iohn Stow Citizen of London.
Note	Here the calendar attribute points to a <calendar> element for the Julian calendar, specifying that the text content of the date element is a Julian date, and the datingMethod attribute also points to the Julian calendar to indicate that the content of the when-custom attribute value is Julian too.

att.datable.iso

att.datable.iso provides attributes for normalization of elements that contain datable events using the ISO 8601 standard. [3.5.4. Dates and Times 13.3.6. Dates and Times]

Module namesdates

Members att.datable [application change]

Attributes

when-iso

supplies the value of a date or time in a standard form.

Status	Optional
Datatype	`data.temporal.iso`
The following are examples of ISO date, time, and date & time formats that are not valid W3C format normalizations.
<date when-iso="1996-09-24T07:25+00">Sept. 24th, 1996 at 3:25 in the morning</date>
<date when-iso="1996-09-24T03:25-04">Sept. 24th, 1996 at 3:25 in the morning</date>
<time when-iso="1999-01-04T20:42-05">4 Jan 1999 at 8:42 pm</time>
<time when-iso="1999-W01-1T20,70-05">4 Jan 1999 at 8:42 pm</time>
<date when-iso="2006-05-18T10:03">a few minutes after ten in the morning on Thu 18 May</date>
<time when-iso="03:00">3 A.M.</time>
<time when-iso="14">around two</time>
<time when-iso="15,5">half past three</time>
All of the examples of the when attribute in the att.datable.w3c class are also valid with respect to this attribute.

He likes to be punctual. I said <q> <time when-iso="12">around noon</time> </q>, and he showed up at <time when-iso="12:00:00">12 O'clock</time> on the dot.
The second occurrence of <time> could have been encoded with the when attribute, as 12:00:00 is a valid time with respect to the W3C XML Schema Part 2: Datatypes Second Edition specification. The first occurrence could not.

notBefore-iso

specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.iso`

notAfter-iso

specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.iso`

from-iso

indicates the starting point of the period in standard form.

Status	Optional
Datatype	`data.temporal.iso`

to-iso

indicates the ending point of the period in standard form.

Status	Optional
Datatype	`data.temporal.iso`

Note

The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by ISO 8601, using the Gregorian calendar.

Note

If both when-iso and dur-iso are specified, the values should be interpreted as indicating a span of time by its starting time (or date) and duration. That is,

indicates the same time period as

In providing a ‘regularized’ form, no claim is made that the form in the source text is incorrect; the regularized form is simply that chosen as the main form for purposes of unifying variant forms under a single heading.

att.datable.w3c

att.datable.w3c provides attributes for normalization of elements that contain datable events conforming to the W3C XML Schema Part 2: Datatypes Second Edition. [3.5.4. Dates and Times 13.3.6. Dates and Times]

Module tei

Members att.datable [application change]

Attributes

when

supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.w3c`
Examples of W3C date, time, and date & time formats.
<p>
<date when="1945-10-24">24 Oct 45</date>
<date when="1996-09-24T07:25:00Z">September 24th, 1996 at 3:25 in the morning</date>
<time when="1999-01-04T20:42:00-05:00">Jan 4 1999 at 8 pm</time>
<time when="14:12:38">fourteen twelve and 38 seconds</time>
<date when="1962-10">October of 1962</date>
<date when="--06-12">June 12th</date>
<date when="---01">the first of the month</date>
<date when="--08">August</date>
<date when="2006">MMVI</date>
<date when="0056">AD 56</date>
<date when="-0056">56 BC</date>
</p>
This list begins in the year 1632, more precisely on Trinity Sunday, i.e. the Sunday after Pentecost, in that year the <date calendar="#Julian" when="1632-06-06">27th of May (old style)</date>.
<opener> <dateline> <placeName>Dorchester, Village,</placeName> <date when="1828-03-02">March 2d. 1828.</date> </dateline> <salute>To Mrs. Cornell,</salute> Sunday <time when="12:00:00">noon.</time> </opener>

notBefore

specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.w3c`

notAfter

specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.w3c`

from

indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.w3c`

to

indicates the ending point of the period in standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.w3c`

Example

<date from="1863-05-28" to="1863-06-01">28 May through 1 June 1863</date>

Note

The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by XML Schema Part 2: Datatypes Second Edition, using the Gregorian calendar.

The most commonly-encountered format for the date portion of a temporal attribute is yyyy-mm-dd, but yyyy, --mm, ---dd, yyyy-mm, or --mm-dd may also be used. For the time part, the form hh:mm:ss is used.

Note that this format does not currently permit use of the value 0000 to represent the year 1 BCE; instead the value -0001 should be used.
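
The lexical shapes listed in this note can be checked mechanically. The following is a minimal Python sketch, assuming only the date and time shapes named above; it ignores time zones and combined date & time values and does not check month or day ranges. The function name `looks_like_w3c_temporal` is hypothetical and not part of any TEI tooling.

```python
import re

# Date shapes named in the note: yyyy, yyyy-mm, yyyy-mm-dd, --mm, --mm-dd, ---dd.
# A leading "-" marks a BCE year; time zones and dateTime values are ignored here.
_DATE_PATTERNS = [
    r"-?\d{4}",                # yyyy
    r"-?\d{4}-\d{2}",          # yyyy-mm
    r"-?\d{4}-\d{2}-\d{2}",    # yyyy-mm-dd
    r"--\d{2}",                # --mm
    r"--\d{2}-\d{2}",          # --mm-dd
    r"---\d{2}",               # ---dd
]
_TIME_PATTERN = r"\d{2}:\d{2}:\d{2}"   # hh:mm:ss


def looks_like_w3c_temporal(value):
    """Return True if value matches one of the shapes listed in the note."""
    # The year 0000 is not permitted; -0001 stands for 1 BCE.
    if re.fullmatch(r"-?0000(-.*)?", value):
        return False
    if re.fullmatch(_TIME_PATTERN, value):
        return True
    return any(re.fullmatch(p, value) for p in _DATE_PATTERNS)
```

Under these assumptions, values such as `1945-10-24`, `--06-12` and `-0056` are accepted, while `0000` and the ISO-only value `1996-09-24T07:25+00` are rejected.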

att.declarable

att.declarable provides attributes for those elements in the TEI header which may be independently selected by means of the special purpose decls attribute. [15.3. Associating Contextual Information with a Text]

Module tei

Members projectDesc sourceDesc

Attributes

default

indicates whether or not this element is selected by default when its parent is selected.

Status	Optional
Datatype	`data.truthValue`
Legal values are:
true	This element is selected if its parent is selected.
false	This element can only be selected explicitly, unless it is the only one of its kind, in which case it is selected if its parent is selected. [Default]

Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text. Only one element of a particular type may have a default attribute with a value of true.

att.declaring

att.declaring provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. [15.3. Associating Contextual Information with a Text]

Module tei

Members p ref text

Attributes

decls

identifies one or more declarable elements within the header, which are understood to apply to the element bearing this attribute and its content.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace

Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text.

att.dimensions

att.dimensions provides attributes for describing the size of physical objects.

Module tei

Members att.editLike

Attributes

Attributes att.ranging (@atLeast, @atMost, @min, @max, @confidence)

unit

names the unit used for the measurement. Suggested values include: cm (centimetres), mm (millimetres), in (inches), lines, chars (characters).

Status	Optional
Datatype	`data.enumerated`
Suggested values include:
cm	(centimetres)
mm	(millimetres)
in	(inches)
lines	lines of text
chars	(characters) characters of text

quantity

specifies the length in the units specified

Status	Optional
Datatype	`data.numeric`

extent

indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.

Status	Optional
Datatype	`data.text`
<gap extent="5 words"/>
<height extent="half the page"/>

precision

characterizes the precision of the values specified by the other attributes.

Status	Optional
Datatype	`data.certainty`

scope

where the measurement summarizes more than one observation, specifies the applicability of this measurement. Sample values include: all, most, range.

Status	Optional
Datatype	`data.enumerated`
Sample values include:
all	measurement applies to all instances.
most	measurement applies to most of the instances inspected.
range	measurement applies to only the specified range of instances.

att.docStatus

att.docStatus provides attributes for use on metadata elements describing the status of a document.

Module tei

Members change

Attributes

status

describes the status of a document either currently or, when associated with a dated element, at the time indicated. Sample values include: approved, candidate, cleared, deprecated, draft, embargoed, expired, frozen, galley, proposed, published, recommendation, submitted, unfinished, withdrawn.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	approved, candidate, cleared, deprecated, draft [Default], embargoed, expired, frozen, galley, proposed, published, recommendation, submitted, unfinished, withdrawn

Example

att.fragmentable

att.fragmentable groups structural elements which may be fragmented, usually as a consequence of some overlapping hierarchy.

Module tei

Members p

Attributes

part

specifies whether or not its parent element is fragmented in some way, typically by some other overlapping structure: for example a speech which is divided between two or more verse stanzas, a paragraph which is split across a page division, a verse line which is divided between two speakers.

Status	Optional
Datatype	`data.enumerated`
Legal values are:
Y	(yes) the element is fragmented in some (unspecified) respect
N	(no) either the element is not fragmented, or no claim is made as to its completeness. [Default]
I	(initial) this is the initial part of a fragmented element
M	(medial) this is a medial part of a fragmented element
F	(final) this is the final part of a fragmented element
Note	The values I, M, or F should be used only where it is clear how the element may be reconstituted.

att.global

att.global provides attributes common to all elements in the TEI encoding scheme. [1.3.1.1. Global Attributes]

Module tei

Members TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text

Attributes

Attributes att.global.rendition (@rend, @style, @rendition) att.global.responsibility (@cert, @resp)

xml:id

(identifier) provides a unique identifier for the element bearing the attribute.

Status	Optional
Datatype	`xsd:ID`
Note	The xml:id attribute may be used to specify a canonical reference for an element; see section 3.10. Reference Systems.

n

(number) gives a number (or other label) for an element, which is not necessarily unique within the document.

Status	Optional
Datatype	`data.text`
Note	The value of this attribute is always understood to be a single token, even if it contains space or other punctuation characters, and need not be composed of numbers only. It is typically used to specify the numbering of chapters, sections, list items, etc.; it may also be used in the specification of a standard reference system for the text.

xml:lang

(language) indicates the language of the element content using a ‘tag’ generated according to BCP 47.

Status	Optional
Datatype	`data.language`
<p> … The consequences of this rapid depopulation were the loss of the last <foreign xml:lang="rap">ariki</foreign> or chief (Routledge 1920:205,210) and their connections to ancestral territorial organization.</p>
Note	The xml:lang value will be inherited from the immediately enclosing element, or from its parent, and so on up the document hierarchy. It is generally good practice to specify xml:lang at the highest appropriate level, noting that a different default may be needed for the teiHeader from that needed for the associated resource element or elements, and that a single TEI document may contain texts in many languages. The authoritative list of registered language subtags is maintained by IANA and is available at http://www.iana.org/assignments/language-subtag-registry. For a good general overview of the construction of language tags, see http://www.w3.org/International/articles/language-tags/, and for a practical step-by-step guide, see http://www.w3.org/International/questions/qa-choosing-language-tags. The value used must conform with BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a language element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their Internet Engineering Task Force (IETF) definitions.
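
The inheritance rule described in this note can be illustrated with a short Python sketch using the standard library's ElementTree. It assumes a small, namespace-free fragment for brevity (real TEI documents use the TEI namespace), and the helper name `effective_lang` is hypothetical.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def effective_lang(root, element):
    """Return the nearest xml:lang on element or its ancestors, else None."""
    # ElementTree elements have no parent pointer, so build a child -> parent map.
    parents = {child: parent for parent in root.iter() for child in parent}
    node = element
    while node is not None:
        if XML_LANG in node.attrib:
            return node.attrib[XML_LANG]
        node = parents.get(node)
    return None


doc = ET.fromstring(
    '<text xml:lang="en"><p>the last '
    '<foreign xml:lang="rap">ariki</foreign> or <label>chief</label></p></text>'
)
foreign = doc.find(".//foreign")
label = doc.find(".//label")
```

Here `effective_lang(doc, foreign)` yields the element's own value `rap`, while `effective_lang(doc, label)` falls back to the `en` declared on the enclosing `<text>`.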

xml:base

provides a base URI reference with which applications can resolve relative URI references into absolute URI references.

Status	Optional
Datatype	`data.pointer`
<div type="bibl"> <head>Bibliography</head> <listBibl xml:base="http://www.lib.ucdavis.edu/BWRP/Works/"> <bibl> <author> <name>Landon, Letitia Elizabeth</name> </author> <ref target="LandLVowOf.sgm"> <title>The Vow of the Peacock</title> </ref> </bibl> <bibl> <author> <name>Compton, Margaret Clephane</name> </author> <ref target="NortMIrene.sgm"> <title>Irene, a Poem in Six Cantos</title> </ref> </bibl> <bibl> <author> <name>Taylor, Jane</name> </author> <ref target="TaylJEssay.sgm"> <title>Essays in Rhyme on Morals and Manners</title> </ref> </bibl> </listBibl> </div>
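
As an illustration of how an application might resolve the relative target values in the example above against the xml:base of the enclosing <listBibl>, here is a minimal Python sketch using the standard `urllib.parse.urljoin` function. The variable names are illustrative only.

```python
from urllib.parse import urljoin

# Base URI taken from the xml:base attribute in the example above.
base = "http://www.lib.ucdavis.edu/BWRP/Works/"

# Relative URI references taken from the target attributes of the <ref> elements.
targets = ["LandLVowOf.sgm", "NortMIrene.sgm", "TaylJEssay.sgm"]

# Resolve each relative reference into an absolute URI reference.
resolved = [urljoin(base, t) for t in targets]
```

The first entry of `resolved` is `http://www.lib.ucdavis.edu/BWRP/Works/LandLVowOf.sgm`.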

xml:space

signals an intention about how white space should be managed by applications.

Status	Optional
Datatype	`data.enumerated`
Legal values are:
default	signals that the application's default white-space processing modes are acceptable
preserve	indicates the intent that applications preserve all white space
Note	The XML specification provides further guidance on the use of this attribute. Note that many parsers may not handle xml:space correctly.

att.global.rendition

att.global.rendition provides rendering attributes common to all elements in the TEI encoding scheme. [1.3.1.1.3. Rendition Indicators]

Module tei

Members att.global [TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text]

Attributes

rend

(rendition) indicates how the element in question was rendered or presented in the source text.

Status	Optional
Datatype	1–∞ occurrences of `data.word` separated by whitespace
<head rend="align(center) case(allcaps)"> <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her <lb/> <hi rend="case(mixed)">New Blazing-World</hi>. </head>
Note	These Guidelines make no binding recommendations for the values of the rend attribute; the characteristics of visual presentation vary too much from text to text and the decision to record or ignore individual characteristics varies too much from project to project. Some potentially useful conventions are noted from time to time at appropriate points in the Guidelines. The values of the rend attribute are a set of sequence-indeterminate individual tokens separated by whitespace.

style

contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text

Status	Optional
Datatype	`data.text`
<head style="text-align: center; font-variant: small-caps"> <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her <lb/> <hi style="font-variant: normal">New Blazing-World</hi>. </head>
Note	Unlike the attribute values of rend, which use whitespace as a separator, the style attribute may contain whitespace. This attribute is intended for recording inline stylistic information concerning the source, not any particular output. The formal language in which values for this attribute are expressed may be specified using the <styleDefDecl> element in the TEI header.

rendition

points to a description of the rendering or presentation used for this element in the source text.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace
<head rendition="#ac #sc"> <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her <lb/> <hi rendition="#normal">New Blazing-World</hi>. </head> <!-- elsewhere... --> <rendition scheme="css" xml:id="sc">font-variant: small-caps</rendition> <rendition scheme="css" xml:id="normal">font-variant: normal</rendition> <rendition scheme="css" xml:id="ac">text-align: center</rendition>
Note	The rendition attribute is used in a very similar way to the class attribute defined for XHTML but with the important distinction that its function is to describe the appearance of the source text, not necessarily to determine how that text should be presented on screen or paper. Where both rendition and rend are supplied, the latter is understood to override or complement the former. Each URI provided should indicate a <rendition> element defining the intended rendition in terms of some appropriate style language, as indicated by the scheme attribute.
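
To show how the pointers in a rendition value might be followed to their <rendition> definitions, here is a minimal Python sketch. It assumes a namespace-free fragment and handles only same-document pointers of the form `#id`; the helper name `resolve_rendition` is hypothetical.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:id under the reserved XML namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def resolve_rendition(root, element):
    """Map each local pointer in @rendition to the text of its <rendition>."""
    by_id = {el.attrib[XML_ID]: el for el in root.iter() if XML_ID in el.attrib}
    styles = []
    for pointer in element.get("rendition", "").split():
        # Only same-document pointers ("#id") are handled in this sketch.
        if pointer.startswith("#") and pointer[1:] in by_id:
            styles.append(by_id[pointer[1:]].text)
    return styles


doc = ET.fromstring(
    '<TEI><head rendition="#ac #sc">To The Duchesse</head>'
    '<rendition scheme="css" xml:id="sc">font-variant: small-caps</rendition>'
    '<rendition scheme="css" xml:id="ac">text-align: center</rendition></TEI>'
)
styles = resolve_rendition(doc, doc.find("head"))
```

The result lists the style definitions in the order in which the pointers appear in the attribute value.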

att.global.responsibility

att.global.responsibility provides attributes indicating the agency responsible for some aspect of the text, the markup or something asserted by the markup, and the degree of certainty associated with it. [3.4. Simple Editorial Changes 11.3.2.2. Hand, Responsibility, and Certainty Attributes 17.3. Spans and Interpretations 13.1.1. Linking Names and Their Referents]

Module tei

Members att.global [TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text]

Attributes

cert

(certainty) signifies the degree of certainty associated with the intervention or interpretation.

Status	Optional
Datatype	`data.certainty`

resp

(responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace
Note	Note that a simple resp pointing to a person or organization is likely to be somewhat ambiguous with regard to the nature of the responsibility. For this reason, we recommend that resp be used to point not to an agent (<person> or <org>) but to a <respStmt>, author, editor or similar element which clarifies the exact role played by the agent. Pointing to multiple <respStmt>s allows the encoder to specify clearly each of the roles played in part of a TEI file (creating, transcribing, encoding, editing, proofing etc.).

Example

Blessed are the
<choice>
<sic>placemakers</sic>
<corr cert="high" resp="#editor">peacemakers</corr>
</choice>: for they shall be called the children of God.

Example

<lg>

<l>Punkes, Panders, baſe extortionizing
 sla<choice>
 <sic>n</sic>
 <corr resp="#JENS1_transcriber">u</corr>
 </choice>es,</l>

</lg>


<respStmt xml:id="JENS1_transcriber">
<resp when="2014">Transcriber</resp>
<name>Janelle Jenstad</name>
</respStmt>

att.internetMedia

att.internetMedia provides attributes for specifying the type of a computer resource using a standard taxonomy.

Module tei

Members ref

Attributes

mimeType

(MIME media type) specifies the applicable Multipurpose Internet Mail Extensions (MIME) media type.

Status	Optional
Datatype	1–∞ occurrences of `data.word` separated by whitespace

Example

In this example mimeType is used to indicate that the URL points to a TEI XML file encoded in UTF-8.

Note

This attribute class provides an attribute for describing a computer resource, typically available over the internet, using a value taken from a standard taxonomy. At present only a single taxonomy is supported, the Multipurpose Internet Mail Extensions (MIME) Media Type system. This typology of media types is defined by the Internet Engineering Task Force in RFC 2046. The list of types is maintained by the Internet Assigned Numbers Authority (IANA). The mimeType attribute must have a value taken from this list.

att.naming

att.naming provides attributes common to elements which refer to named persons, places, organizations etc. [3.5.1. Referring Strings 13.3.5. Names and Nyms]

Module tei

Members att.personal

Attributes

Attributes att.canonical (@key, @ref)

role

Status	Optional
Datatype	1–∞ occurrences of `data.enumerated` separated by whitespace

nymRef

(reference to the canonical name) provides a means of locating the canonical form (nym) of the names associated with the object named by the element bearing it.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace
Note	The value must point directly to one or more XML elements by means of one or more URIs, separated by whitespace. If more than one is supplied, the implication is that the name is associated with several distinct canonical names.

att.placement

att.placement provides attributes for describing where on the source page or object a textual element appears. [3.4.3. Additions, Deletions, and Omissions 11.3.1.4. Additions and Deletions]

Module tei

Members label

Attributes

place

specifies where this item is placed. Suggested values include: below, bottom, margin, top, opposite, overleaf, above, end, inline, inspace.

Status	Recommended
Datatype	1–∞ occurrences of `data.enumerated` separated by whitespace
Suggested values include:
below	below the line
bottom	at the foot of the page
margin	in the margin (left, right, or both)
top	at the top of the page
opposite	on the opposite, i.e. facing, page
overleaf	on the other side of the leaf
above	above the line
end	at the end of e.g. chapter or volume.
inline	within the body of the text.
inspace	in a predefined space, for example left by an earlier scribe.
<add place="margin">[An addition written in the margin]</add> <add place="bottom opposite">[An addition written at the foot of the current page and also on the facing page]</add>
<note place="bottom">Ibid, p.7</note>

att.pointing

att.pointing defines a set of attributes used by all elements which point to other elements by means of one or more URI references. [1.3.1.1.2. Language Indicators 3.6. Simple Links and Cross-References]

Module tei

Members ref

Attributes

targetLang

specifies the language of the content to be found at the destination referenced by target, using a ‘language tag’ generated according to BCP 47.

Status	Optional
Datatype	`data.language`
Schematron	<sch:rule context="tei:*[not(self::tei:schemaSpec)][@targetLang]"> <sch:assert test="count(@target)">@targetLang can only be used if @target is specified.</sch:assert></sch:rule>
<linkGrp xml:id="pol-swh_aln_2.1-linkGrp">
<ptr target="pol/UDHR/text.xml#pol_txt_1-head" targetLang="pl" type="tuv" xml:id="pol-swh_aln_2.1.1-ptr"/>
<ptr target="swh/UDHR/text.xml#swh_txt_1-head" targetLang="sw" type="tuv" xml:id="pol-swh_aln_2.1.2-ptr"/>
</linkGrp>
In the example above, the <linkGrp> combines pointers at parallel fragments of the Universal Declaration of Human Rights: one of them is in Polish, the other in Swahili.
Note	The value must conform to BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a language element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their Internet Engineering Task Force (IETF) definitions.

target

specifies the destination of the reference by supplying one or more URI References

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace
Note	One or more syntactically valid URI references, separated by whitespace. Because whitespace is used to separate URIs, no whitespace is permitted inside a single URI. If a whitespace character is required in a URI, it should be escaped with the normal mechanism, e.g. `TEI%20Consortium`.
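
The whitespace rule in this note can be sketched in a few lines of Python. The helper names `split_targets` and `escape_spaces` are hypothetical; the sketch only splits on whitespace and escapes literal spaces, and does not validate the URI references themselves.

```python
def split_targets(value):
    """Split a target value into its whitespace-separated URI references."""
    # str.split() with no argument splits on any run of whitespace.
    return value.split()


def escape_spaces(uri):
    """Percent-encode spaces so that a URI survives whitespace splitting."""
    return uri.replace(" ", "%20")
```

For example, `escape_spaces("TEI Consortium")` gives `TEI%20Consortium`, which `split_targets` then treats as a single reference.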

evaluate

specifies the intended meaning when the target of a pointer is itself a pointer.

Status	Optional
Datatype	`data.enumerated`
Legal values are:
all	if the element pointed to is itself a pointer, then the target of that pointer will be taken, and so on, until an element is found which is not a pointer.
one	if the element pointed to is itself a pointer, then its target (whether a pointer or not) is taken as the target of this pointer.
none	no further evaluation of targets is carried out beyond that needed to find the element specified in the pointer's target.
Note	If no value is given, the application program is responsible for deciding (possibly on the basis of user input) how far to trace a chain of pointers.
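
The three legal values can be read as three ways of walking a chain of pointers. The following Python sketch models that reading under a simplifying assumption: pointers are represented as a plain mapping from a pointer's identifier to the identifier it targets, and anything that is not a key of the mapping counts as an ordinary element. The function name `resolve` and the data model are hypothetical.

```python
def resolve(targets, start, evaluate="all"):
    """Follow a pointer chain according to the evaluate attribute.

    targets maps a pointer's id to the id it points at; an id that is
    not a key of the mapping is treated as an ordinary (non-pointer) element.
    """
    current = targets[start]          # the element named in start's own target
    if evaluate == "none":
        return current
    if evaluate == "one":
        # Take one further step if the target is itself a pointer.
        return targets.get(current, current)
    if evaluate == "all":
        seen = {start}
        while current in targets:
            if current in seen:       # guard against circular chains
                raise ValueError("circular pointer chain")
            seen.add(current)
            current = targets[current]
        return current
    raise ValueError(f"unknown evaluate value: {evaluate!r}")


chain = {"p1": "p2", "p2": "p3", "p3": "para"}
```

With this chain, starting from `p1`, the value `none` stops at `p2`, `one` stops at `p3`, and `all` continues to `para`. The check for circular chains is an addition of this sketch, not something the attribute definition prescribes.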

att.ranging

att.ranging provides attributes for describing numerical ranges.

Module tei

Members att.dimensions [att.editLike]

Attributes

atLeast

gives a minimum estimated value for the approximate measurement.

Status	Optional
Datatype	`data.numeric`

atMost

gives a maximum estimated value for the approximate measurement.

Status	Optional
Datatype	`data.numeric`

min

where the measurement summarizes more than one observation or a range, supplies the minimum value observed.

Status	Optional
Datatype	`data.numeric`

max

where the measurement summarizes more than one observation or a range, supplies the maximum value observed.

Status	Optional
Datatype	`data.numeric`

confidence

specifies the degree of statistical confidence (between zero and one) that a value falls within the range specified by min and max, or the proportion of observed values that fall within that range.

Status	Optional
Datatype	`data.probability`

Example

The MS. was lost in transmission by mail from <del rend="overstrike">
<gap atLeast="1" atMost="2"
extent="one or two letters" reason="illegible" unit="chars"/>
</del> Philadelphia to the Graphic office, New York.

att.sortable

att.sortable provides attributes for elements in lists or groups that are sortable, but whose sorting key cannot be derived mechanically from the element content. [9.1. Dictionary Body and Overall Structure]

Module tei

Members idno

Attributes

sortKey

supplies the sort key for this element in an index, list or group which contains it.

Status	Optional
Datatype	`data.word`
David's other principal backer, Josiah ha-Kohen <index indexName="NAMES"> <term sortKey="Azarya_Josiah_Kohen">Josiah ha-Kohen b. Azarya</term> </index> b. Azarya, son of one of the last gaons of Sura was David's own first cousin.
Note	The sort key is used to determine the sequence and grouping of entries in an index. It provides a sequence of characters which, when sorted with the other values, will produce the desired order; specifics of sort key construction are application-dependent. Dictionary order often differs from the collation sequence of machine-readable character sets; in English-language dictionaries, an entry for 4-H will often appear alphabetized under ‘fourh’, and McCoy may be alphabetized under ‘maccoy’, while A1, A4, and A5 may all appear in numeric order ‘alphabetized’ between ‘a-’ and ‘AA’. The sort key is required if the orthography of the dictionary entry does not suffice to determine its location.
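
The effect of an explicit sort key can be illustrated with a small Python sketch. The entries and the fallback rule (lower-cased text when no key is given) are assumptions made for this illustration; as the note says, the specifics of sort key construction are application-dependent.

```python
# Hypothetical index entries; sortKey is None where no key was supplied.
entries = [
    {"text": "McCoy", "sortKey": "maccoy"},
    {"text": "Madison", "sortKey": None},
    {"text": "4-H", "sortKey": "fourh"},
    {"text": "Ford", "sortKey": None},
]


def sort_value(entry):
    """Use the explicit sort key when present, else the lower-cased text."""
    return entry["sortKey"] or entry["text"].lower()


# Order using the sort keys, and order by the plain text for comparison.
with_keys = [e["text"] for e in sorted(entries, key=sort_value)]
plain = [e["text"] for e in sorted(entries, key=lambda e: e["text"].lower())]
```

With the keys, 4-H sorts under "fourh" after Ford and McCoy sorts under "maccoy" before Madison; by plain text, 4-H comes first and McCoy comes last.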

att.source

att.source provides attributes for pointing to the source of a bibliographic reference. [3.3.3. Quotation 8.3.4. Writing]

Module tei

Members att.editLike

Attributes

source

provides a pointer to the bibliographical source from which a quotation or citation is drawn.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer` separated by whitespace

Example

As Willard McCarty (<bibl xml:id="mcc_2012">2012, p.2</bibl>)
tells us, <quote source="#mcc_2012">‘Collaboration’ is a
problematic and should be a contested term.</quote>

Example

<quote source="#chicago_15_ed">Grammatical theories
are in flux, and the more we learn, the less we
seem to know.</quote>



<bibl xml:id="chicago_15_ed">
<title level="m">The Chicago Manual of Style</title>,
<edition>15th edition</edition>.
<pubPlace>Chicago</pubPlace>:
<publisher>University of Chicago Press</publisher>
(<date>2003</date>),
<biblScope unit="page">p.147</biblScope>.

</bibl>
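
The following Python sketch shows one possible way to resolve the pointers in a source attribute to the elements they reference. It assumes that all pointers are local fragment references of the form `#id`, omits the TEI namespace for brevity, and uses a hypothetical helper name `resolve_sources`; it is not part of the LAUDATIO tooling.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:id under the predefined XML namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

SAMPLE = """<p>
  <quote source="#chicago_15_ed">Grammatical theories are in flux, and the
  more we learn, the less we seem to know.</quote>
  <bibl xml:id="chicago_15_ed">
    <title level="m">The Chicago Manual of Style</title>,
    <edition>15th edition</edition>.
  </bibl>
</p>"""

def resolve_sources(root):
    """Map every local pointer found in a source attribute to its target element."""
    by_id = {el.get(XML_ID): el for el in root.iter() if el.get(XML_ID)}
    resolved = {}
    for el in root.iter():
        source = el.get("source")
        if source is None:
            continue
        # The attribute may hold several pointers separated by whitespace.
        for pointer in source.split():
            if pointer.startswith("#"):
                resolved[pointer] = by_id.get(pointer[1:])
    return resolved

targets = resolve_sources(ET.fromstring(SAMPLE))
print(targets["#chicago_15_ed"].find("title").text)
```

Pointers to external resources would need a separate retrieval step, which this sketch deliberately leaves out.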

att.typed

att.typed provides attributes which can be used to classify or subclassify elements in any way. [1.3.1. Attribute Classes 17.1.1. Words and Above 3.5.1. Referring Strings 3.6. Simple Links and Cross-References 3.5.5. Abbreviations and Their Expansions 3.12.1. Core Tags for Verse 7.2.5. Speech Contents 4.1.1. Un-numbered Divisions 4.1.2. Numbered Divisions 4.2.1. Headings and Trailers 4.4. Virtual Divisions 13.3.2.3. Personal Relationships 11.3.1.1. Core Elements for Transcriptional Work 16.1.1. Pointers and Links 16.3. Blocks, Segments, and Anchors 12.2. Linking the Apparatus to the Text 22.4.4.2. RELAX NG Content Models 8.3. Elements Unique to Spoken Texts 23.3.1.4. Modification of Attribute and Attribute Value Lists]

Module tei

Members application change label ref text

Attributes

type

characterizes the element in some sense, using any convenient classification scheme or typology.

Status	Optional
Datatype	`data.enumerated`
<div type="verse"> <head>Night in Tarras</head> <lg type="stanza"> <l>At evening tramping on the hot white road</l> <l>…</l> </lg> <lg type="stanza"> <l>A wind sprang up from nowhere as the sky</l> <l>…</l> </lg> </div>
Note	The type attribute is present on a number of elements, not all of which are members of att.typed, usually because these elements restrict the possible values for the attribute in a specific way.

subtype

provides a sub-categorization of the element, if needed

Status	Optional
Datatype	`data.enumerated`
Note	The subtype attribute may be used to provide any sub-classification for the element additional to that provided by its type attribute.

Schematron

<sch:rule context="*[@subtype]">
<sch:assert test="@type">The <sch:name/> element should not be categorized in detail with @subtype
unless also categorized in general with @type</sch:assert></sch:rule>
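
For readers without a Schematron processor, the same constraint can be approximated in a few lines of Python. This is only a sketch: the function name `subtype_without_type` and the attribute values `quatrain` and `couplet` are hypothetical, and the check ignores namespaces.

```python
import xml.etree.ElementTree as ET

def subtype_without_type(root):
    """Return the names of elements that carry @subtype but no @type."""
    return [
        el.tag
        for el in root.iter()
        if "subtype" in el.attrib and "type" not in el.attrib
    ]

sample = ET.fromstring(
    '<div type="verse">'
    '<lg type="stanza" subtype="quatrain"/>'
    '<lg subtype="couplet"/>'
    "</div>"
)
violations = subtype_without_type(sample)
print(violations)
```

Only the second `lg` is reported, because it is sub-classified without a general classification.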

Note

When appropriate, values from an established typology should be used. Alternatively a typology may be defined in the associated TEI header. If values are to be taken from a project-specific list, this should be defined using the <valList> element in the project-specific schema description, as described in 23.3.1.4. Modification of Attribute and Attribute Value Lists.

Schema teiODD_LAUDATIOCorpus_S7: Macros

data.certainty

data.certainty defines the range of attribute values expressing a degree of certainty.
Module	tei
Used by
Declaration	data.certainty = "high" | "medium" | "low" | "unknown"
Note	Certainty may be expressed by one of the predefined symbolic values high, medium, or low. The value unknown should be used in cases where the encoder does not wish to assert an opinion about the matter. For more precise indication, data.probability may be used instead or in addition.

data.count

data.count defines the range of attribute values used for a non-negative integer value used as a count.
Module	tei
Used by	Element: tagUsage/@occurs tagUsage/@withId
Declaration	data.count = xsd:nonNegativeInteger
Note	Only non-negative integer values (zero and positive integers) are permitted.

data.duration.iso

data.duration.iso defines the range of attribute values available for representation of a duration in time using ISO 8601 standard formats
Module	tei
Used by
Declaration	data.duration.iso = token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Example	<time dur-iso="PT0,75H">three-quarters of an hour</time>
Example	<date dur-iso="P1,5D">a day and a half</date>
Example	<date dur-iso="P14D">a fortnight</date>
Example	<time dur-iso="PT0.02S">20 ms</time>
Note	A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the last, which may have a decimal component (using either `.` or `,` as the decimal point; the latter is preferred). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator `T` must precede the first ‘time’ number-letter pair. For complete details, see ISO 8601 Data elements and interchange formats — Information interchange — Representation of dates and times.
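
The declared pattern can be tried out directly with Python's `re` module. The sketch below is an assumption about how one might test values, not a validator shipped with the schema. It also shows that the pattern only restricts the characters used: a string such as `PPP` passes although it is not a meaningful ISO 8601 duration.

```python
import re

# Character-class pattern copied from the data.duration.iso declaration.
DURATION_ISO = re.compile(r"[0-9.,DHMPRSTWYZ/:+\-]+")

def is_duration_iso(value):
    """True if the whole value consists only of the permitted characters."""
    return DURATION_ISO.fullmatch(value) is not None

for candidate in ["PT0,75H", "P1,5D", "P14D", "PT0.02S", "three hours", "PPP"]:
    print(candidate, is_duration_iso(candidate))
```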

data.duration.w3c

data.duration.w3c defines the range of attribute values available for representation of a duration in time using W3C datatypes.
Module	tei
Used by
Declaration	data.duration.w3c = xsd:duration
Example	<time dur="PT45M">forty-five minutes</time>
Example	<date dur="P1DT12H">a day and a half</date>
Example	<date dur="P7D">a week</date>
Example	<time dur="PT0.02S">20 ms</time>
Note	A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the `S` number, which may have a decimal component (using `.` as the decimal point). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator `T` must precede the first ‘time’ number-letter pair. For complete details, see the W3C specification.
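
The number-letter structure described in the note can be made explicit with a small parser. The following Python sketch is a simplification under stated assumptions: it ignores the optional leading minus sign that xsd:duration permits, returns the components as strings, and uses the hypothetical function name `parse_duration`.

```python
import re

# One optional number-letter pair per unit, in the fixed order Y M D (T) H M S.
_DURATION = re.compile(
    r"P"
    r"(?:(?P<years>\d+)Y)?"
    r"(?:(?P<months>\d+)M)?"
    r"(?:(?P<days>\d+)D)?"
    r"(?:T"
    r"(?:(?P<hours>\d+)H)?"
    r"(?:(?P<minutes>\d+)M)?"
    r"(?:(?P<seconds>\d+(?:\.\d+)?)S)?"
    r")?"
)

def parse_duration(value):
    """Return the components of a duration as a dict, or None if it is malformed."""
    match = _DURATION.fullmatch(value)
    if match is None:
        return None
    parts = {name: text for name, text in match.groupdict().items() if text is not None}
    # Reject a bare "P" and a dangling "T" with no time component after it.
    if not parts or value.endswith("T"):
        return None
    return parts

print(parse_duration("PT45M"))
print(parse_duration("P1DT12H"))
print(parse_duration("P1,5D"))
```

The last call returns `None` because a comma is not accepted as a decimal point here, and a fraction is only allowed on the seconds component.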

data.enumerated

data.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities.
Module	tei
Used by	Element: author/@role availability/@status editor/@role extent/@type idno/@type list/@type teiHeader/@type title/@type
Declaration	data.enumerated = data.word
Note	Attributes using this datatype must contain a single word matching the pattern defined for this datatype: for example it cannot include whitespace but may begin with digits. Typically, the list of documented possibilities will be provided (or exemplified) by a value list in the associated attribute specification, expressed with a <valList> element.

data.language

data.language defines the range of attribute values used to identify a particular combination of human language and writing system. [6.1. Language Identification]
Module	tei
Used by	Element: language/@ident
Declaration	data.language = xsd:language | ""
Note	The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 4646 and RFC 4647; over time, other IETF documents may succeed these as the best current practice.

A ‘language tag’, per BCP 47, is assembled from a sequence of components or subtags separated by the hyphen character (-, U+002D). The tag is made of the following subtags, in the following order. Every subtag except the first is optional. If present, each occurs only once, except the fourth and fifth components (variant and extension), which are repeatable.

language
The IANA-registered code for the language. This is almost always the same as the ISO 639 2-letter language code if there is one. The list of available registered language subtags can be found at http://www.iana.org/assignments/language-subtag-registry. It is recommended that this code be written in lower case.

script
The ISO 15924 code for the script. These codes consist of 4 letters, and it is recommended they be written with an initial capital, the other three letters in lower case. The canonical list of codes is maintained by the Unicode Consortium, and is available at http://unicode.org/iso15924/iso15924-codes.html. The IETF recommends this code be omitted unless it is necessary to make a distinction you need.

region
Either an ISO 3166 country code or a UN M.49 region code that is registered with IANA (not all such codes are registered, e.g. UN codes for economic groupings or codes for countries for which there is already an ISO 3166 2-letter code are not registered). The former consist of 2 letters, and it is recommended they be written in upper case. The list of codes can be found at http://www.iso.org/iso/home/standards/country_codes/iso-3166-1_decoding_table.htm. The latter consist of 3 digits; the list of codes can be found at http://unstats.un.org/unsd/methods/m49/m49.htm.

variant
An IANA-registered variation. These codes are used to indicate additional, well-recognized variations that define a language or its dialects that are not covered by other available subtags.

extension
An extension has the format of a single letter followed by a hyphen followed by additional subtags. These exist to allow for future extension to BCP 47, but as of this writing no such extensions are in use.

private use
An extension that uses the initial subtag of the single letter x (i.e., starts with `x-`) has no meaning except as negotiated among the parties involved. These should be used with great care, since they interfere with the interoperability that use of RFC 4646 is intended to promote. In order for a document that makes use of these subtags to be TEI-conformant, a corresponding language element must be present in the TEI header.

There are two exceptions to the above format. First, there are language tags in the IANA registry that do not match the above syntax, but are present because they have been ‘grandfathered’ from previous specifications. Second, an entire language tag can consist of only a private use subtag. These tags start with `x-`, and do not need to follow any further rules established by the IETF and endorsed by these Guidelines. Like all language tags that make use of private use subtags, the language in question must be documented in a corresponding language element in the TEI header.

Examples include

sn: Shona
zh-TW: Taiwanese
zh-Hant-HK: Chinese written in traditional script as used in Hong Kong
en-SL: English as spoken in Sierra Leone
pl: Polish
es-MX: Spanish as spoken in Mexico
es-419: Spanish as spoken in Latin America

The W3C Internationalization Activity has published a useful introduction to BCP 47, Language tags in HTML and XML.
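
To make the subtag order concrete, the following Python sketch splits a tag into language, script, region, variant, extension and private-use parts. It is a deliberately simplified illustration: it does not consult the IANA registry, ignores extended language subtags and ‘grandfathered’ tags, and the function name `split_language_tag` is hypothetical.

```python
import re

def split_language_tag(tag):
    """Split a language tag into subtags by position and shape (simplified)."""
    parts = tag.split("-")
    result = {"language": None, "script": None, "region": None,
              "variants": [], "extensions": [], "private_use": None}
    i = 0
    if parts[0].lower() != "x":
        if not re.fullmatch(r"[A-Za-z]{2,8}", parts[0]):
            raise ValueError(f"invalid language subtag in {tag!r}")
        result["language"] = parts[0].lower()      # lower case recommended
        i = 1
        if i < len(parts) and re.fullmatch(r"[A-Za-z]{4}", parts[i]):
            result["script"] = parts[i].title()    # initial capital recommended
            i += 1
        if i < len(parts) and re.fullmatch(r"[A-Za-z]{2}|[0-9]{3}", parts[i]):
            result["region"] = parts[i].upper()    # upper case recommended
            i += 1
        while i < len(parts) and re.fullmatch(
                r"[A-Za-z0-9]{5,8}|[0-9][A-Za-z0-9]{3}", parts[i]):
            result["variants"].append(parts[i].lower())
            i += 1
        # Extensions start with a single letter or digit other than x.
        while i < len(parts) and re.fullmatch(r"[A-WY-Za-wy-z0-9]", parts[i]):
            start = i
            i += 1
            while i < len(parts) and re.fullmatch(r"[A-Za-z0-9]{2,8}", parts[i]):
                i += 1
            result["extensions"].append("-".join(parts[start:i]))
    if i < len(parts) and parts[i].lower() == "x":
        result["private_use"] = "-".join(parts[i:])
        i = len(parts)
    if i != len(parts):
        raise ValueError(f"unexpected subtag in {tag!r}")
    return result

print(split_language_tag("zh-Hant-HK"))
print(split_language_tag("es-419"))
```

A production system would more likely rely on a maintained BCP 47 library and the current subtag registry than on shape-based matching like this.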

data.name

data.name defines the range of attribute values expressed as an XML Name.
Module	tei
Used by	Element: application/@ident
Declaration	data.name = xsd:Name
Note	Attributes using this datatype must contain a single word which follows the rules defining a legal XML name (see http://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits.

data.numeric

data.numeric defines the range of attribute values used for numeric values.
Module	tei
Used by
Declaration	data.numeric = xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal
Note	Any numeric value, represented as a decimal number, in floating point format, or as a ratio. To represent a floating point number, expressed in scientific notation, ‘E notation’, a variant of ‘exponential notation’, may be used. In this format, the value is expressed as two numbers separated by the letter E. The first number, the significand (sometimes called the mantissa) is given in decimal format, while the second is an integer. The value is obtained by multiplying the mantissa by 10 the number of times indicated by the integer. Thus the value represented in decimal notation as 1000.0 might be represented in scientific notation as 1E3. A value expressed as a ratio is represented by two integer values separated by a solidus (/) character. Thus, the value represented in decimal notation as 0.5 might be represented as a ratio by the string 1/2.
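
The three notations can be read with Python's standard library, as the sketch below suggests. The function name `parse_numeric` is hypothetical, and Python's `float` is somewhat more lenient than xsd:double (for instance, it tolerates surrounding whitespace), so this is an illustration rather than a validator.

```python
import re
from fractions import Fraction

# Ratio form from the declaration: two integers separated by a solidus.
_RATIO = re.compile(r"-?\d+/-?\d+")

def parse_numeric(value):
    """Return a Fraction for ratio notation, otherwise a float."""
    if _RATIO.fullmatch(value):
        numerator, denominator = value.split("/")
        return Fraction(int(numerator), int(denominator))
    return float(value)  # handles decimal and E notation

print(parse_numeric("1/2"))
print(parse_numeric("1E3"))
print(parse_numeric("10E3"))
```

Here `1E3` evaluates to 1000.0 and `10E3` to 10000.0, since the significand is multiplied by ten once for each unit of the exponent.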

data.outputMeasurement

data.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display.
Module	tei
Used by
Declaration	data.outputMeasurement = token { pattern = "[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)" }
Example	<figure> <head>The TEI Logo</head> <figDesc>Stylized yellow angle brackets with the letters <mentioned>TEI</mentioned> in between and <mentioned>text encoding initiative</mentioned> underneath, all on a white background.</figDesc> <graphic height="600px" url="http://www.tei-c.org/logos/TEI-600.jpg" width="600px"/> </figure>
Note	These values map directly onto the values used by XSL-FO and CSS. For definitions of the units see those specifications; at the time of this writing the most complete list is in the CSS3 working draft.
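
A quick way to see which values the declared pattern accepts is to run it through Python's `re` module. The sketch below assumes the alternation bars in the pattern are plain `|` characters; the function name `is_output_measurement` is hypothetical.

```python
import re

# Optional sign, a number with optional fraction, and a mandatory unit.
OUTPUT_MEASUREMENT = re.compile(
    r"[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)"
)

def is_output_measurement(value):
    """True if the whole value is a number immediately followed by a known unit."""
    return OUTPUT_MEASUREMENT.fullmatch(value) is not None

for candidate in ["600px", "1.5em", "-2cm", "50%", "600", "6 px", ".5em"]:
    print(candidate, is_output_measurement(candidate))
```

The last three candidates are rejected: a unit is mandatory, no whitespace may separate number and unit, and at least one digit must precede the decimal point.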

data.pattern

data.pattern (regular expression pattern) defines attribute values which are expressed as a regular expression.
Module	tei
Used by
Declaration	data.pattern = token
Note	A regular expression, often called a pattern, is an expression that describes a set of strings. Regular expressions are usually used to give a concise description of a set, without having to list all its elements. For example, the set containing the three strings Handel, Händel, and Haendel can be described by the pattern `H(ä|ae?)ndel` (or alternatively, it is said that the pattern `H(ä|ae?)ndel` matches each of the three strings). (Source: Wikipedia)
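
The Handel pattern from the note, written with an unescaped alternation bar, can be checked in Python as follows. The extra name Hendel is a hypothetical counter-example added for contrast.

```python
import re

pattern = re.compile(r"H(ä|ae?)ndel")

names = ["Handel", "Händel", "Haendel", "Hendel"]
matches = {name: pattern.fullmatch(name) is not None for name in names}
print(matches)
```

The first three names match; Hendel does not, because the group requires either `ä` or an `a` optionally followed by `e`.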

data.pointer

data.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere.
Module	tei
Used by	Element: change/@target tagUsage/@render
Declaration	data.pointer = xsd:anyURI
Note	The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax. Note that the values themselves are encoded using RFC 3987 Internationalized Resource Identifiers (IRIs) mapping to URIs. For example, `https://secure.wikimedia.org/wikipedia/en/wiki/%` is encoded as `https://secure.wikimedia.org/wikipedia/en/wiki/%25` while `http://موقع.وزارة-الاتصالات.مصر/` is encoded as `http://xn--4gbrim.xn----rmckbbajlc6dj7bxne2c.xn--wgbh1c/`

data.probability

data.probability defines the range of attribute values expressing a probability.
Module	tei
Used by
Declaration	data.probability = xsd:double { minInclusive = "0" maxInclusive = "1" }
Note	Probability is expressed as a real number between 0 and 1; 0 representing certainly false and 1 representing certainly true.

data.replacement

data.replacement defines attribute values which contain a replacement template.
Module	tei
Used by
Declaration	data.replacement = text

data.temporal.iso

data.temporal.iso defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the international standard Data elements and interchange formats – Information interchange – Representation of dates and times.
Module	tei
Used by	Element: date/@when
Declaration	data.temporal.iso = xsd:date | xsd:gYear | xsd:gMonth | xsd:gDay | xsd:gYearMonth | xsd:gMonthDay | xsd:time | xsd:dateTime | token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Note	If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used. For all representations for which ISO 8601 describes both a basic and an extended format, these Guidelines recommend use of the extended format. While ISO 8601 permits the use of both `00:00` and `24:00` to represent midnight, these Guidelines strongly recommend against the use of `24:00`.

data.temporal.w3c

data.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes Second Edition specification.
Module	tei
Used by
Declaration	data.temporal.w3c = xsd:date | xsd:gYear | xsd:gMonth | xsd:gDay | xsd:gYearMonth | xsd:gMonthDay | xsd:time | xsd:dateTime
Note	If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used.
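
The advice about time zone indicators can be illustrated with Python's `datetime` module. The timestamps below are hypothetical, and the helper `comparable` exists only for this sketch.

```python
from datetime import datetime

# Two dateTime values with explicit offsets that denote the same instant.
berlin = datetime.fromisoformat("2015-06-08T12:00:00+02:00")
utc = datetime.fromisoformat("2015-06-08T10:00:00+00:00")

# The same wall-clock time without a time zone indicator.
naive = datetime.fromisoformat("2015-06-08T12:00:00")

def comparable(a, b):
    """True if the two values can be ordered relative to each other."""
    try:
        a < b
        return True
    except TypeError:
        return False

print(berlin == utc)
print(comparable(berlin, utc))
print(comparable(berlin, naive))
```

Values with offsets compare by instant, while ordering a value with an offset against one without raises an error in Python, which is the kind of ambiguity the note warns about.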

data.text

data.text defines the range of attribute values used to express some kind of identifying string as a single sequence of Unicode characters possibly including whitespace.
Module	tei
Used by	Element: author/@n editor/@n item/@n language/@style namespace/@name tagUsage/@gi
Declaration	data.text = string
Note	Attributes using this datatype must contain a single ‘token’ in which whitespace and other punctuation characters are permitted.

data.truthValue

data.truthValue defines the range of attribute values used to express a truth value.
Module	tei
Used by	Element: tagsDecl/@partial
Declaration	data.truthValue = xsd:boolean
Note	The possible values of this datatype are 1 or true, or 0 or false. This datatype applies only for cases where uncertainty is inappropriate; if the attribute concerned may have a value other than true or false, e.g. unknown, or inapplicable, it should have the extended version of this datatype: data.xTruthValue.

data.version

data.version defines the range of attribute values which may be used to specify a TEI or Unicode version number.
Module	tei
Used by	Element: TEI/@version
Declaration	data.version = token { pattern = "[\d]+(\.[\d]+){0,2}" }
Note	The value of this attribute follows the pattern specified by the Unicode consortium for its version number (http://unicode.org/versions/). A version number contains digits and fullstop characters only. The first number supplied identifies the major version number. A second and third number, for minor and sub-minor version numbers, may also be supplied.
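
The declared pattern allows between one and three dot-separated numbers. A minimal Python check, with the hypothetical function name `is_version`, might look like this:

```python
import re

# Pattern copied from the data.version declaration.
VERSION = re.compile(r"[\d]+(\.[\d]+){0,2}")

def is_version(value):
    """True for major, major.minor or major.minor.subminor version numbers."""
    return VERSION.fullmatch(value) is not None

for candidate in ["5", "5.0", "7.0.1", "1.2.3.4", "1.0a"]:
    print(candidate, is_version(candidate))
```

The last two candidates fail: a fourth number and a letter suffix are both outside the pattern.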

data.versionNumber

data.versionNumber defines the range of attribute values used for version numbers.
Module	tei
Used by	Element: application/@version
Declaration	data.versionNumber = token { pattern = "[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}" }

data.word

data.word defines the range of attribute values expressed as a single word or token.
Module	tei
Used by	data.enumerated Element: namespace/@rend
Declaration	data.word = token { pattern = "(\p{L}|\p{N}|\p{P}|\p{S})+" }
Note	Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace.
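
Python's built-in `re` module does not support the `\p{…}` Unicode property classes used in the declaration, so the sketch below approximates the same test with `unicodedata` general categories. The function name `is_data_word` is hypothetical.

```python
import unicodedata

def is_data_word(value):
    """True if value is non-empty and every character is a letter (L*),
    number (N*), punctuation mark (P*) or symbol (S*)."""
    return bool(value) and all(
        unicodedata.category(ch)[0] in "LNPS" for ch in value
    )

for candidate in ["4-H", "Azarya_Josiah_Kohen", "€5", "two words", ""]:
    print(repr(candidate), is_data_word(candidate))
```

Whitespace belongs to the separator categories and is therefore rejected. Combining marks (category M) are rejected as well, so a decomposed accented letter would not pass unless the value is normalized to a composed form first.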

data.xTruthValue

data.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown.
Module	tei
Used by
Declaration	data.xTruthValue = xsd:boolean | "unknown" | "inapplicable"
Note	In cases where uncertainty is inappropriate, use the datatype data.truthValue.
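
The difference between the plain and the extended truth value can be sketched in Python as follows. The function name `parse_x_truth_value` and the decision to return the two extra values as strings are assumptions made for this illustration.

```python
def parse_x_truth_value(value):
    """Map the xsd:boolean lexical forms to bool and pass the two
    extended values through unchanged."""
    if value in ("true", "1"):
        return True
    if value in ("false", "0"):
        return False
    if value in ("unknown", "inapplicable"):
        return value
    raise ValueError(f"not a valid data.xTruthValue: {value!r}")

for candidate in ["true", "0", "unknown", "inapplicable"]:
    print(candidate, parse_x_truth_value(candidate))
```

A parser for the plain truth value would simply omit the third branch and reject `unknown` and `inapplicable`.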

macro.limitedContent

macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. [1.3. The TEI Class System]
Module	tei
Used by	tagUsage
Declaration	macro.limitedContent = ( text | model.limitedPhrase | model.inter )*

macro.paraContent

macro.paraContent (paragraph content) defines the content of paragraphs and similar elements. [1.3. The TEI Class System]
Module	tei
Used by	p ref title
Declaration	macro.paraContent = ( text | model.gLike | model.phrase | model.inter | model.global | lg | l )*

macro.phraseSeq

macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements. [1.4.1. Standard Content Models]
Module	tei
Used by	affiliation author editor extent forename label orgName persName surname
Declaration	macro.phraseSeq = ( text | model.gLike | model.phrase | model.global )*

macro.phraseSeq.limited

macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. [1.4.1. Standard Content Models]
Module	tei
Used by	authority language
Declaration	macro.phraseSeq.limited = ( text | model.limitedPhrase | model.global )*

macro.specialPara

macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. [1.3. The TEI Class System]
Module	tei
Used by	change item
Declaration	macro.specialPara = ( text | model.gLike | model.phrase | model.inter | model.divPart | model.global )*

3 Contact

Carolin Odebrecht,
Korpuslinguistik und Morphologie. Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin.
carolin.odebrecht@hu-berlin.de

LAUDATIO stands for Long-term Access and Usage of Deeply Annotated Information. The management and archiving of digital research data is an overlapping field for linguistics, library and information science (LIS) and computer science, and these disciplines cooperate in the LAUDATIO project. The project is funded by the German Research Foundation from 2011 to 2014. The departments of Corpus Linguistics and Historical Linguistics and the Computer and Media Service (CMS) at Humboldt-Universität zu Berlin, together with the National Institute for Research in Computer Science and Control (INRIA France), are project partners cooperating with the Berlin School of Library and Information Science (BSLIS).

Funded by Deutsche Forschungsgemeinschaft (DFG).

4 Licence

The Creative Commons Attribution 3.0 Unported (CC BY 3.0) Licence applies to this document.

Corpus Metadata. Carolin Odebrecht, Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin, carolin.odebrecht@hu-berlin.de. DFG - Deutsche Forschungsgemeinschaft

Author: Carolin Odebrecht 2015-06-08

Table of contents

1.1 Metadata model

1.2 Implementation of the metadata model

2.1 Core structure

2.2 Basic TEI header structure

2.3 Detailed TEI header structure

2.4 Formal declaration

Schema teiODD_LAUDATIOCorpus_S7: Elements

<affiliation>

<appInfo>

<application>

<author>

<authority>

<availability>

<change>

<date>

<editor>

<editorialDecl>

<encodingDesc>

<extent>

<fileDesc>

<forename>

<idno>

<item>

<label>

<language>

<langUsage>

<list>

<namespace>

<normalization>

<orgName>

<p>

<persName>

<profileDesc>

<projectDesc>

<publicationStmt>

<ref>

<revisionDesc>

<segmentation>

<sourceDesc>

<surname>

<tagsDecl>

<tagUsage>

<TEI>

<teiHeader>

<text>

<title>

<titleStmt>

Schema teiODD_LAUDATIOCorpus_S7: Model classes

model.addressLike

model.applicationLike

model.dateLike

model.divBottom

model.divPart

model.divTop

model.divTopPart

model.editorialDeclPart

model.emphLike

model.encodingDescPart

model.global

model.highlighted

model.inter

model.labelLike

model.limitedPhrase

model.listLike

model.nameLike

model.nameLike.agent

model.persNamePart

model.phrase

model.placeStateLike

model.pLike

model.pPart.data

model.pPart.edit

model.profileDescPart

model.ptrLike

model.publicationStmtPart.agency

model.publicationStmtPart.detail

model.qLike