Corpus Metadata Carolin Odebrecht Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin carolin.odebrecht@hu-berlin.de. DFG - Deutsche Forschungsgemeinschaft

Author: Carolin Odebrecht 2015-06-08

1 Application for the TEI customization

The LAUDATIO-Repository provides open access to historical text corpora. With the help of metadata such as

you will be able to choose your corpus of interest for your analysis or next preparation steps.

The TEI customization provides a metadata scheme for the LAUDATIO-Repository which enables a structured and uniform display as well as faceted and free search of such corpus metadata. The customization is based on a metadata model.

For each corpus in the repository the metadata are stored in TEI XML. There are three ODDs which specify the metadata for a corpus, the documents of the corpus and the annotation in the documents. This customization refers to the class 'corpus', see figure 1 below.

The TEI ODD is developed in the context of the LAUDATIO-Project in cooperation with the working groups of the Search and Visualization System ANNIS and the converter framework SaltNPepper.

1.1 Metadata model

The metadata model refers to all kinds of historical text corpora.

Figure 1. Metadata model

The metadata model defines a corpus as the sum of its documents, and each document as the sum of its annotations. For each concept, there is a list of attributes which represent the metadata relevant for data retrieval in the repository. For example, the attribute 'title' is assigned to each concept, but there is a difference between the title of the corpus, the title of a historical document within the corpus and the title of an annotation layer. With the help of the model, it is clear which title is meant in which situation.
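
The containment relation described above can be sketched in a few lines of Python. This is only an illustration of the model, not part of the LAUDATIO software: the class names follow the concepts 'Corpus', 'Document' and 'PreparationStep', while the field names and the helper method are assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PreparationStep:
    """One annotation layer; 'title' names the annotation layer."""
    title: str


@dataclass
class Document:
    """One historical document; 'title' names the document."""
    title: str
    annotations: List[PreparationStep] = field(default_factory=list)


@dataclass
class Corpus:
    """The whole corpus; 'title' names the corpus."""
    title: str
    documents: List[Document] = field(default_factory=list)

    def annotation_titles(self) -> List[str]:
        """Collect the distinct annotation-layer titles across all documents."""
        seen: List[str] = []
        for doc in self.documents:
            for step in doc.annotations:
                if step.title not in seen:
                    seen.append(step.title)
        return seen
```

Because each class carries its own title field, it is always unambiguous whether a title belongs to the corpus, to a document or to an annotation layer.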

For detailed information, motivation and discussion see

1.2 Implementation of the metadata model

The TEI customization creates a TEI schema for the LAUDATIO metadata concepts 'Corpus', 'Document' and 'PreparationStep' by mapping the metadata model onto the TEI P5 header structure. The metadata model is realized with a subset of TEI P5 and one TEI header for each concept.

Each TEI XML file contains fileDesc, encodingDesc and revisionDesc. The concepts 'Corpus' and 'Document' contain profileDesc because they refer to the language used in the data.

The headers relate to each other in the following way: the 'Corpus Header' provides a list of documents corresponding to the 'Document Header' and a list of annotations corresponding to the 'PreparationStep Header'. Each 'Document Header' contains a list of annotations for each document in the corpus, corresponding to the 'PreparationStep Header'.

2 TEI p5 customization for the concept 'Corpus'

The ODD combines the four basic modules (<textstructure>, <core>, <header> and <namesdates>), but removes most of the available elements and changes several attribute concepts, e.g. by adding a TEI attribute to an element. Thus, the customization is very restricted and allows little variation in selecting elements and attributes.

2.1 Core structure

The <core> module is very restricted and only contains the elements corresponding to bibliographic metadata, such as author, editor, date and title, and more general structural elements such as list, item, label, p and ref.

  • author in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.
    n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
    role may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
  • editor contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc.
    n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
    role may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
  • date contains a date in any format.
    when supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
  • list contains any sequence of items organized as a list.
    type List of documents in the corpus with corresponding references.
  • item contains one component of a list.
    n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
  • p (paragraph) marks paragraphs in prose.
  • ref (reference) defines a reference to another location, possibly modified by additional text or comment.
    target [att.pointing] specifies the destination of the reference by supplying one or more URI References.
  • title contains a title for any kind of work.
    type (You must specify the exact reference to LAUDATIO metadata module schema 'Corpus'.) classifies the title according to some convenient typology.

No class or general module modifications were made. The TEI scheme is only locally restricted; that is, no inheritance relations or module relations were changed, deleted or added. The customization is documented in the section 'Formal declaration'.

All encoded metadata refer to information addressing the whole corpus, e.g. the editors and annotators of the corpus, the preliminary annotation guidelines used for all formats of the corpus, the revision history, and the available formats and their licenses.

2.2 Basic TEI header structure

The LAUDATIO metadata scheme for the object 'Corpus' contains many more elements and modules than a standard TEI header, which is usually used for the purpose of text encoding only. For validity, fileDesc, profileDesc, encodingDesc and revisionDesc are required.

The fileDesc contains the following elements. Their basic meaning is left untouched, but they refer to the object 'Corpus': the element author refers to the annotator of the corpus, and the element title refers to the title of the entire corpus. Elements used within teiHeader are:

  • titleStmt (title statement) groups information about the title of a work and those responsible for its content.
  • extent describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
    type (You must indicate what type of extent you refer to.) characterizes the element in some sense, using any convenient classification scheme or typology.
  • publicationStmt (publication statement) groups information concerning the publication or distribution of an electronic or other text.
  • sourceDesc (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence.
  • profileDesc (text-profile description) The langUsage defines which language the document has.
  • encodingDesc (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. For LAUDATIO: The element encodingDesc refers to the different formats of the corpus. For each format, a single encodingDesc contains information about what kind of annotation, including keys and values, and what kind of formats were used.
    n [att.global] (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
  • revisionDesc (revision description) summarizes the revision history for a file.
<TEI xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader type="CorpusHeader">
  <fileDesc>
   <titleStmt>
    <title>...</title>
    <editor>...</editor>
    <author>...</author>
   </titleStmt>
   <extent>...</extent>
   <publicationStmt>
    <authority>...</authority>
    <idno>...</idno>
    <availability>...</availability>
    <date>...</date>
   </publicationStmt>
   <sourceDesc>...</sourceDesc>
  </fileDesc>
  <profileDesc> ... </profileDesc>
  <encodingDesc> ... </encodingDesc>
  <revisionDesc> ... </revisionDesc>
 </teiHeader>
 <text/>
</TEI>
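
The validity requirement stated in this section (fileDesc, profileDesc, encodingDesc and revisionDesc must all be present) can be checked roughly as follows. This is a minimal sketch using Python's standard library rather than real schema validation; the function name and the constant names are assumptions made for this example.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
REQUIRED_SECTIONS = ("fileDesc", "profileDesc", "encodingDesc", "revisionDesc")


def missing_header_sections(tei_xml: str) -> list:
    """Return the names of required teiHeader children that are absent."""
    root = ET.fromstring(tei_xml)
    header = root.find(f"{{{TEI_NS}}}teiHeader")
    if header is None:
        # Without a teiHeader, every required section is missing.
        return list(REQUIRED_SECTIONS)
    # Strip the namespace prefix "{...}" from each child tag.
    present = {child.tag.split("}")[-1] for child in header}
    return [name for name in REQUIRED_SECTIONS if name not in present]
```

An empty result only means that the four sections exist; validation against the generated schema is still needed to check their content.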
The titleStmt contains the title, editors, annotators and technical staff of the corpus, each declared with forename, surname and affiliation.
  • title contains a title for any kind of work.
  • persName (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc.
  • forename contains a forename, given or baptismal name.
  • surname contains a family (inherited) name, as opposed to a given, baptismal, or nick name.
  • affiliation contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor.
<teiHeader type="CorpusHeader">
 <titleStmt>
  <title>...</title>
  <editor n="..." role="...">
   <persName>
    <forename>...</forename>
    <surname>...</surname>
   </persName>
   <affiliation>
    <orgName type="Department">...</orgName>
    <orgName type="Institution">...</orgName>
   </affiliation>
  </editor>
  <author n="..." role="...">
   <persName>
    <forename>...</forename>
    <surname>...</surname>
   </persName>
   <affiliation>
    <orgName type="Department">...</orgName>
    <orgName type="Institution">...</orgName>
   </affiliation>
  </author>
 </titleStmt>
</teiHeader>
The extent provides information about the size of the corpus. The attribute type defines the counting unit, 'tokens' or 'words'.
  • extent describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
    type (You must indicate what type of extent you refer to.) characterizes the element in some sense, using any convenient classification scheme or typology.
<teiHeader type="CorpusHeader">
 <fileDesc>
  <titleStmt/>
  <extent type="...">...</extent>
 </fileDesc>
</teiHeader>
The publicationStmt covers the availability, responsibility and publication dates.
  • authority (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor.
  • idno (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way.
  • availability supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc.
    status supplies a code identifying the current availability of the text.
  • date contains a date in any format.
    when supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
<teiHeader type="CorpusHeader">
 <fileDesc>
  <publicationStmt>
   <authority>...</authority>
   <idno>...</idno>
   <availability status="...">
    <p>....</p>
   </availability>
   <date type="CorpusRelease" when="...">...</date>
  </publicationStmt>
 </fileDesc>
</teiHeader>
sourceDesc contains a list of documents of the corpus which are given with technical short terms.
  • list contains any sequence of items organized as a list.
    type List of documents in the corpus with corresponding references.
  • item contains one component of a list.
    corresp
    n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
<teiHeader type="CorpusHeader">
 <fileDesc>
  <sourceDesc>
   <list type="CorpusDocument">
    <item corresp="..." n="1"/>
    <item corresp="..." n="2"/>
    <item corresp="..." n="3"/>
   </list>
  </sourceDesc>
 </fileDesc>
</teiHeader>
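
Reading the document list back out of such a header could look like the following sketch. It assumes the un-namespaced form of the snippets in this section (a complete TEI file would carry the TEI namespace), and the function name is hypothetical.

```python
import xml.etree.ElementTree as ET


def corpus_documents(header_xml: str) -> list:
    """Return (n, corresp) pairs for each item in the CorpusDocument list."""
    root = ET.fromstring(header_xml)
    pairs = []
    # The snippets in this section carry no namespace, so plain names are used.
    for lst in root.iter("list"):
        if lst.get("type") != "CorpusDocument":
            continue
        for item in lst.findall("item"):
            pairs.append((item.get("n"), item.get("corresp")))
    return pairs
```

Each pair links the running number of a document to the value of its corresp attribute, which is what ties the 'Corpus Header' to the individual 'Document Header' files.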
The profileDesc only contains metadata concerning the language of the corpus data, including remarks on language type and language area.
  • langUsage (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text.
  • language characterizes a single language or sublanguage used within a text.
    style contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text.
    ident (identifier) supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global xml:lang attribute.
<teiHeader type="CorpusHeader">
 <fileDesc/>
 <profileDesc>
  <langUsage>
   <language ident="..." style="Language">...</language>
   <language ident="..." style="LanguageType">...</language>
   <language ident="..." style="LanguageArea">...</language>
  </langUsage>
 </profileDesc>
</teiHeader>
The encodingDesc contains descriptions of the corpus annotation guidelines depending on the format, segmentation and normalization. Each annotation key and value of each format is listed with a short description similar to annotation guidelines. Furthermore, the annotations are roughly divided according to their functions. Additionally, the research project and its references are provided. Elements used within encodingDesc are:
  • appInfo (application information) records information about an application which has edited the TEI file.
  • application provides information about an application which has acted upon the document.
    ident supplies an identifier for the application, independent of its version number or display name.
    version supplies a version number for the application, independent of its identifier or display name.
  • label contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary.
  • projectDesc (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected.
  • ref (reference) defines a reference to another location, possibly modified by additional text or comment.
    target [att.pointing] specifies the destination of the reference by supplying one or more URI References.
  • editorialDecl (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text.
  • segmentation describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc.
  • normalization indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form.
  • tagsDecl (tagging declaration) provides detailed information about the tagging applied to a document.
  • namespace supplies the formal name of the namespace to which the elements documented by its children belong.
    name specifies the full formal name of the namespace concerned.
    rend (rendition) indicates how the element in question was rendered or presented in the source text.
  • tagUsage supplies information about the usage of a specific element within a text.
    gi (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent namespace element.
<teiHeader type="CorpusHeader">
 <fileDesc/>
 <profileDesc/>
 <encodingDesc n="1">
  <appInfo>
   <application ident="..." version="1">
    <label>...</label>
   </application>
  </appInfo>
  <projectDesc>
   <p>
    <ref target="..."/>...</p>
  </projectDesc>
  <editorialDecl>
   <segmentation>
    <p>...</p>
   </segmentation>
   <normalization>
    <p>...</p>
   </normalization>
  </editorialDecl>
  <tagsDecl>
   <namespace name="..." rend="Lexical" xml:id="...">
    <tagUsage gi="...">...</tagUsage>
   </namespace>
  </tagsDecl>
 </encodingDesc>
</teiHeader>
revisionDesc refers to the version history of the whole corpus. Corpora in the LAUDATIO-Repository may have several versions. The changes between these versions will be provided here.
  • revisionDesc (revision description) summarizes the revision history for a file.
  • change documents a change or set of changes made during the production of a source document, or during the revision of an electronic file.
    n [att.global] (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
    when [att.datable.w3c] supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
    who [att.ascribed] indicates the person, or group of people, to whom the element content is ascribed.
<teiHeader type="CorpusHeader">
 <fileDesc/>
 <profileDesc/>
 <encodingDesc/>
 <revisionDesc>
  <change n="..." type="..." when="..." who="...">...</change>
  <change n="..." type="..." when="..." who="...">...</change>
 </revisionDesc>
</teiHeader>
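
A version history recorded this way could be listed, most recent change first, with a sketch like the one below. It assumes un-namespaced input as in the snippets of this section and when values that all use the same yyyy-mm-dd form; the function name and the dictionary keys are hypothetical.

```python
import xml.etree.ElementTree as ET


def revision_history(header_xml: str) -> list:
    """Return the change entries of a header, most recent first."""
    root = ET.fromstring(header_xml)
    changes = []
    for change in root.iter("change"):
        changes.append({
            "n": change.get("n"),
            "when": change.get("when"),
            "who": change.get("who"),
            "text": (change.text or "").strip(),
        })
    # Dates in the same yyyy-mm-dd form sort correctly as plain strings.
    return sorted(changes, key=lambda c: c["when"] or "", reverse=True)
```

Entries without a when attribute sort last, since they are treated as an empty string.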

2.3 Detailed TEI header structure

A full TEI structure with all required elements is given below:
<teiHeader type="CorpusHeader">
 <fileDesc>
  <titleStmt>
   <title>...</title>
   <editor n="..." role="...">
    <persName>
     <forename>...</forename>
     <surname>...</surname>
    </persName>
    <affiliation>
     <orgName type="Department">...</orgName>
     <orgName type="Institution">...</orgName>
    </affiliation>
   </editor>
   <author n="..." role="...">
    <persName>
     <forename>...</forename>
     <surname>...</surname>
    </persName>
    <affiliation>
     <orgName type="Department">...</orgName>
     <orgName type="Institution">...</orgName>
    </affiliation>
   </author>
  </titleStmt>
  <extent type="...">...</extent>
  <publicationStmt>
   <authority>...</authority>
   <idno>...</idno>
   <availability status="...">
    <p>....</p>
   </availability>
   <date type="CorpusRelease" when="...">...</date>
  </publicationStmt>
  <sourceDesc>
   <list type="CorpusDocument">
    <item corresp="..." n="1"/>
    <item corresp="..." n="2"/>
    <item corresp="..." n="3"/>
   </list>
  </sourceDesc>
 </fileDesc>
 <profileDesc>
  <langUsage>
   <language ident="..." style="Language">...</language>
   <language ident="..." style="LanguageType">...</language>
   <language ident="..." style="LanguageArea">...</language>
  </langUsage>
 </profileDesc>
 <encodingDesc n="1">
  <appInfo>
   <application ident="..." version="1">
    <label>...</label>
   </application>
  </appInfo>
  <projectDesc>
   <p>
    <ref target="..."/>...</p>
  </projectDesc>
  <editorialDecl>
   <segmentation>
    <p>...</p>
   </segmentation>
   <normalization>
    <p>...</p>
   </normalization>
  </editorialDecl>
  <tagsDecl>
   <namespace name="..." rend="Lexical" xml:id="...">
    <tagUsage gi="...">...</tagUsage>
   </namespace>
   <namespace name="..." rend="Syntactical" xml:id="Syntax">
    <tagUsage gi="...">...</tagUsage>
    <tagUsage gi="...">...</tagUsage>
    <tagUsage gi="...">...</tagUsage>
   </namespace>
   <namespace name="morph" rend="Morphological" xml:id="morph">
    <tagUsage gi="...">...</tagUsage>
    <tagUsage gi="...">...</tagUsage>
    <tagUsage gi="...">...</tagUsage>
   </namespace>
  </tagsDecl>
 </encodingDesc>
 <revisionDesc>
  <change n="..." type="..." when="..." who="...">...</change>
  <change n="..." type="..." when="..." who="...">...</change>
 </revisionDesc>
</teiHeader>

2.4 Formal declaration

Schema teiODD_LAUDATIOCorpus_S7: Elements

<affiliation>

<affiliation> contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor. [15.2.2. The Participant Description]
Module namesdates
Member of
Contained by
May contain
Declaration
element affiliation { macro.phraseSeq }
Example
<affiliation>Junior project officer for the US <name type="org">National Endowment for
   the Humanities</name>
</affiliation>
<affiliation notAfter="1960-01-01"
 notBefore="1957-02-28">
Paid up member of the
<orgName>Australian Journalists Association</orgName>
</affiliation>
Note

If included, the name of an organization may be tagged using either the <name> element as above, or the more specific orgName element.

<appInfo>

<appInfo> (application information) records information about an application which has edited the TEI file. [2.3.10. The Application Information Element]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Member of
Contained by
header: encodingDesc
May contain
header: application
Declaration
element appInfo { att.global.attributes, model.applicationLike+ }
Example
<appInfo>
 <application ident="Xaira" version="1.24">
  <label>XAIRA Indexer</label>
  <ptr target="#P1"/>
 </application>
</appInfo>

<application>

<application> provides information about an application which has acted upon the document. [2.3.10. The Application Information Element]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) att.datable (@calendar, @period) (att.datable.w3c (@when, @notBefore, @notAfter, @from, @to)) (att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso)) (att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod))
ident supplies an identifier for the application, independent of its version number or display name.
Status Required
Datatype data.name
version supplies a version number for the application, independent of its identifier or display name.
Status Required
Datatype data.versionNumber
Member of
Contained by
header: appInfo
May contain
core: label p ref
Declaration
element application
{
   att.global.attributes,
   att.typed.attributes,
   att.datable.attributes,
   attribute ident { data.name },
   attribute version { data.versionNumber },
   ( model.labelLike+, ( model.ptrLike* | model.pLike* ) )
}
Example
<appInfo>
 <application ident="ImageMarkupTool1" notAfter="2006-06-01" version="1.5">
  <label>Image Markup Tool</label>
  <ptr target="#P1"/>
  <ptr target="#P2"/>
 </application>
</appInfo>

This example shows an appInfo element documenting the fact that version 1.5 of the Image Markup Tool1 application has an interest in two parts of a document which was last saved on June 6 2006. The parts concerned are accessible at the URLs given as target for the two <ptr> elements.

<author>

<author> in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement]
Module core
Attributes att.global.responsibility (@cert, @resp)
n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
Derived from att.global
Status Recommended
Datatype data.text
role may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
Derived from att.naming
Status Recommended
Datatype 1–∞ occurrences of data.enumerated separated by whitespace
Legal values are:
Annotator
Infrastructure
Transcription
Member of
Contained by
header: titleStmt
May contain
Declaration
element author
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute n { data.text }?,
   attribute role
   {
      list { ( "Annotator" | "Infrastructure" | "Transcription" )+ }
   }?,
   macro.phraseSeq
}
Example
<author>British Broadcasting Corporation</author>
<author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author>
<author>Anonymous</author>
<author>Bill and Melinda Gates Foundation</author>
<author>
 <persName>Beaumont, Francis</persName> and
<persName>John Fletcher</persName>
</author>
<author>
 <orgName key="BBC">British Broadcasting
   Corporation</orgName>: Radio 3 Network
</author>
Note

Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use a generally recognized name authority file to supply the content for this element. The attributes key or ref may also be used to reference canonical information about the author(s) intended from any appropriate authority, such as a library catalogue or online resource.

In the case of a broadcast, use this element for the name of the company or network responsible for making the broadcast.

Where an author is unknown or unspecified, this element may contain text such as Unknown or Anonymous. When the appropriate TEI modules are in use, it may also contain detailed tagging of the names used for people, organizations or places, in particular where multiple names are given.
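
The restriction on the role attribute in the declaration above (one or more whitespace-separated values out of 'Annotator', 'Infrastructure' and 'Transcription') can be mirrored in a small check. The sketch below only imitates the schema rule for illustration; the function and constant names are assumptions.

```python
# Legal values of author/@role as listed in the declaration above.
AUTHOR_ROLES = {"Annotator", "Infrastructure", "Transcription"}


def is_valid_author_role(role_value: str) -> bool:
    """True if the value holds one or more legal, whitespace-separated roles."""
    tokens = role_value.split()
    # The declaration requires at least one token and matches case-sensitively.
    return bool(tokens) and all(token in AUTHOR_ROLES for token in tokens)
```

For example, "Annotator Transcription" passes, whereas "annotator" (wrong case) and an empty value do not.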

<authority>

<authority> (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. Publication, Distribution, Licensing, etc.]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Member of
Contained by
May contain
Declaration
element authority { att.global.attributes, macro.phraseSeq.limited }
Example
<authority>John Smith</authority>

<availability>

<availability> supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.]
Module header
Attributes att.global.responsibility (@cert, @resp)
status supplies a code identifying the current availability of the text.
Status Required
Datatype data.enumerated
Legal values are:
free
the text is freely available.
unknown
the status of the text is unknown. [Default]
restricted
the text is not freely available.
Member of
Contained by
May contain
core: p
Declaration
element availability
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute status { "free" | "unknown" | "restricted" },
   ( model.availabilityPart | model.pLike )+
}
Example
<availability status="restricted">
 <p>Available for academic research purposes only.</p>
</availability>
<availability status="free">
 <p>In the public domain</p>
</availability>
<availability status="restricted">
 <p>Available under licence from the publishers.</p>
</availability>
Example
<availability>
 <licence target="http://opensource.org/licenses/MIT">
  <p>The MIT License
     applies to this document.</p>
  <p>Copyright (C) 2011 by The University of Victoria</p>
  <p>Permission is hereby granted, free of charge, to any person obtaining a copy
     of this software and associated documentation files (the "Software"), to deal
     in the Software without restriction, including without limitation the rights
     to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
     copies of the Software, and to permit persons to whom the Software is
     furnished to do so, subject to the following conditions:</p>
  <p>The above copyright notice and this permission notice shall be included in
     all copies or substantial portions of the Software.</p>
  <p>THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
     IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
     FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
     AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
     LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
     OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
     THE SOFTWARE.</p>
 </licence>
</availability>
Note

A consistent format should be adopted.
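
In this customization the status attribute is required and limited to 'free', 'unknown' and 'restricted', so an availability element without status (as in the licence example above, which is taken from the general TEI documentation) would not validate. A rough sketch of that rule, with hypothetical function and constant names:

```python
import xml.etree.ElementTree as ET

# Legal values of availability/@status as listed in the declaration above.
AVAILABILITY_STATUS = ("free", "unknown", "restricted")


def availability_problem(xml_fragment: str):
    """Return a short problem description, or None if @status is acceptable."""
    elem = ET.fromstring(xml_fragment)
    status = elem.get("status")
    if status is None:
        return "missing required attribute 'status'"
    if status not in AVAILABILITY_STATUS:
        return f"unexpected status value: {status!r}"
    return None
```

This covers only the attribute; the content model of availability is left to schema validation.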

<change>

<change> documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.5. The Revision Description 2.4.1. Creation 11.7. Changes]
Module header
Attributes att.ascribed (@who) att.datable (@calendar, @period) (att.datable.w3c (@when, @notBefore, @notAfter, @from, @to)) (att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso)) (att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)) att.docStatus (@status) att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype)
target points to one or more elements that belong to this change.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Contained by
header: revisionDesc
May contain
Declaration
element change
{
   att.ascribed.attributes,
   att.datable.attributes,
   att.docStatus.attributes,
   att.global.attributes,
   att.typed.attributes,
   attribute target { list { data.pointer+ } }?,
   macro.specialPara
}
Example
<titleStmt>
 <title> ... </title>
 <editor xml:id="LDB">Lou Burnard</editor>
 <respStmt xml:id="BZ">
  <resp>copy editing</resp>
  <name>Brett Zamir</name>
 </respStmt>
</titleStmt>
<!-- ... -->
<revisionDesc status="published">
 <change status="public" when="2008-02-02" who="#BZ">Finished chapter 23</change>
 <change status="draft" when="2008-01-02" who="#BZ">Finished chapter 2</change>
 <change n="P2.2" when="1991-12-21" who="#LDB">Added examples to section 3</change>
 <change when="1991-11-11" who="#MSM">Deleted chapter 10</change>
</revisionDesc>
Example
<profileDesc>
 <creation>
  <listChange>
   <change xml:id="DRAFT1">First draft in pencil</change>
   <change notBefore="1880-12-09"
    xml:id="DRAFT2">
First revision, mostly
       using green ink</change>
   <change notBefore="1881-02-13"
    xml:id="DRAFT3">
Final corrections as
       supplied to printer.</change>
  </listChange>
 </creation>
</profileDesc>
Note

The who attribute may be used to point to any other element, but will typically specify a <respStmt> or <person> element elsewhere in the header, identifying the person responsible for the change and their role in making it.

It is recommended that changes be recorded with the most recent first. The status attribute may be used to indicate the status of a document following the change documented.

<date>

<date> contains a date in any format. [3.5.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.5. The Revision Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 15.2.3. The Setting Description 13.3.6. Dates and Times]
Module core
Attributes att.typed (subtype, @type)
when supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
Derived from att.datable.w3c
Status Required
Datatype data.temporal.iso
Member of
Contained by
May contain
Declaration
element date
{
   att.typed.attribute.type,
   attribute when { data.temporal.iso },
   ( text | model.gLike | model.phrase | model.global )*
}
Example
<date when="1980-02">early February 1980</date>
Example
Given on the <date when="1977-06-12">Twelfth Day
of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the Republic
the Two Hundredth and first and of the University the Eighty-Sixth.</date>
Example
<date when="1990-09">September 1990</date>

<editor>

<editor> contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.11.2.2. Titles, Authors, and Editors]
Module core
Attributes att.global.responsibility (@cert, @resp)
n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
Derived from att.global
Status Recommended
Datatype data.text
role may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
Derived from att.naming
Status Recommended
Datatype 1–∞ occurrences of data.enumerated separated by whitespace
Legal values are:
CorpusEditor
Member of
Contained by
header: titleStmt
May contain
Declaration
element editor
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute n { data.text }?,
   attribute role { list { "CorpusEditor"+ } }?,
   macro.phraseSeq
}
Example
<editor>Eric Johnson</editor>
<editor role="illustrator">John Tenniel</editor>
Note

A consistent format should be adopted.

Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use generally recognized authority lists for the exact form of personal names.

<editorialDecl>

<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements]
Module header
Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
Member of
Contained by
header: encodingDesc
May contain
Declaration
element editorialDecl
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   ( model.pLike | model.editorialDeclPart )+
}
Example
<editorialDecl>
 <normalization>
  <p>All words converted to Modern American spelling using
     Websters 9th Collegiate dictionary
  </p>
 </normalization>
 <quotation marks="all">
  <p>All opening quotation marks converted to “ all closing
     quotation marks converted to &amp;cdq;.</p>
 </quotation>
</editorialDecl>

<encodingDesc>

<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. For LAUDATIO: The element encodingDesc refers to the different formats of the corpus. For each format, a single encodingDesc records which annotations (including keys and values) and which formats were used. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Member of
Contained by
header: teiHeader
May contain
Declaration
element encodingDesc
{
   att.global.attributes,
   ( ( model.encodingDescPart | model.pLike )+ )
}
Example

A corpus is available in EXMARaLDA format. The annotations were applied with a specific version of the EXMARaLDA Partitur Editor, see appInfo. Information about the minimal segmentation in the format and the normalizations of the primary data is given in segmentation and normalization. The tagsDecl lists the annotation keys and values which are available in EXMARaLDA. Each namespace has @rend with a fixed set of values which classify the annotations according to their basic functions, such as 'Transcription', 'Syntactical' and 'Morphological'. This makes it possible to search for certain annotations across more than one corpus in the LAUDATIO-Repository.

<encodingDesc n="1">
 <appInfo>
  <application ident="EXMARaLDA"
   version="1.5.1">

   <label>EXMARaLDA Partitur Editor</label>
  </application>
 </appInfo>
 <projectDesc>
  <p>
   <ref target="http://korpling.german.hu-berlin.de/ridges/"/>The RIDGES
     project (Register in Diachronic German Science) is an investigation into the
     development of the German scientific language in the early modern and modern
     periods, ranging from the mid 16th to the late 19th century. </p>
 </projectDesc>
 <editorialDecl>
  <segmentation>
   <p>Multiple segmentations on the basis of "dipl", "clean" and "norm".</p>
  </segmentation>
  <normalization>
   <p>Two normalization are provided by the layer "clean" containing
       normalizations of special characters and by the layer "norm" containing
       normalizations of orthography.</p>
  </normalization>
 </editorialDecl>
 <tagsDecl>
  <namespace name="dipl"
   rend="Transcription" xml:id="dipl">

   <tagUsage gi="NA">The diplomatic transcription of the word form as found on
       the manuscript. Line-breaks are marked as in the text, usually as
       'U+2E17'.</tagUsage>
  </namespace>
  <namespace name="pos" rend="Lexical"
   xml:id="pos">

   <tagUsage gi="ADJA">Attributives Adjektiv.</tagUsage>
   <tagUsage gi="ADJD">Adverbiales oder prädikatives Adjektiv.</tagUsage>
   <tagUsage gi="ADV">Adverb.</tagUsage>
  </namespace>
 </tagsDecl>
</encodingDesc>
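
To illustrate how the fixed @rend values could support searching annotations across corpora, the following Python sketch groups the annotation keys and values of a tagsDecl by category. It is a minimal sketch under stated assumptions, not the repository's implementation: the function name annotations_by_category and the shortened sample are hypothetical, and the sample omits the TEI namespace that real headers would carry.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical, shortened sample without the TEI namespace declaration.
SAMPLE = """<tagsDecl>
 <namespace name="dipl" rend="Transcription" xml:id="dipl">
  <tagUsage gi="NA">Diplomatic transcription.</tagUsage>
 </namespace>
 <namespace name="pos" rend="Lexical" xml:id="pos">
  <tagUsage gi="ADJA">Attributives Adjektiv.</tagUsage>
  <tagUsage gi="ADV">Adverb.</tagUsage>
 </namespace>
</tagsDecl>"""


def annotations_by_category(tags_decl_xml: str) -> dict:
    """Map each @rend category to {annotation key: [annotation values]}."""
    root = ET.fromstring(tags_decl_xml)
    index = defaultdict(dict)
    for ns in root.iter("namespace"):
        values = [usage.get("gi") for usage in ns.iter("tagUsage")]
        # @rend may hold several whitespace-separated categories.
        for category in ns.get("rend", "").split():
            index[category][ns.get("name")] = values
    return dict(index)


print(annotations_by_category(SAMPLE))
```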

<extent>

<extent> describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 10.7.1. Object Description]
Module header
Attributes att.global.responsibility (@cert, @resp) att.typed (type, @subtype)
type (You must indicate which type of extent you refer to) characterizes the element in some sense, using any convenient classification scheme or typology.
Derived fromatt.typed
Status Required
Datatype data.enumerated
Legal values are:
Tokens
(The size of the corpus is given in tokens.)
Words
(The size of the corpus is given in words.)
Contained by
header: fileDesc
May contain
Declaration
element extent
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   att.typed.attribute.subtype,
   attribute type { "Tokens" | "Words" },
   macro.phraseSeq
}
Example
<extent>3200 sentences</extent>
<extent>between 10 and 20 Mb</extent>
<extent>ten 3.5 inch high density diskettes</extent>
Example

The <measure> element may be used to supply normalised or machine-tractable versions of the size or sizes concerned.

<extent>
 <measure quantity="4.2" unit="MiB">About four megabytes</measure>
 <measure quantity="245" unit="pages">245 pages of source
   material</measure>
</extent>

<fileDesc>

<fileDesc> (file description) [2.2. The File Description 2.1.1. The TEI Header and Its Components]
Module header
Attributes att.global.responsibility (@cert, @resp)
Contained by
header: teiHeader
May contain
Declaration
element fileDesc
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   (
      (
         titleStmt,
         editionStmt?,
         extent?,
         publicationStmt,
         seriesStmt?,
         notesStmt?
      ),
      sourceDesc+
   )
}
Example

The <fileDesc> refers to the whole corpus. Each corpus has one or more editors and annotators and may have technical supporters; see editor and author. References to projects and institutions can be given per person in affiliation or for the corpus as a whole in publicationStmt. All documents of a corpus are listed in sourceDesc under a technical short name.

<fileDesc>
 <titleStmt>
  <title type="Corpus">RIDGES Herbology Version 2.0</title>
  <editor n="1" role="CorpusEditor">
   <persName>
    <forename>Anke</forename>
    <surname>Lüdeling</surname>
   </persName>
   <affiliation>
    <orgName type="Department">Institut für deutsche Sprache und
         Linguistik</orgName>
    <orgName type="Institution">Humboldt-Universität zu Berlin</orgName>
   </affiliation>
  </editor>
  <author n="1" role="Infrastructure">
   <persName>
    <forename>Carolin</forename>
    <surname>Odebrecht</surname>
   </persName>
   <affiliation>
    <orgName type="Department">Institut für deutsche Sprache und
         Linguistik</orgName>
    <orgName type="Institution">Humboldt-Universität zu Berlin</orgName>
   </affiliation>
  </author>
 </titleStmt>
 <extent type="Tokens">60720</extent>
 <publicationStmt>
  <authority>Humboldt-Universität zu Berlin</authority>
  <idno>RIDGES Herbology Project.</idno>
  <availability status="free">
   <p>Open Source Project. All corpus data generated by
       the RIDGES project is licensed under a Creative Commons Attribution 3.0
       Unported License.</p>
  </availability>
  <date type="CorpusRelease" when="2011">First corpus release.</date>
  <date type="CorpusRelease"
   when="2012-05-31">
Second corpus release.</date>
 </publicationStmt>
 <sourceDesc>
  <list type="CorpusDocument">
   <item corresp="deutsche.pflanzennamen.1870"
    n="1"/>

   <item corresp="noch.einige.worte.1840"
    n="2"/>

   <item corresp="flora.7.1821" n="3"/>
  </list>
 </sourceDesc>
</fileDesc>
Note

The major source of information for those seeking to create a catalogue entry or bibliographic citation for an electronic file. As such, it provides a title and statements of responsibility together with details of the publication or distribution of the file, of any series to which it belongs, and detailed bibliographic notes for matters not addressed elsewhere in the header. It also contains a full bibliographic description for the source or sources from which the electronic text was derived.
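
As an illustration of how the corpus-level metadata in fileDesc could be read for display or search, the following Python sketch extracts the corpus title, the extent and the list of documents. It is a sketch under stated assumptions: the function name summarize_file_desc and the shortened sample are hypothetical, and the sample omits the TEI namespace that real headers would carry.

```python
import xml.etree.ElementTree as ET

# Hypothetical, shortened sample without the TEI namespace declaration.
SAMPLE = """<fileDesc>
 <titleStmt>
  <title type="Corpus">RIDGES Herbology Version 2.0</title>
 </titleStmt>
 <extent type="Tokens">60720</extent>
 <publicationStmt>
  <authority>Humboldt-Universität zu Berlin</authority>
 </publicationStmt>
 <sourceDesc>
  <list type="CorpusDocument">
   <item corresp="deutsche.pflanzennamen.1870" n="1"/>
   <item corresp="noch.einige.worte.1840" n="2"/>
   <item corresp="flora.7.1821" n="3"/>
  </list>
 </sourceDesc>
</fileDesc>"""


def summarize_file_desc(file_desc_xml: str) -> dict:
    """Collect title, extent and document short names from a fileDesc."""
    root = ET.fromstring(file_desc_xml)
    extent = root.find("extent")  # optional in the content model
    items = root.findall("sourceDesc/list[@type='CorpusDocument']/item")
    return {
        "title": root.findtext("titleStmt/title[@type='Corpus']"),
        # Assumption: the extent content is a plain integer count.
        "extent": None if extent is None
        else (extent.get("type"), int(extent.text)),
        "documents": [item.get("corresp") for item in items],
    }


print(summarize_file_desc(SAMPLE))
```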

<forename>

<forename> contains a forename, given or baptismal name. [13.2.1. Personal Names]
Module namesdates
Attributes att.global.responsibility (@cert, @resp)
Member of
Contained by
May contain
Declaration
element forename
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   macro.phraseSeq
}
Example
<persName>
 <roleName>Ex-President</roleName>
 <forename>George</forename>
 <surname>Bush</surname>
</persName>

<idno>

<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.11.2.4. Imprint, Size of a Document, and Reprint Information]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.sortable (@sortKey)
type categorizes the identifier, for example as an ISBN, Social Security number, etc.
Status Optional
Datatype data.enumerated
Member of
Contained by
May contain
header: idno
Declaration
element idno
{
   att.global.attributes,
   att.sortable.attributes,
   attribute type { data.enumerated }?,
   ( text | model.gLike | idno )*
}
Example
<idno type="ISBN">978-1-906964-22-1</idno>
<idno type="ISSN">0143-3385</idno>
<idno type="DOI">10.1000/123</idno>
<idno type="URI">http://www.worldcat.org/oclc/185922478</idno>
<idno type="URI">http://authority.nzetc.org/463/</idno>
<idno type="LT">Thomason Tract E.537(17)</idno>
<idno type="Wing">C695</idno>
<idno type="oldCat">
 <g ref="#sym"/>345
</idno>

In the last case, the identifier includes a non-Unicode character which is defined elsewhere by means of a <glyph> or <char> element referenced here as #sym.

Note

idno should be used for labels which identify an object or concept in a formal cataloguing system such as a database or an RDF store, or in a distributed system such as the World Wide Web. Some suggested values for type on idno are ISBN, ISSN, DOI, and URI.

<item>

<item> contains one component of a list. [3.7. Lists 2.5. The Revision Description]
Module core
Attributes att.global.responsibility (@cert, @resp)
n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
Derived from att.global
Status Required
Datatype data.text
corresp
Status Required
Contained by
core: list
May contain
Declaration
element item
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute n { data.text },
   attribute corresp { text },
   macro.specialPara
}
Example
<list rend="numbered">
 <head>Here begin the chapter headings of Book IV</head>
 <item n="4.1">The death of Queen Clotild.</item>
 <item n="4.2">How King Lothar wanted to appropriate one third of the Church revenues.</item>
 <item n="4.3">The wives and children of Lothar.</item>
 <item n="4.4">The Counts of the Bretons.</item>
 <item n="4.5">Saint Gall the Bishop.</item>
 <item n="4.6">The priest Cato.</item>
 <item> ...</item>
</list>
Note

May contain simple prose or a sequence of chunks.

Whatever string of characters is used to label a list item in the copy text may be used as the value of the global n attribute, but it is not required that numbering be recorded explicitly. In ordered lists, the n attribute on the item element is by definition synonymous with the use of the label element to record the enumerator of the list item. In glossary lists, however, the term being defined should be given with the label element, not n.

<label>

<label> contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.7. Lists]
Module core
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) att.placement (@place)
Member of
Contained by
May contain
Declaration
element label
{
   att.global.attributes,
   att.typed.attributes,
   att.placement.attributes,
   macro.phraseSeq
}
Example

Labels are commonly used for the headwords in glossary lists; note the use of the global xml:lang attribute to set the default language of the glossary list to Middle English, and identify the glosses and headings as modern English or Latin:

<list type="gloss" xml:lang="enm">
 <head xml:lang="en">Vocabulary</head>
 <headLabel xml:lang="en">Middle English</headLabel>
 <headItem xml:lang="en">New English</headItem>
 <label>nu</label>
 <item xml:lang="en">now</item>
 <label>lhude</label>
 <item xml:lang="en">loudly</item>
 <label>bloweth</label>
 <item xml:lang="en">blooms</item>
 <label>med</label>
 <item xml:lang="en">meadow</item>
 <label>wude</label>
 <item xml:lang="en">wood</item>
 <label>awe</label>
 <item xml:lang="en">ewe</item>
 <label>lhouth</label>
 <item xml:lang="en">lows</item>
 <label>sterteth</label>
 <item xml:lang="en">bounds, frisks (cf. <cit>
   <ref>Chaucer, K.T.644</ref>
   <quote>a courser, <term>sterting</term>as the fyr</quote>
  </cit>
 </item>
 <label>verteth</label>
 <item xml:lang="la">pedit</item>
 <label>murie</label>
 <item xml:lang="en">merrily</item>
 <label>swik</label>
 <item xml:lang="en">cease</item>
 <label>naver</label>
 <item xml:lang="en">never</item>
</list>
Example

Labels may also be used to record explicitly the numbers or letters which mark list items in ordered lists, as in this extract from Gibbon's Autobiography. In this usage the label element is synonymous with the n attribute on the item element:

I will add two facts, which have seldom occurred
in the composition of six, or at least of five quartos. <list rend="runon" type="ordered">
 <label>(1)</label>
 <item>My first rough manuscript, without any intermediate copy, has been sent to the press.</item>
 <label>(2) </label>
 <item>Not a sheet has been seen by any human eyes, excepting those of the author and the
   printer: the faults and the merits are exclusively my own.</item>
</list>
Example

Labels may also be used for other structured list items, as in this extract from the journal of Edward Gibbon:

<list type="gloss">
 <label>March 1757.</label>
 <item>I wrote some critical observations upon Plautus.</item>
 <label>March 8th.</label>
 <item>I wrote a long dissertation upon some lines of Virgil.</item>
 <label>June.</label>
 <item>I saw Mademoiselle Curchod — <quote xml:lang="la">Omnia vincit amor, et nos cedamus
     amori.</quote>
 </item>
 <label>August.</label>
 <item>I went to Crassy, and staid two days.</item>
</list>

Note that the label might also appear within the item rather than as its sibling. Though syntactically valid, this usage is not recommended TEI practice.

Example

Labels may also be used to represent a label or heading attached to a paragraph or sequence of paragraphs not treated as a structural division, or to a group of verse lines. Note that, in this case, the label element appears within the p or <lg> element, rather than as a preceding sibling of it.

<p>[...]
<lb/>&amp; n’entrer en mauuais &amp; mal-heu-
<lb/>ré meſnage. Or des que le conſente-
<lb/>ment des parties y eſt le mariage eſt
<lb/> arreſté, quoy que de faict il ne ſoit
<label place="margin">Puiſſance maritale
   entre les Romains.</label>
 <lb/> conſommé. Depuis la conſomma-
<lb/>tion du mariage la femme eſt ſoubs
<lb/> la puiſſance du mary, s’il n’eſt eſcla-
<lb/>ue ou enfant de famille : car en ce
<lb/> cas, la femme, qui a eſpouſé vn en-
<lb/>fant de famille, eſt ſous la puiſſance
[...]</p>

In this example the text of the label appears in the right hand margin of the original source, next to the paragraph it describes, but approximately in the middle of it.

If so desired the type attribute may be used to distinguish different categories of label.

<language>

<language> characterizes a single language or sublanguage used within a text. [2.4.2. Language Usage]
Module header
Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
style contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text
Derived from att.global.rendition
Status Required
Datatype data.text
Legal values are:
Language
LanguageType
LanguageArea
ident (identifier) supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global xml:lang attribute.
Status Required
Datatype data.language
Contained by
header: langUsage
May contain
Declaration
element language
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute style { "Language" | "LanguageType" | "LanguageArea" },
   attribute ident { data.language },
   macro.phraseSeq.limited
}
Example
<langUsage>
 <language ident="en-US" usage="75">modern American English</language>
 <language ident="i-az-Arab" usage="20">Azerbaijani in Arabic script</language>
 <language ident="x-lap" usage="05">Pig Latin</language>
</langUsage>
Note

Particularly for sublanguages, an informal prose characterization should be supplied as content for the element.

<langUsage>

<langUsage> (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. Language Usage 2.4. The Profile Description 15.3.2. Declarable Elements]
Module header
Attributes att.global.responsibility (@cert, @resp)
Member of
Contained by
header: profileDesc
May contain
header: language
Declaration
element langUsage
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   language+
}
Example
<langUsage>
 <language ident="fr-CA" usage="60">Québecois</language>
 <language ident="en-CA" usage="20">Canadian business English</language>
 <language ident="en-GB" usage="20">British English</language>
</langUsage>

<list>

<list> contains any sequence of items organized as a list. [3.7. Lists]
Module core
Attributes att.global.responsibility (@cert, @resp) att.typed (type, @subtype)
type List of documents in the corpus with corresponding references.
Derived from att.typed
Status Required
Datatype data.enumerated
Legal values are:
CorpusDocument
Note

Previous versions of these Guidelines recommended the use of type on list to encode the rendering or appearance of a list (whether it was bulleted, numbered, etc.). The current recommendation is to use the rend or style attributes for these aspects of a list, while using type for the more appropriate task of characterizing the nature of the content of a list.

Note

The formal syntax of the element declarations allows label tags to be omitted from lists tagged <list type="gloss">; this is however a semantic error.

Member of
Contained by
May contain
core: item label
Declaration
element list
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   att.typed.attribute.subtype,
   attribute type { "CorpusDocument" },
   (
      ( ( model.divTop ) | ( model.global ) )*,
      (
         ( item, model.global* )+
       | (
            headLabel?,
            headItem?,
            ( label, model.global*, item, model.global* )+
         )
      ),
      ( ( model.divBottom ), model.global* )*
   )
}
Schematron

<sch:rule context="tei:list[@type='gloss']">
<sch:assert test="tei:label">The content of a "gloss" list should include a sequence of one or more pairs of a label element followed by an item element</sch:assert></sch:rule>
Example
<list rend="numbered">
 <item>a butcher</item>
 <item>a baker</item>
 <item>a candlestick maker, with <list rend="bulleted">
   <item>rings on his fingers</item>
   <item>bells on his toes</item>
  </list>
 </item>
</list>
Example
<list rend="bulleted" type="syllogism">
 <item>All Cretans are liars.</item>
 <item>Epimenides is a Cretan.</item>
 <item>ERGO Epimenides is a liar.</item>
</list>
Example
<list rend="simple" type="litany">
 <item>God save us from drought.</item>
 <item>God save us from pestilence.</item>
 <item>God save us from wickedness in high places.</item>
 <item>Praise be to God.</item>
</list>
Example

The following example treats the short numbered clauses of Anglo-Saxon legal codes as lists of items. The text is from an ordinance of King Athelstan (924–939):

<div1 type="section">
 <head>Athelstan's Ordinance</head>
 <list rend="numbered">
  <item n="1">Concerning thieves. First, that no thief is to be spared who is caught with
     the stolen goods, [if he is] over twelve years and [if the value of the goods is] over
     eightpence. <list rend="numbered">
    <item n="1.1">And if anyone does spare one, he is to pay for the thief with his
         wergild — and the thief is to be no nearer a settlement on that account — or to
         clear himself by an oath of that amount.</item>
    <item n="1.2">If, however, he [the thief] wishes to defend himself or to escape, he is
         not to be spared [whether younger or older than twelve].</item>
    <item n="1.3">If a thief is put into prison, he is to be in prison 40 days, and he may
         then be redeemed with 120 shillings; and the kindred are to stand surety for him
         that he will desist for ever.</item>
    <item n="1.4">And if he steals after that, they are to pay for him with his wergild,
         or to bring him back there.</item>
    <item n="1.5">And if he steals after that, they are to pay for him with his wergild,
         whether to the king or to him to whom it rightly belongs; and everyone of those who
         supported him is to pay 120 shillings to the king as a fine.</item>
   </list>
  </item>
  <item n="2">Concerning lordless men. And we pronounced about these lordless men, from whom
     no justice can be obtained, that one should order their kindred to fetch back such a
     person to justice and to find him a lord in public meeting. <list rend="numbered">
    <item n="2.1">And if they then will not, or cannot, produce him on that appointed day,
         he is then to be a fugitive afterwards, and he who encounters him is to strike him
         down as a thief.</item>
    <item n="2.2">And he who harbours him after that, is to pay for him with his wergild
         or to clear himself by an oath of that amount.</item>
   </list>
  </item>
  <item n="3">Concerning the refusal of justice. The lord who refuses justice and upholds
     his guilty man, so that the king is appealed to, is to repay the value of the goods and
     120 shillings to the king; and he who appeals to the king before he demands justice as
     often as he ought, is to pay the same fine as the other would have done, if he had
     refused him justice. <list rend="numbered">
    <item n="3.1">And the lord who is an accessory to a theft by his slave, and it becomes
         known about him, is to forfeit the slave and be liable to his wergild on the first
         occasion; if he does it more often, he is to be liable to pay all that he owns.</item>
    <item n="3.2">And likewise any of the king's treasurers or of our reeves, who has been
         an accessory of thieves who have committed theft, is to be liable to the same.</item>
   </list>
  </item>
  <item n="4">Concerning treachery to a lord. And we have pronounced concerning treachery to
     a lord, that he [who is accused] is to forfeit his life if he cannot deny it or is
     afterwards convicted at the three-fold ordeal.</item>
 </list>
</div1>

Note that nested lists have been used so the tagging mirrors the structure indicated by the two-level numbering of the clauses. The clauses could have been treated as a one-level list with irregular numbering, if desired.

Example
<p>These decrees, most blessed Pope Hadrian, we propounded in the public council ... and they
confirmed them in our hand in your stead with the sign of the Holy Cross, and afterwards
inscribed with a careful pen on the paper of this page, affixing thus the sign of the Holy
Cross. <list rend="simple">
  <item>I, Eanbald, by the grace of God archbishop of the holy church of York, have
     subscribed to the pious and catholic validity of this document with the sign of the Holy
     Cross.</item>
  <item>I, Ælfwold, king of the people across the Humber, consenting have subscribed with
     the sign of the Holy Cross.</item>
  <item>I, Tilberht, prelate of the church of Hexham, rejoicing have subscribed with the
     sign of the Holy Cross.</item>
  <item>I, Higbald, bishop of the church of Lindisfarne, obeying have subscribed with the
     sign of the Holy Cross.</item>
  <item>I, Ethelbert, bishop of Candida Casa, suppliant, have subscribed with the sign of
     the Holy Cross.</item>
  <item>I, Ealdwulf, bishop of the church of Mayo, have subscribed with devout will.</item>
  <item>I, Æthelwine, bishop, have subscribed through delegates.</item>
  <item>I, Sicga, patrician, have subscribed with serene mind with the sign of the Holy
     Cross.</item>
 </list>
</p>
Note

May contain an optional heading followed by a series of items, or a series of label and item pairs, the latter being optionally preceded by one or two specialized headings.

<namespace>

<namespace> supplies the formal name of the namespace to which the elements documented by its children belong. [2.3.4. The Tagging Declaration]
Module header
Attributes att.global (n, xml:lang, xml:base, xml:space, @xml:id) att.global.responsibility (@cert, @resp)
rend (rendition) indicates how the element in question was rendered or presented in the source text.
Derived from att.global.rendition
Status Required
Datatype 1–∞ occurrences of data.word separated by whitespace
Legal values are:
Lexical
Syntactical
Morphological
MarkUp
Graphical
Transcription
Other
Meta
corresp
Status Recommended when applicable
name specifies the full formal name of the namespace concerned.
Status Required
Datatype data.text
Contained by
header: tagsDecl
May contain
header: tagUsage
Declaration
element namespace
{
   att.global.attribute.xmlid,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute rend
   {
      list
      {
         (
            "Lexical"
          | "Syntactical"
          | "Morphological"
          | "MarkUp"
          | "Graphical"
          | "Transcription"
          | "Other"
          | "Meta"
         )+
      }
   },
   attribute corresp { text }?,
   attribute name { data.text },
   tagUsage+
}
Example
<namespace name="http://www.tei-c.org/ns/1.0">
 <tagUsage gi="hi" occurs="28" render="#it"
  withId="2">
Used only to mark English words
   italicized in the copy text </tagUsage>
</namespace>
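
The declaration above makes @rend and @name required and restricts @rend to a closed list of values. The following Python sketch checks a single namespace element against these constraints. It is only an illustration and no substitute for validating against the generated schema: the function name namespace_problems is hypothetical, and the inputs are assumed to carry no TEI namespace declaration.

```python
import xml.etree.ElementTree as ET

# Legal @rend values as listed in the declaration of <namespace>.
LEGAL_REND = {
    "Lexical", "Syntactical", "Morphological", "MarkUp",
    "Graphical", "Transcription", "Other", "Meta",
}


def namespace_problems(namespace_xml: str) -> list:
    """Return human-readable problems found in one <namespace> element."""
    ns = ET.fromstring(namespace_xml)
    problems = []
    if not ns.get("name"):
        problems.append("missing @name")
    rend_values = ns.get("rend", "").split()
    if not rend_values:
        problems.append("missing @rend")
    problems.extend(
        f"illegal @rend value: {value}"
        for value in rend_values
        if value not in LEGAL_REND
    )
    if ns.find("tagUsage") is None:
        problems.append("no <tagUsage> child")
    return problems


print(namespace_problems(
    '<namespace name="pos" rend="Lexical">'
    '<tagUsage gi="ADV">Adverb.</tagUsage></namespace>'
))
```

Under these assumptions, a namespace element without @rend, such as the generic example above, would be reported as incomplete for this customization.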

<normalization>

<normalization> indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
Module header
Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
Member of
Contained by
May contain
core: p
Declaration
element normalization
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   model.pLike+
}
Example
<editorialDecl>
 <normalization method="markup">
  <p>Where both upper- and lower-case i, j, u, v, and vv have been normalized, to modern
     20th century typographical practice, the <gi>choice</gi> element has been used to
     enclose <gi>orig</gi> and <gi>reg</gi> elements giving the original and new values
     respectively. ... </p>
 </normalization>
 <normalization method="silent">
  <p>Spacing between words and following punctuation has been regularized to zero spaces;
     spacing between words has been regularized to one space.</p>
 </normalization>
 <normalization source="http://www.dict.sztaki.hu/webster">
  <p>Spelling converted throughout to Modern American usage, based on Websters 9th
     Collegiate dictionary.</p>
 </normalization>
</editorialDecl>

<orgName>

<orgName> (organization name) contains an organizational name. [13.2.2. Organizational Names]
Module namesdates
Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.typed (subtype, @type)
Member of
Contained by
May contain
Declaration
element orgName
{
   att.global.attribute.n,
   att.typed.attribute.type,
   macro.phraseSeq
}
Example
About a year back, a question of considerable interest was agitated in the <orgName key="PAS1" type="voluntary">
 <placeName key="PEN">Pennsyla.</placeName> Abolition Society
</orgName>....

<p>

<p> (paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents]
Module core
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declaring (@decls) att.fragmentable (@part)
Member of
Contained by
May contain
Declaration
element p
{
   att.global.attributes,
   att.declaring.attributes,
   att.fragmentable.attributes,
   macro.paraContent
}
Example
<p>Hallgerd was outside. <q>There is blood on your axe,</q> she said. <q>What have you
   done?</q>
</p>
<p>
 <q>I have now arranged that you can be married a second time,</q> replied Thjostolf.
</p>
<p>
 <q>Then you must mean that Thorvald is dead,</q> she said.
</p>
<p>
 <q>Yes,</q> said Thjostolf. <q>And now you must think up some plan for me.</q>
</p>

<persName>

<persName> (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc. [13.2.1. Personal Names]
Module namesdates
Member of
Contained by
May contain
Declaration
element persName { macro.phraseSeq }
Example
<persName>
 <forename>Edward</forename>
 <forename>George</forename>
 <surname type="linked">Bulwer-Lytton</surname>, <roleName>Baron Lytton of
 <placeName>Knebworth</placeName>
 </roleName>
</persName>

<profileDesc>

<profileDesc> (text-profile description) contains the langUsage element, which records the language or languages of the document. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components]
Module header
Attributes att.global.responsibility (@cert, @resp)
Member of
Contained by
header: teiHeader
May contain
header: langUsage
Declaration
element profileDesc
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   ( model.profileDescPart* )
}
Example
<profileDesc>
 <langUsage>
  <language ident="de" style="Language">Early New High German, Modern
     German</language>
  <language ident="de" style="LanguageType">UpperGerman</language>
  <language ident="de" style="LanguageArea">Bavarian</language>
 </langUsage>
</profileDesc>
Note

Although the content model permits it, it is rarely meaningful to supply multiple occurrences for any of the child elements of profileDesc unless these are documenting multiple texts.

In earlier versions of these Guidelines, it was required that the <creation> element appear first.

<projectDesc>

<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 15.3.2. Declarable Elements]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declarable (@default)
Member of
Contained by
header: encodingDesc
May contain
core: p
Declaration
element projectDesc
{
   att.global.attributes,
   att.declarable.attributes,
   model.pLike+
}
Example
<projectDesc>
 <p>Texts collected for use in the Claremont Shakespeare Clinic, June 1990</p>
</projectDesc>

<publicationStmt>

<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
Contained by
header: fileDesc
May contain
Declaration
element publicationStmt
{
   att.global.attributes,
   (
      (
         ( model.publicationStmtPart.agency ),
         model.publicationStmtPart.detail*
      )+
    | model.pLike+
   )
}
Example
<publicationStmt>
 <publisher>C. Muquardt </publisher>
 <pubPlace>Bruxelles &amp; Leipzig</pubPlace>
 <date when="1846"/>
</publicationStmt>
Example
<publicationStmt>
 <publisher>Chadwyck Healey</publisher>
 <pubPlace>Cambridge</pubPlace>
 <availability>
  <p>Available under licence only</p>
 </availability>
 <date when="1992">1992</date>
</publicationStmt>
Note

Where a publication statement contains several members of the model.publicationStmtPart classes rather than one or more paragraphs or anonymous blocks, care should be taken to ensure that the repeated elements are presented in a meaningful order. It is a conformance requirement that elements supplying information about publication place, address, identifier, availability, and date be given following the name of the publisher, distributor, or authority concerned, and preferably in that order.
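
In this customization the agency class contains only authority, and the detail class contains availability, date and idno (see model.publicationStmtPart.agency and model.publicationStmtPart.detail). A publication statement that follows the ordering rule above might therefore look like the sketch below. All values are placeholders, and the content allowed inside authority, idno and availability is assumed here rather than taken from their declarations.

```xml
<publicationStmt>
 <!-- Agency element first: required before any detail element. -->
 <authority>Example Institution (placeholder)</authority>
 <!-- Detail elements follow, in the order identifier, availability, date. -->
 <idno>placeholder-identifier</idno>
 <availability>
  <p>Placeholder licence statement.</p>
 </availability>
 <date when="2015">2015</date>
</publicationStmt>
```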

<ref>

<ref> (reference) defines a reference to another location, possibly modified by additional text or comment. [3.6. Simple Links and Cross-References 16.1. Links]
Module core
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.pointing (@targetLang, @target, @evaluate) att.internetMedia (@mimeType) att.typed (@type, @subtype) att.declaring (@decls) att.cReferencing (@cRef)
Member of
Contained by
May contain
Declaration
element ref
{
   att.global.attributes,
   att.pointing.attributes,
   att.internetMedia.attributes,
   att.typed.attributes,
   att.declaring.attributes,
   att.cReferencing.attributes,
   macro.paraContent
}
Schematron

<s:report test="@target and @cRef">Only one of the
attributes @target and @cRef may be supplied on <s:name/></s:report>
Example
See especially <ref target="http://www.natcorp.ox.ac.uk/Texts/A02.xml#s2">the second
sentence</ref>
Example
See also <ref target="#locution">s.v. <term>locution</term>
</ref>.
Note

The target and cRef attributes are mutually exclusive.
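
The following sketch illustrates the constraint. The cRef value is a made-up canonical reference that would only be resolvable if a matching reference scheme were declared in the header. The third case is shown inside a comment because the Schematron rule above would report it.

```xml
<!-- Accepted: the destination is given by @target alone. -->
<ref target="#locution">s.v. locution</ref>

<!-- Accepted: the destination is given by @cRef alone (hypothetical value). -->
<ref cRef="Gen 1.1">Genesis 1:1</ref>

<!-- Reported by the Schematron rule: both attributes supplied at once.
<ref target="#locution" cRef="Gen 1.1">...</ref>
-->
```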

<revisionDesc>

<revisionDesc> (revision description) summarizes the revision history for a file. [2.5. The Revision Description 2.1.1. The TEI Header and Its Components]
Module header
Attributes att.global (xml:id, xml:lang, @n, @xml:base, @xml:space) att.global.rendition (style, rendition, @rend) att.global.responsibility (@cert, @resp)
Contained by
header: teiHeader
May contain
core: list
header: change
Declaration
element revisionDesc
{
   att.global.attribute.n,
   att.global.attribute.xmlbase,
   att.global.attribute.xmlspace,
   att.global.rendition.attribute.rend,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   ( list | listChange | change+ )
}
Example
<revisionDesc status="embargoed">
 <change when="1991-11-11" who="#LB"> deleted chapter 10 </change>
</revisionDesc>
Note

If present on this element, the status attribute should indicate the current status of the document. The same attribute may appear on any change to record the status at the time of that change. Conventionally change elements should be given in reverse date order, with the most recent change at the start of the list.
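
The reverse date order described in the note can be sketched as follows. The dates, the change descriptions and the pointer #ED are hypothetical; who is assumed to point at an identifier defined elsewhere in the header.

```xml
<revisionDesc>
 <!-- Most recent change first. -->
 <change when="2015-06-08" who="#ED">Placeholder: corrected metadata values</change>
 <!-- Older changes follow in descending date order. -->
 <change when="2015-03-01" who="#ED">Placeholder: created header</change>
</revisionDesc>
```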

<segmentation>

<segmentation> describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
Module header
Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
Member of
Contained by
May contain
core: p
Declaration
element segmentation
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   model.pLike+
}
Example
<segmentation>
 <p>
  <gi>s</gi> elements mark orthographic sentences and are numbered sequentially within
   their parent <gi>div</gi> element </p>
</segmentation>
Example
<p>
 <gi>seg</gi> elements are used to mark functional constituents of various types within each
<gi>s</gi>; the typology used is defined by a <gi>taxonomy</gi> element in the corpus
header <gi>classDecl</gi>
</p>

<sourceDesc>

<sourceDesc> (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. The Source Description]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declarable (@default)
Contained by
header: fileDesc
May contain
core: list p
Declaration
element sourceDesc
{
   att.global.attributes,
   att.declarable.attributes,
   (
      model.pLike+
    | ( model.biblLike | model.sourceDescPart | model.listLike )+
   )
}
Example
<sourceDesc>
 <bibl>
  <title level="a">The Interesting story of the Children in the Wood</title>. In
 <author>Victor E Neuberg</author>, <title>The Penny Histories</title>.
 <publisher>OUP</publisher>
  <date>1968</date>. </bibl>
</sourceDesc>
Example
<sourceDesc>
 <p>Born digital: no previous source exists.</p>
</sourceDesc>

<surname>

<surname> contains a family (inherited) name, as opposed to a given, baptismal, or nick name. [13.2.1. Personal Names]
Module namesdates
Attributes att.global.responsibility (@cert, @resp)
Member of
Contained by
May contain
Declaration
element surname
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   macro.phraseSeq
}
Example
<surname type="combine">St John Stevas</surname>

<tagsDecl>

<tagsDecl> (tagging declaration) provides detailed information about the tagging applied to a document. [2.3.4. The Tagging Declaration 2.3. The Encoding Description]
Module header
Attributes att.global.responsibility (@cert, @resp)
partial indicates whether the element types listed exhaustively include all those found within text, or represent only a subset.
Status Recommended
Datatype data.truthValue
Note

TEI recommended practice is to specify this attribute. When the tagUsage elements inside tagsDecl are used to list each of the element types in the associated text, the value should be given as false. When the tagUsage elements inside tagsDecl are used to provide usage information or default renditions for only a subset of the elements types within the associated text, the value should be true.

Member of
Contained by
header: encodingDesc
May contain
header: namespace
Declaration
element tagsDecl
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute partial { data.truthValue }?,
   ( rendition*, namespace* )
}
Example
<tagsDecl>
 <rendition xml:id="rend-it">to be rendered in italic font</rendition>
 <namespace name="http://www.tei-c.org/ns/1.0">
  <tagUsage gi="hi" occurs="467"
   render="#rend-it"/>
  <tagUsage gi="title" occurs="45"
   render="#rend-it"/>
 </namespace>
 <namespace name="http://docbook.org/ns/docbook">
  <tagUsage gi="para" occurs="10"/>
 </namespace>
</tagsDecl>
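
The example above does not set the recommended partial attribute. As a minimal sketch, a declaration that documents only a subset of the elements used in a text would set partial to true; the element name and count below are placeholders.

```xml
<!-- partial="true": the tagUsage list covers only some of the element types in the text. -->
<tagsDecl partial="true">
 <namespace name="http://www.tei-c.org/ns/1.0">
  <tagUsage gi="p" occurs="120"/>
 </namespace>
</tagsDecl>
```

With partial="false", the same list would claim that p is the only element type occurring in the text.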

<tagUsage>

<tagUsage> supplies information about the usage of a specific element within a text. [2.3.4. The Tagging Declaration]
Module header
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
gi (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent namespace element.
Status Required
Datatype data.text
occurs specifies the number of occurrences of this element within the text.
Status Recommended
Datatype data.count
withId (with unique identifier) specifies the number of occurrences of this element within the text which bear a distinct value for the global xml:id attribute.
Status Recommended
Datatype data.count
render specifies the identifier of a <rendition> element which defines how this element was rendered in the source text.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Contained by
header: namespace
May contain
Declaration
element tagUsage
{
   att.global.attributes,
   attribute gi { data.text },
   attribute occurs { data.count }?,
   attribute withId { data.count }?,
   attribute render { list { data.pointer+ } }?,
   macro.limitedContent
}
Example
<tagsDecl>
 <rendition xml:id="it">Render using a slant or italic variant on the current font</rendition>
<!-- ... -->
 <namespace name="http://www.tei-c.org/ns/1.0">
  <tagUsage gi="hi" occurs="28" render="#it"
   withId="2">
Used to mark English words
     italicized in the copy text.</tagUsage>
  <tagUsage gi="foreign" render="#it">Used to mark non-English words in the copy text.</tagUsage>
<!-- ... -->
 </namespace>
</tagsDecl>

<TEI>

<TEI> (TEI document) contains a single TEI-conformant document, containing a single TEI header, a single text, one or more members of the model.resourceLike class, or a combination of these. A series of TEI elements may be combined together to form a <teiCorpus> element. [4. Default Text Structure 15.1. Varieties of Composite Text]
Module textstructure
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp))
version specifies the major version number of the TEI Guidelines against which this document is valid.
Status Optional
Datatype data.version
Note

The major version number is historically prefixed by a P (for Proposal), and is distinct from the version number used for individual releases of the Guidelines, as used by (for example) the source of the <schemaSpec> element. The current version is P5.

Contained by
May contain
header: teiHeader
textstructure: text
Declaration
element TEI
{
   att.global.attributes,
   attribute version { data.version }?,
   ( teiHeader, ( ( model.resourceLike+, text? ) | text ) )
}
Schematron

<s:ns prefix="tei"
 uri="http://www.tei-c.org/ns/1.0"/>

<s:ns prefix="xs"
 uri="http://www.w3.org/2001/XMLSchema"/>
Schematron

<s:ns prefix="rng"
 uri="http://relaxng.org/ns/structure/1.0"/>
Example
<TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
  <fileDesc>
   <titleStmt>
    <title>The shortest TEI Document Imaginable</title>
   </titleStmt>
   <publicationStmt>
    <p>First published as part of TEI P2, this is the P5
         version using a name space.</p>
   </publicationStmt>
   <sourceDesc>
    <p>No source: this is an original work.</p>
   </sourceDesc>
  </fileDesc>
 </teiHeader>
 <text>
  <body>
   <p>This is about the shortest TEI document imaginable.</p>
  </body>
 </text>
</TEI>
Example
<TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
  <fileDesc>
   <titleStmt>
    <title>A TEI Document containing four page images </title>
   </titleStmt>
   <publicationStmt>
    <p>Unpublished demonstration file.</p>
   </publicationStmt>
   <sourceDesc>
    <p>No source: this is an original work.</p>
   </sourceDesc>
  </fileDesc>
 </teiHeader>
 <facsimile>
  <graphic url="page1.png"/>
  <graphic url="page2.png"/>
  <graphic url="page3.png"/>
  <graphic url="page4.png"/>
 </facsimile>
</TEI>
Note

This element is required. It is customary to specify the TEI namespace http://www.tei-c.org/ns/1.0 on it, using the xmlns attribute.

<teiHeader>

<teiHeader> (TEI header) supplies the descriptive and declarative information making up an electronic title page for every TEI-conformant document. [2.1.1. The TEI Header and Its Components 15.1. Varieties of Composite Text]
Module header
Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp)
type (You must identify the module of the LAUDATIO metadata schema, e.g. CorpusHeader, DocumentHeader, PreparationHeader) specifies the kind of document to which the header is attached, for example whether it is a corpus or individual text.
Status Required
Datatype data.enumerated
Legal values are:
CorpusHeader
DocumentHeader
PreparationHeader
Contained by
textstructure: TEI
May contain
Declaration
element teiHeader
{
   att.global.attribute.n,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute type { "CorpusHeader" | "DocumentHeader" | "PreparationHeader" },
   ( fileDesc, model.teiHeaderPart*, revisionDesc? )
}
Example
<teiHeader>
 <fileDesc>
  <titleStmt>
   <title>Shakespeare: the first folio (1623) in electronic form</title>
   <author>Shakespeare, William (1564–1616)</author>
   <respStmt>
    <resp>Originally prepared by</resp>
    <name>Trevor Howard-Hill</name>
   </respStmt>
   <respStmt>
    <resp>Revised and edited by</resp>
    <name>Christine Avern-Carr</name>
   </respStmt>
  </titleStmt>
  <publicationStmt>
   <distributor>Oxford Text Archive</distributor>
   <address>
    <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>
   </address>
   <idno type="OTA">119</idno>
   <availability>
    <p>Freely available on a non-commercial basis.</p>
   </availability>
   <date when="1968">1968</date>
  </publicationStmt>
  <sourceDesc>
   <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,
       1968)</bibl>
  </sourceDesc>
 </fileDesc>
 <encodingDesc>
  <projectDesc>
   <p>Originally prepared for use in the production of a series of old-spelling
       concordances in 1968, this text was extensively checked and revised for use during the
       editing of the new Oxford Shakespeare (Wells and Taylor, 1989).</p>
  </projectDesc>
  <editorialDecl>
   <correction>
    <p>Turned letters are silently corrected.</p>
   </correction>
   <normalization>
    <p>Original spelling and typography is retained, except that long s and ligatured
         forms are not encoded.</p>
   </normalization>
  </editorialDecl>
  <refsDecl xml:id="ASLREF">
   <cRefPattern matchPattern="(\S+) ([^.]+)\.(.*)"
    replacementPattern="#xpath(//div1[@n='$1']/div2/[@n='$2']//lb[@n='$3'])">

    <p>A reference is created by assembling the following, in the reverse order as that
         listed here: <list>
      <item>the <att>n</att> value of the preceding <gi>lb</gi>
      </item>
      <item>a period</item>
      <item>the <att>n</att> value of the ancestor <gi>div2</gi>
      </item>
      <item>a space</item>
      <item>the <att>n</att> value of the parent <gi>div1</gi>
      </item>
     </list>
    </p>
   </cRefPattern>
  </refsDecl>
 </encodingDesc>
 <revisionDesc>
  <list>
   <item>
    <date when="1989-04-12">12 Apr 89</date> Last checked by CAC</item>
   <item>
    <date when="1989-03-01">1 Mar 89</date> LB made new file</item>
  </list>
 </revisionDesc>
</teiHeader>
Note

One of the few elements unconditionally required in any TEI document.
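
The example above is the generic one from the TEI Guidelines and does not carry the type attribute that this customization requires. A minimal sketch of a corpus header with the required attribute is given below. The title, authority and source description are placeholders, and the content of fileDesc is abbreviated: further children may be required by the fileDesc declaration of this schema.

```xml
<!-- type is required here; legal values are CorpusHeader, DocumentHeader, PreparationHeader. -->
<teiHeader type="CorpusHeader">
 <fileDesc>
  <titleStmt>
   <title type="Corpus">Example Historical Corpus (placeholder)</title>
  </titleStmt>
  <publicationStmt>
   <authority>Example Institution (placeholder)</authority>
  </publicationStmt>
  <sourceDesc>
   <p>Placeholder source description.</p>
  </sourceDesc>
 </fileDesc>
</teiHeader>
```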

<text>

<text> contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 15.1. Varieties of Composite Text]
Module textstructure
Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declaring (@decls) att.typed (@type, @subtype)
Contained by
textstructure: TEI
May contain Empty element
Declaration
element text
{
   att.global.attributes,
   att.declaring.attributes,
   att.typed.attributes,
   (
      model.global*,
      ( front, model.global* )?,
      ( body | group ),
      model.global*,
      ( back, model.global* )?
   )
}
Example
<text>
 <front>
  <docTitle>
   <titlePart>Autumn Haze</titlePart>
  </docTitle>
 </front>
 <body>
  <l>Is it a dragonfly or a maple leaf</l>
  <l>That settles softly down upon the water?</l>
 </body>
</text>
Example

The body of a text may be replaced by a group of nested texts, as in the following schematic:

<text>
 <front>
<!-- front matter for the whole group -->
 </front>
 <group>
  <text>
<!-- first text -->
  </text>
  <text>
<!-- second text -->
  </text>
 </group>
</text>
Note

This element should not be used to represent a text which is inserted at an arbitrary point within the structure of another, for example as in an embedded or quoted narrative; the <floatingText> is provided for this purpose.

<title>

<title> contains a title for any kind of work. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement]
Module core
Attributes att.global (xml:id, n, xml:lang, @xml:base, @xml:space) att.global.rendition (style, rendition, @rend) att.global.responsibility (@cert, @resp)
type (You must specify the exact reference to LAUDATIO metadata module schema 'Corpus'.) classifies the title according to some convenient typology.
Derived from att.typed
Status Recommended
Datatype data.enumerated
Legal values are:
Corpus
(The title only refers to the LAUDATIO metadata object 'Corpus')
Note

This attribute is provided for convenience in analysing titles and processing them according to their type; where such specialized processing is not necessary, there is no need for such analysis, and the entire title, including subtitles and any parallel titles, may be enclosed within a single title element.

Member of
Contained by
May contain
Declaration
element title
{
   att.global.attribute.xmlbase,
   att.global.attribute.xmlspace,
   att.global.rendition.attribute.rend,
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   attribute type { "Corpus" }?,
   macro.paraContent
}
Example
<title>Information Technology and the Research Process: Proceedings of
a conference held at Cranfield Institute of Technology, UK,
18–21 July 1989</title>
Example
<title>Hardy's Tess of the D'Urbervilles: a machine readable
edition</title>
Example
<title type="full">
 <title type="main">Synthèse</title>
 <title type="sub">an international journal for
   epistemology, methodology and history of
   science</title>
</title>
Note

The attributes key and ref, inherited from the class att.canonical may be used to indicate the canonical form for the title; the former, by supplying (for example) the identifier of a record in some external library system; the latter by pointing to an XML element somewhere containing the canonical form of the title.
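
The examples above come from the TEI Guidelines and use type values such as full, main and sub, which the declaration in this customization does not allow. Under that declaration, a corpus title would presumably be encoded as in the following sketch; the title text is a placeholder.

```xml
<!-- "Corpus" is the only type value permitted by the declaration of title in this schema. -->
<title type="Corpus">Example Historical Corpus (placeholder)</title>
```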

<titleStmt>

<titleStmt> (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description]
Module header
Attributes att.global.responsibility (@cert, @resp)
Contained by
header: fileDesc
May contain
Declaration
element titleStmt
{
   att.global.responsibility.attribute.cert,
   att.global.responsibility.attribute.resp,
   ( title+, model.respLike* )
}
Example
<titleStmt>
 <title>Capgrave's Life of St. John Norbert: a machine-readable transcription</title>
 <respStmt>
  <resp>compiled by</resp>
  <name>P.J. Lucas</name>
 </respStmt>
</titleStmt>

Schema teiODD_LAUDATIOCorpus_S7: Model classes

model.addressLike

model.addressLike groups elements used to represent a postal or email address. [1. The TEI Infrastructure]
Module tei
Used by
Members affiliation

model.applicationLike

model.applicationLike groups elements used to record application-specific information about a document in its header.
Module tei
Used by
Members application

model.dateLike

model.dateLike groups elements containing temporal expressions. [3.5.4. Dates and Times 13.3.6. Dates and Times]
Module tei
Used by
Members date

model.divBottom

model.divBottom groups elements appearing at the end of a text division. [4.2. Elements Common to All Divisions]
Module tei
Used by
Members model.divBottomPart model.divWrapper

model.divPart

model.divPart groups paragraph-level elements appearing directly within divisions. [1.3. The TEI Class System]
Module tei
Used by
Members model.lLike model.pLike [p]
Note

Note that this element class does not include members of the model.inter class, which can appear either within or between paragraph-level items.

model.divTop

model.divTop groups elements appearing at the beginning of a text division. [4.2. Elements Common to All Divisions]
Module tei
Used by
Members model.divTopPart [model.headLike] model.divWrapper

model.divTopPart

model.divTopPart groups elements which can occur only at the beginning of a text division. [4.6. Title Pages]
Module tei
Used by
Members model.headLike

model.editorialDeclPart

model.editorialDeclPart groups elements which may be used inside editorialDecl and appear multiple times.
Module tei
Used by
Members normalization segmentation

model.emphLike

model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. [3.3. Highlighting and Quotation]
Module tei
Used by
Members title

model.encodingDescPart

model.encodingDescPart groups elements which may be used inside encodingDesc and appear multiple times.
Module tei
Used by
Members appInfo editorialDecl projectDesc tagsDecl

model.global

model.global groups elements which may appear at any point within a TEI text. [1.3. The TEI Class System]
Module tei
Used by
Members model.global.edit model.global.meta model.milestoneLike model.noteLike

model.highlighted

model.highlighted groups phrase-level elements which are typographically distinct. [3.3. Highlighting and Quotation]
Module tei
Used by
Members model.emphLike [title] model.hiLike

model.inter

model.inter groups elements which can appear either within or between paragraph-like elements. [1.3. The TEI Class System]
Module tei
Used by
Members model.biblLike model.egLike model.labelLike [label] model.listLike [list] model.oddDecl model.qLike [model.quoteLike] model.stageLike

model.labelLike

model.labelLike groups elements used to gloss or explain other parts of a document.
Module tei
Used by
Members label

model.limitedPhrase

model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. [1.3. The TEI Class System]
Module tei
Used by
Members model.emphLike [title] model.hiLike model.pPart.data [model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno]] model.pPart.editorial model.pPart.msdesc model.phrase.xml model.ptrLike [ref]

model.listLike

model.listLike groups list-like elements. [3.7. Lists]
Module tei
Used by
Members list

model.nameLike

model.nameLike groups elements which name or refer to a person, place, or organization.
Module tei
Used by
Members model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno
Note

A superset of the naming elements that may appear in datelines, addresses, statements of responsibility, etc.

model.nameLike.agent

model.nameLike.agent groups elements which contain names of individuals or corporate bodies. [3.5. Names, Numbers, Dates, Abbreviations, and Addresses]
Module tei
Used by
Members orgName persName
Note

This class is used in the content model of elements which reference names of people or organizations.

model.persNamePart

model.persNamePart groups elements which form part of a personal name. [13.2.1. Personal Names]
Module namesdates
Used by
Members forename surname

model.phrase

model.phrase groups elements which can occur at the level of individual words or phrases. [1.3. The TEI Class System]
Module tei
Used by
Members model.graphicLike model.highlighted [model.emphLike [title] model.hiLike] model.lPart model.pPart.data [model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno]] model.pPart.edit [model.pPart.editorial model.pPart.transcriptional] model.pPart.msdesc model.phrase.xml model.ptrLike [ref] model.segLike model.specDescLike
Note

This class of elements can occur within paragraphs, list items, lines of verse, etc.

model.placeStateLike

model.placeStateLike groups elements which describe changing states of a place.
Module tei
Used by
Members model.placeNamePart

model.pLike

model.pLike groups paragraph-like elements.
Module tei
Used by
Members p

model.pPart.data

model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data. [3.5. Names, Numbers, Dates, Abbreviations, and Addresses]
Module tei
Used by
Members model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno]

model.pPart.edit

model.pPart.edit groups phrase-level elements for simple editorial correction and transcription. [3.4. Simple Editorial Changes]
Module tei
Used by
Members model.pPart.editorial model.pPart.transcriptional

model.profileDescPart

model.profileDescPart groups elements which may be used inside profileDesc and appear multiple times.
Module tei
Used by
Members langUsage

model.ptrLike

model.ptrLike groups elements used for purposes of location and reference. [3.6. Simple Links and Cross-References]
Module tei
Used by
Members ref

model.publicationStmtPart.agency

model.publicationStmtPart.agency groups the child elements of a publicationStmt element of the TEI header that indicate an authorising agent. [2.2.4. Publication, Distribution, Licensing, etc.]
Module tei
Used by
Members authority
Note

The ‘agency’ child elements, while not required, are required if one of the ‘detail’ child elements is to be used. It is not valid to have a ‘detail’ child element without a preceding ‘agency’ child element.

See also model.publicationStmtPart.detail.

model.publicationStmtPart.detail

model.publicationStmtPart.detail groups the agency-specific child elements of the publicationStmt element of the TEI header. [2.2.4. Publication, Distribution, Licensing, etc.]
Module tei
Used by
Members availability date idno
Note

A ‘detail’ child element may not occur unless an ‘agency’ child element precedes it.

See also model.publicationStmtPart.agency.

model.qLike

model.qLike groups elements related to highlighting which can appear either within or between chunk-level elements. [3.3. Highlighting and Quotation]
Module tei
Used by
Members model.quoteLike

model.respLike

model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element.
Module tei
Used by
Members author editor

model.teiHeaderPart

model.teiHeaderPart groups high level elements which may appear more than once in a TEI header.
Module tei
Used by
Members encodingDesc profileDesc

Schema teiODD_LAUDATIOCorpus_S7: Attribute classes

att.ascribed

att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual. [3.3.3. Quotation 8.3. Elements Unique to Spoken Texts]
Module tei
Members change
Attributes
who indicates the person, or group of people, to whom the element content is ascribed.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace

In the following example from Hamlet, speeches (<sp>) in the body of the play are linked to <castItem> elements in the <castList> using the who attribute.

<castItem type="role">
 <role xml:id="Barnardo">Bernardo</role>
</castItem>
<castItem type="role">
 <role xml:id="Francisco">Francisco</role>
 <roleDesc>a soldier</roleDesc>
</castItem>
<!-- ... -->
<sp who="#Barnardo">
 <speaker>Bernardo</speaker>
 <l n="1">Who's there?</l>
</sp>
<sp who="#Francisco">
 <speaker>Francisco</speaker>
 <l n="2">Nay, answer me: stand, and unfold yourself.</l>
</sp>
Note

For transcribed speech, this will typically identify a participant or participant group; in other contexts, it will point to any identified <person> element.

att.canonical

att.canonical provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced.
Module tei
Members att.naming [att.personal]
Attributes
key provides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind.
Status Optional
Datatype data.text
<author>
 <name key="name 427308"
  type="organisation">
[New Zealand Parliament, Legislative Council]</name>
</author>
<author>
 <name key="Hugo, Victor (1802-1885)"
  ref="http://www.idref.fr/026927608">
Victor Hugo</name>
</author>
Note

The value may be a unique identifier from a database, or any other externally-defined string identifying the referent.

No particular syntax is proposed for the values of the key attribute, since its form will depend entirely on practice within a given project. For the same reason, this attribute is not recommended in data interchange, since there is no way of ensuring that the values used by one project are distinct from those used by another. In such a situation, a preferable approach for magic tokens which follows standard practice on the Web is to use a ref attribute whose value is a tag URI as defined in RFC 4151.

ref (reference) provides an explicit means of locating a full definition for the entity being named by means of one or more URIs.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
<name ref="http://viaf.org/viaf/109557338"
 type="person">
Seamus Heaney</name>
Note

The value must point directly to one or more XML elements or other resources by means of one or more URIs, separated by whitespace. If more than one is supplied the implication is that the name identifies several distinct entities.

att.cReferencing

att.cReferencing provides an attribute which may be used to supply a canonical reference as a means of identifying the target of a pointer.
Module tei
Members ref
Attributes
cRef (canonical reference) specifies the destination of the pointer by supplying a canonical reference expressed using the scheme defined in a <refsDecl> element in the TEI header.
Status Optional
Datatype data.text
Note

The value of cRef should be constructed so that, when the algorithm for the resolution of canonical references (described in section 16.2.5. Canonical References) is applied to it, the result is a valid URI reference to the intended target.

The <refsDecl> to use may be indicated with the decls attribute.

Currently these Guidelines only provide for a single canonical reference to be encoded on any given <ptr> element.

att.datable

att.datable provides attributes for normalization of elements that contain dates, times, or datable events. [3.5.4. Dates and Times 13.3.6. Dates and Times]
Module tei
Members application change
Attributes att.datable.w3c (@when, @notBefore, @notAfter, @from, @to) att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso) att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)
calendar indicates the system or calendar to which the date represented by the content of this element belongs.
Status Optional
Datatype data.pointer
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length(.) gt 0">@calendar indicates the system or calendar to which the date represented by the content of this element
belongs, but this <sch:name/> element has no textual content.</sch:assert></sch:rule>
He was born on <date calendar="#Gregorian">Feb. 22, 1732</date>
(<date calendar="#Julian" when="1732-02-22"> Feb. 11, 1731/32, O.S.</date>).
period supplies a pointer to some location defining a named period of time within which the datable item is understood to have occurred.
Status Optional
Datatype data.pointer
Note

This ‘superclass’ provides attributes that can be used to provide normalized values of temporal information. By default, the attributes from the att.datable.w3c class are provided. If the module for names & dates is loaded, this class also provides attributes from the att.datable.iso and att.datable.custom classes. In general, the possible values of attributes restricted to the W3C datatypes form a subset of those values available via the ISO 8601 standard. However, the greater expressiveness of the ISO datatypes may not be needed, and there exists much greater software support for the W3C datatypes.
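
The Schematron rule given for calendar above requires that an element carrying that attribute has textual content. A rough Python equivalent is sketched below as an illustration; it is not the validation mechanism used by TEI tools. The function name calendar_violations is an assumption, and the TEI namespace is ignored for brevity.

```python
import xml.etree.ElementTree as ET


def calendar_violations(root):
    """Return elements that carry @calendar but whose string value is empty."""
    return [
        el
        for el in root.iter()
        if el.get("calendar") is not None and len("".join(el.itertext())) == 0
    ]


# The second <date> is deliberately left empty to trigger the check.
doc = ET.fromstring(
    '<p>He was born on <date calendar="#Gregorian">Feb. 22, 1732</date> '
    '(<date calendar="#Julian" when="1732-02-22"/>).</p>'
)
bad = calendar_violations(doc)
print([el.attrib for el in bad])
```

Like the Schematron test string-length(.) gt 0, the sketch counts the text of descendant elements and does not strip whitespace.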

att.datable.custom

att.datable.custom provides attributes for normalization of elements that contain datable events to a custom dating system (i.e. other than the Gregorian used by W3 and ISO). [13.3.6. Dates and Times]
Module namesdates
Members att.datable [application change]
Attributes
when-custom supplies the value of a date or time in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace

The following are examples of custom date or time formats that are not valid ISO or W3C format normalizations, normalized to a different dating system

<p>Alhazen died in Cairo on the
<date when="1040-03-06"
  when-custom="431-06-12">
12th day of Jumada t-Tania, 430 AH
 </date>.</p>
<p>The current world will end at the
<date when="2012-12-21"
  when-custom="13.0.0.0.0">
end of B'ak'tun 13</date>.</p>
<p>The Battle of Meggidu
(<date when-custom="Thutmose_III:23">23rd year of reign of Thutmose III</date>).</p>
<p>Esidorus bixit in pace annos LXX plus minus sub
<date when-custom="Ind:4-10-11">die XI mensis Octobris indictione IIII</date>
</p>

Not all custom date formulations will have Gregorian equivalents.

The when-custom attribute and the other custom dating attributes are not constrained to a datatype by the TEI, but individual projects are recommended to regularize and document their dating formats.

notBefore-custom specifies the earliest possible date for the event in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace
notAfter-custom specifies the latest possible date for the event in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace
from-custom indicates the starting point of the period in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace
<event datingMethod="#julian"
 from-custom="1666-09-02" to-custom="1666-09-05" xml:id="FIRE1">
 <head>The Great Fire of London</head>
 <p>The Great Fire of London burned through a large part
   of the city of London.</p>
</event>
to-custom indicates the ending point of the period in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace
datingPoint supplies a pointer to some location defining a named point in time with reference to which the datable item is understood to have occurred.
Status Optional
Datatype data.pointer
datingMethod supplies a pointer to a <calendar> element or other means of interpreting the values of the custom dating attributes.
Status Optional
Datatype data.pointer
Contayning the Originall, Antiquity, Increaſe, Moderne
eſtate, and deſcription of that Citie, written in the yeare
<date calendar="#julian"
 datingMethod="#julian" when-custom="1598">
1598</date>. by Iohn Stow
Citizen of London.
Note

Here the calendar attribute points to a <calendar> element for the Julian calendar, specifying that the text content of the date element is a Julian date, and the datingMethod attribute also points to the Julian calendar to indicate that the content of the when-custom attribute value is Julian too.

att.datable.iso

att.datable.iso provides attributes for normalization of elements that contain datable events using the ISO 8601 standard. [3.5.4. Dates and Times 13.3.6. Dates and Times]
Module namesdates
Members att.datable [application change]
Attributes
when-iso supplies the value of a date or time in a standard form.
Status Optional
Datatype data.temporal.iso

The following are examples of ISO date, time, and date & time formats that are not valid W3C format normalizations.

<date when-iso="1996-09-24T07:25+00">Sept. 24th, 1996 at 3:25 in the morning</date>
<date when-iso="1996-09-24T03:25-04">Sept. 24th, 1996 at 3:25 in the morning</date>
<time when-iso="1999-01-04T20:42-05">4 Jan 1999 at 8:42 pm</time>
<time when-iso="1999-W01-1T20,70-05">4 Jan 1999 at 8:42 pm</time>
<date when-iso="2006-05-18T10:03">a few minutes after ten in the morning on Thu 18 May</date>
<time when-iso="03:00">3 A.M.</time>
<time when-iso="14">around two</time>
<time when-iso="15,5">half past three</time>

All of the examples of the when attribute in the att.datable.w3c class are also valid with respect to this attribute.

He likes to be punctual. I said <q>
 <time when-iso="12">around noon</time>
</q>, and he showed up at <time when-iso="12:00:00">12 O'clock</time> on the dot.

The second occurrence of <time> could have been encoded with the when attribute, as 12:00:00 is a valid time with respect to the W3C XML Schema Part 2: Datatypes Second Edition specification. The first occurrence could not.

notBefore-iso specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.iso
notAfter-iso specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.iso
from-iso indicates the starting point of the period in standard form.
Status Optional
Datatype data.temporal.iso
to-iso indicates the ending point of the period in standard form.
Status Optional
Datatype data.temporal.iso
Note

The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by ISO 8601, using the Gregorian calendar.

Note
If both when-iso and dur-iso are specified, the values should be interpreted as indicating a span of time by its starting time (or date) and duration. That is,
<date dur-iso="P8D" when-iso="2007-06-01"/>
indicates the same time period as
<date when-iso="2007-06-01/P8D"/>
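
The equivalence of the two encodings can be illustrated with a short Python sketch. It makes several assumptions: the function names are hypothetical, only day-only durations such as P8D and full calendar dates are handled, and the end of the span is taken to be the start plus the duration.

```python
import re
from datetime import date, timedelta


def span_from_start_and_days(when_iso, dur_iso):
    """Return (start, end) for a calendar date plus a day-only ISO 8601 duration."""
    match = re.fullmatch(r"P(\d+)D", dur_iso)
    if match is None:
        raise ValueError("this sketch only handles day-only durations such as 'P8D'")
    start = date.fromisoformat(when_iso)
    return start, start + timedelta(days=int(match.group(1)))


def split_interval(value):
    """Split the combined 'start/duration' form into its two parts."""
    start, duration = value.split("/", 1)
    return start, duration


a = span_from_start_and_days("2007-06-01", "P8D")
b = span_from_start_and_days(*split_interval("2007-06-01/P8D"))
print(a == b, a[1].isoformat())
```

Both calls yield the same pair of dates, which is the sense in which the two attribute patterns describe the same period.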

In providing a ‘regularized’ form, no claim is made that the form in the source text is incorrect; the regularized form is simply that chosen as the main form for purposes of unifying variant forms under a single heading.

att.datable.w3c

att.datable.w3c provides attributes for normalization of elements that contain datable events conforming to the W3C XML Schema Part 2: Datatypes Second Edition. [3.5.4. Dates and Times 13.3.6. Dates and Times]
Module tei
Members att.datable [application change]
Attributes
when supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.w3c

Examples of W3C date, time, and date & time formats.

<p>
 <date when="1945-10-24">24 Oct 45</date>
 <date when="1996-09-24T07:25:00Z">September 24th, 1996 at 3:25 in the morning</date>
 <time when="1999-01-04T20:42:00-05:00">Jan 4 1999 at 8 pm</time>
 <time when="14:12:38">fourteen twelve and 38 seconds</time>
 <date when="1962-10">October of 1962</date>
 <date when="--06-12">June 12th</date>
 <date when="---01">the first of the month</date>
 <date when="--08">August</date>
 <date when="2006">MMVI</date>
 <date when="0056">AD 56</date>
 <date when="-0056">56 BC</date>
</p>
This list begins in
the year 1632, more precisely on Trinity Sunday, i.e. the Sunday after
Pentecost, in that year the <date calendar="#Julian" when="1632-06-06">27th of May (old style)</date>.
<opener>
 <dateline>
  <placeName>Dorchester, Village,</placeName>
  <date when="1828-03-02">March 2d. 1828.</date>
 </dateline>
 <salute>To
   Mrs. Cornell,</salute> Sunday <time when="12:00:00">noon.</time>
</opener>
notBefore specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.w3c
notAfter specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.w3c
from indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.w3c
to indicates the ending point of the period in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype data.temporal.w3c
Example
<date from="1863-05-28" to="1863-06-01">28 May through 1 June 1863</date>
Note

The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by XML Schema Part 2: Datatypes Second Edition, using the Gregorian calendar.

The most commonly-encountered format for the date portion of a temporal attribute is yyyy-mm-dd, but yyyy, --mm, ---dd, yyyy-mm, or --mm-dd may also be used. For the time part, the form hh:mm:ss is used.

Note that this format does not currently permit use of the value 0000 to represent the year 1 BCE; instead the value -0001 should be used.
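
The date forms listed in this note can be recognized with a regular expression. The Python sketch below is an approximation under stated assumptions: it checks only the shape of the date portion (not month or day ranges, times, or time zone suffixes), and the names DATE_FORMS and looks_like_w3c_date are hypothetical.

```python
import re

DATE_FORMS = re.compile(
    r"""
    (?: -?\d{4}(?:-\d{2}(?:-\d{2})?)?   # yyyy, yyyy-mm, yyyy-mm-dd (optional leading minus)
      | --\d{2}(?:-\d{2})?              # --mm, --mm-dd
      | ---\d{2}                        # ---dd
    )
    """,
    re.VERBOSE,
)


def looks_like_w3c_date(value):
    """Shape check for the date forms named in the note; rejects the year 0000."""
    if DATE_FORMS.fullmatch(value) is None:
        return False
    year = re.match(r"-?(\d{4})", value)
    return not (year is not None and year.group(1) == "0000")


for candidate in ["1945-10-24", "1962-10", "--06-12", "---01", "--08", "-0056", "0000"]:
    print(candidate, looks_like_w3c_date(candidate))
```

A production validator would instead rely on an XML Schema datatype library, which also enforces value ranges.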

att.declarable

att.declarable provides attributes for those elements in the TEI header which may be independently selected by means of the special purpose decls attribute. [15.3. Associating Contextual Information with a Text]
Module tei
Members projectDesc sourceDesc
Attributes
default indicates whether or not this element is selected by default when its parent is selected.
Status Optional
Datatype data.truthValue
Legal values are:
true
This element is selected if its parent is selected
false
This element can only be selected explicitly, unless it is the only one of its kind, in which case it is selected if its parent is selected. [Default]
Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text. Only one element of a particular type may have a default attribute with a value of true.

att.declaring

att.declaring provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. [15.3. Associating Contextual Information with a Text]
Module tei
Members p ref text
Attributes
decls identifies one or more declarable elements within the header, which are understood to apply to the element bearing this attribute and its content.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text.

att.dimensions

att.dimensions provides attributes for describing the size of physical objects.
Module tei
Members att.editLike
Attributes att.ranging (@atLeast, @atMost, @min, @max, @confidence)
unit names the unit used for the measurement. Suggested values include: 1] cm (centimetres); 2] mm (millimetres); 3] in (inches); 4] lines; 5] chars (characters)
Status Optional
Datatype data.enumerated
Suggested values include:
cm
(centimetres)
mm
(millimetres)
in
(inches)
lines
lines of text
chars
(characters) characters of text
quantity specifies the length in the units specified.
Status Optional
Datatype data.numeric
extent indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.
Status Optional
Datatype data.text
<gap extent="5 words"/>
<height extent="half the page"/>
precision characterizes the precision of the values specified by the other attributes.
Status Optional
Datatype data.certainty
scope where the measurement summarizes more than one observation, specifies the applicability of this measurement. Sample values include: 1] all; 2] most; 3] range
Status Optional
Datatype data.enumerated
Sample values include:
all
measurement applies to all instances.
most
measurement applies to most of the instances inspected.
range
measurement applies to only the specified range of instances.

att.docStatus

att.docStatus provides attributes for use on metadata elements describing the status of a document.
Module tei
Members change
Attributes
status describes the status of a document either currently or, when associated with a dated element, at the time indicated. Sample values include: 1] approved; 2] candidate; 3] cleared; 4] deprecated; 5] draft; 6] embargoed; 7] expired; 8] frozen; 9] galley; 10] proposed; 11] published; 12] recommendation; 13] submitted; 14] unfinished; 15] withdrawn
Status Optional
Datatype data.enumerated
Sample values include:
approved
candidate
cleared
deprecated
draft
[Default]
embargoed
expired
frozen
galley
proposed
published
recommendation
submitted
unfinished
withdrawn
Example
<revisionDesc status="published">
 <change status="published"
  when="2010-10-21"/>
 <change status="cleared" when="2010-10-02"/>
 <change status="embargoed"
  when="2010-08-02"/>
 <change status="frozen" when="2010-05-01"
  who="#MSM"/>
 <change status="draft" when="2010-03-01"
  who="#LB"/>
</revisionDesc>

att.fragmentable

att.fragmentable groups structural elements which may be fragmented, usually as a consequence of some overlapping hierarchy.
Module tei
Members p
Attributes
part specifies whether or not its parent element is fragmented in some way, typically by some other overlapping structure: for example a speech which is divided between two or more verse stanzas, a paragraph which is split across a page division, a verse line which is divided between two speakers.
Status Optional
Datatype data.enumerated
Legal values are:
Y
(yes) the element is fragmented in some (unspecified) respect
N
(no) either the element is not fragmented, or no claim is made as to its completeness. [Default]
I
(initial) this is the initial part of a fragmented element
M
(medial) this is a medial part of a fragmented element
F
(final) this is the final part of a fragmented element
Note

The values I, M, or F should be used only where it is clear how the element may be reconstituted.

att.global

att.global provides attributes common to all elements in the TEI encoding scheme. [1.3.1.1. Global Attributes]
Module tei
Members TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text
Attributes att.global.rendition (@rend, @style, @rendition) att.global.responsibility (@cert, @resp)
xml:id (identifier) provides a unique identifier for the element bearing the attribute.
Status Optional
Datatype xsd:ID
Note

The xml:id attribute may be used to specify a canonical reference for an element; see section 3.10. Reference Systems.

n (number) gives a number (or other label) for an element, which is not necessarily unique within the document.
Status Optional
Datatype data.text
Note

The value of this attribute is always understood to be a single token, even if it contains space or other punctuation characters, and need not be composed of numbers only. It is typically used to specify the numbering of chapters, sections, list items, etc.; it may also be used in the specification of a standard reference system for the text.

xml:lang (language) indicates the language of the element content using a ‘tag’ generated according to BCP 47.
Status Optional
Datatype data.language
<p> … The consequences of
this rapid depopulation were the loss of the last
<foreign xml:lang="rap">ariki</foreign> or chief
(Routledge 1920:205,210) and their connections to
ancestral territorial organization.</p>
Note

The xml:lang value will be inherited from the immediately enclosing element, or from its parent, and so on up the document hierarchy. It is generally good practice to specify xml:lang at the highest appropriate level, noticing that a different default may be needed for the teiHeader from that needed for the associated resource element or elements, and that a single TEI document may contain texts in many languages.

The authoritative list of registered language subtags is maintained by IANA and is available at http://www.iana.org/assignments/language-subtag-registry. For a good general overview of the construction of language tags, see http://www.w3.org/International/articles/language-tags/, and for a practical step-by-step guide, see http://www.w3.org/International/questions/qa-choosing-language-tags.

The value used must conform with BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a language element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their IETF (Internet Engineering Task Force) definitions.
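
The inheritance of xml:lang described in the note above amounts to walking from an element up through its ancestors until a value is found. The Python sketch below illustrates this under stated assumptions: effective_lang is a hypothetical helper, the sample document is invented around the <foreign> example, and the standard library's ElementTree is used, which needs an explicit parent map.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def effective_lang(root, target):
    """Return the xml:lang value in force for target, or None if none is declared."""
    # ElementTree elements have no parent pointer, so build a child -> parent map.
    parents = {child: parent for parent in root.iter() for child in parent}
    node = target
    while node is not None:
        lang = node.get(XML_LANG)
        if lang is not None:
            return lang
        node = parents.get(node)
    return None


doc = ET.fromstring(
    '<text xml:lang="en"><p>the last <foreign xml:lang="rap">ariki</foreign> '
    "or <hi>chief</hi></p></text>"
)
print(effective_lang(doc, doc.find(".//foreign")), effective_lang(doc, doc.find(".//hi")))
```

The <foreign> element reports its own value, while <hi> falls back to the value declared on <text>.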

xml:base provides a base URI reference with which applications can resolve relative URI references into absolute URI references.
Status Optional
Datatype data.pointer
<div type="bibl">
 <head>Bibliography</head>
 <listBibl xml:base="http://www.lib.ucdavis.edu/BWRP/Works/">
  <bibl>
   <author>
    <name>Landon, Letitia Elizabeth</name>
   </author>
   <ref target="LandLVowOf.sgm">
    <title>The Vow of the Peacock</title>
   </ref>
  </bibl>
  <bibl>
   <author>
    <name>Compton, Margaret Clephane</name>
   </author>
   <ref target="NortMIrene.sgm">
    <title>Irene, a Poem in Six Cantos</title>
   </ref>
  </bibl>
  <bibl>
   <author>
    <name>Taylor, Jane</name>
   </author>
   <ref target="TaylJEssay.sgm">
    <title>Essays in Rhyme on Morals and Manners</title>
   </ref>
  </bibl>
 </listBibl>
</div>
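
How an application might resolve the relative target values of the bibliography example against the xml:base value can be sketched with Python's standard library. This is an illustration only; the variable names are assumptions.

```python
from urllib.parse import urljoin

# Base URI taken from the xml:base attribute on <listBibl> in the example.
base = "http://www.lib.ucdavis.edu/BWRP/Works/"
targets = ["LandLVowOf.sgm", "NortMIrene.sgm", "TaylJEssay.sgm"]

# Each relative reference is resolved against the base URI.
resolved = [urljoin(base, target) for target in targets]
for uri in resolved:
    print(uri)
```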
xml:space signals an intention about how white space should be managed by applications.
Status Optional
Datatype data.enumerated
Legal values are:
default
signals that the application's default white-space processing modes are acceptable
preserve
indicates the intent that applications preserve all white space
Note

The XML specification provides further guidance on the use of this attribute. Note that many parsers may not handle xml:space correctly.

att.global.rendition

att.global.rendition provides rendering attributes common to all elements in the TEI encoding scheme. [1.3.1.1.3. Rendition Indicators]
Module tei
Members att.global [TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text]
Attributes
rend (rendition) indicates how the element in question was rendered or presented in the source text.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace
<head rend="align(center) case(allcaps)">
 <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle,
<lb/>On Her <lb/>
 <hi rend="case(mixed)">New Blazing-World</hi>.
</head>
Note

These Guidelines make no binding recommendations for the values of the rend attribute; the characteristics of visual presentation vary too much from text to text and the decision to record or ignore individual characteristics varies too much from project to project. Some potentially useful conventions are noted from time to time at appropriate points in the Guidelines. The values of the rend attribute are a set of sequence-indeterminate individual tokens separated by whitespace.

style contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text.
Status Optional
Datatype data.text
<head style="text-align: center; font-variant: small-caps">
 <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her
<lb/>
 <hi style="font-variant: normal">New Blazing-World</hi>.
</head>
Note

Unlike the values of the rend attribute, which use whitespace as a separator, the value of the style attribute may contain whitespace. This attribute is intended for recording inline stylistic information concerning the source, not any particular output.

The formal language in which values for this attribute are expressed may be specified using the <styleDefDecl> element in the TEI header.

rendition points to a description of the rendering or presentation used for this element in the source text.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
<head rendition="#ac #sc">
 <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her
<lb/>
 <hi rendition="#normal">New Blazing-World</hi>.
</head>
<!-- elsewhere... -->
<rendition scheme="css" xml:id="sc">font-variant: small-caps</rendition>
<rendition scheme="css" xml:id="normal">font-variant: normal</rendition>
<rendition scheme="css" xml:id="ac">text-align: center</rendition>
Note

The rendition attribute is used in a very similar way to the class attribute defined for XHTML but with the important distinction that its function is to describe the appearance of the source text, not necessarily to determine how that text should be presented on screen or paper.

Where both rendition and rend are supplied, the latter is understood to override or complement the former.

Each URI provided should indicate a <rendition> element defining the intended rendition in terms of some appropriate style language, as indicated by the scheme attribute.

att.global.responsibility

att.global.responsibility provides attributes indicating the agency responsible for some aspect of the text, the markup or something asserted by the markup, and the degree of certainty associated with it. [3.4. Simple Editorial Changes 11.3.2.2. Hand, Responsibility, and Certainty Attributes 17.3. Spans and Interpretations 13.1.1. Linking Names and Their Referents]
Module tei
Members att.global [TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text]
Attributes
cert (certainty) signifies the degree of certainty associated with the intervention or interpretation.
Status Optional
Datatype data.certainty
resp (responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Note

Note that a simple resp pointing to a person or organization is likely to be somewhat ambiguous with regard to the nature of the responsibility. For this reason, we recommend that resp be used to point not to an agent (<person> or <org>) but to a <respStmt>, author, editor or similar element which clarifies the exact role played by the agent. Pointing to multiple <respStmt>s allows the encoder to specify clearly each of the roles played in part of a TEI file (creating, transcribing, encoding, editing, proofing etc.).

Example
Blessed are the
<choice>
 <sic>placemakers</sic>
 <corr cert="high" resp="#editor">peacemakers</corr>
</choice>: for they shall be called the children of God.
Example

<!-- in the <text> ... --><lg>
<!-- ... -->
 <l>Punkes, Panders, baſe extortionizing
   sla<choice>
   <sic>n</sic>
   <corr resp="#JENS1_transcriber">u</corr>
  </choice>es,</l>
<!-- ... -->
</lg>
<!-- in the <teiHeader> ... -->
<!-- ... -->
<respStmt xml:id="JENS1_transcriber">
 <resp when="2014">Transcriber</resp>
 <name>Janelle Jenstad</name>
</respStmt>

att.internetMedia

att.internetMedia provides attributes for specifying the type of a computer resource using a standard taxonomy.
Module tei
Members ref
Attributes
mimeType (MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type.
Status Optional
Datatype 1–∞ occurrences of data.word separated by whitespace
Example

In this example mimeType is used to indicate that the URL points to a TEI XML file encoded in UTF-8.

<ref mimeType="application/tei+xml; charset=UTF-8"
 target="http://sourceforge.net/p/tei/code/HEAD/tree/trunk/P5/Source/guidelines-en.xml"/>
Note

This attribute class provides an attribute for describing a computer resource, typically available over the internet, using a value taken from a standard taxonomy. At present only a single taxonomy is supported, the Multipurpose Internet Mail Extensions (MIME) Media Type system. This typology of media types is defined by the Internet Engineering Task Force in RFC 2046. The list of types is maintained by the Internet Assigned Numbers Authority (IANA). The mimeType attribute must have a value taken from this list.

att.naming

att.naming provides attributes common to elements which refer to named persons, places, organizations etc. [3.5.1. Referring Strings 13.3.5. Names and Nyms]
Module tei
Members att.personal
Attributes att.canonical (@key, @ref)
role may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
Status Optional
Datatype 1–∞ occurrences of data.enumerated separated by whitespace
nymRef (reference to the canonical name) provides a means of locating the canonical form (nym) of the names associated with the object named by the element bearing it.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Note

The value must point directly to one or more XML elements by means of one or more URIs, separated by whitespace. If more than one is supplied, the implication is that the name is associated with several distinct canonical names.

att.placement

att.placement provides attributes for describing where on the source page or object a textual element appears. [3.4.3. Additions, Deletions, and Omissions 11.3.1.4. Additions and Deletions]
Module tei
Members label
Attributes
place specifies where this item is placed. Suggested values include: 1] below; 2] bottom; 3] margin; 4] top; 5] opposite; 6] overleaf; 7] above; 8] end; 9] inline; 10] inspace
Status Recommended
Datatype 1–∞ occurrences of data.enumerated separated by whitespace
Suggested values include:
below
below the line
bottom
at the foot of the page
margin
in the margin (left, right, or both)
top
at the top of the page
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
<add place="margin">[An addition written in the margin]</add>
<add place="bottom opposite">[An addition written at the
foot of the current page and also on the facing page]</add>
<note place="bottom">Ibid, p.7</note>

att.pointing

att.pointing defines a set of attributes used by all elements which point to other elements by means of one or more URI references. [1.3.1.1.2. Language Indicators 3.6. Simple Links and Cross-References]
Module tei
Members ref
Attributes
targetLang specifies the language of the content to be found at the destination referenced by target, using a ‘language tag’ generated according to BCP 47.
Status Optional
Datatype data.language
Schematron

<sch:rule context="tei:*[not(self::tei:schemaSpec)][@targetLang]">
<sch:assert test="count(@target)">@targetLang can only be used if @target is specified.</sch:assert></sch:rule>
<linkGrp xml:id="pol-swh_aln_2.1-linkGrp">
 <ptr target="pol/UDHR/text.xml#pol_txt_1-head"
  targetLang="pl" type="tuv" xml:id="pol-swh_aln_2.1.1-ptr"/>
 <ptr target="swh/UDHR/text.xml#swh_txt_1-head"
  targetLang="sw" type="tuv" xml:id="pol-swh_aln_2.1.2-ptr"/>
</linkGrp>

In the example above, the <linkGrp> combines pointers at parallel fragments of the Universal Declaration of Human Rights: one of them is in Polish, the other in Swahili.

Note

The value must conform to BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a language element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their IETF (Internet Engineering Task Force) definitions.

target specifies the destination of the reference by supplying one or more URI references.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Note

One or more syntactically valid URI references, separated by whitespace. Because whitespace is used to separate URIs, no whitespace is permitted inside a single URI. If a whitespace character is required in a URI, it should be escaped with the normal mechanism, e.g. TEI%20Consortium.
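
The whitespace convention for target can be illustrated in Python. This sketch rests on assumptions: the helper names split_pointers and escape_spaces are hypothetical, and only the space character is escaped, which is the case the note mentions.

```python
def split_pointers(value):
    """Split a target value into its individual URI references."""
    # str.split() with no argument splits on any run of whitespace.
    return value.split()


def escape_spaces(uri):
    """Percent-encode spaces so that a URI survives whitespace splitting."""
    return uri.replace(" ", "%20")


print(split_pointers("pol/UDHR/text.xml#pol_txt_1-head swh/UDHR/text.xml#swh_txt_1-head"))
print(escape_spaces("TEI Consortium"))
```

A URI containing an unescaped space would be split into two pointers, which is why escaping has to happen before the value is written to the attribute.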

evaluate specifies the intended meaning when the target of a pointer is itself a pointer.
Status Optional
Datatype data.enumerated
Legal values are:
all
if the element pointed to is itself a pointer, then the target of that pointer will be taken, and so on, until an element is found which is not a pointer.
one
if the element pointed to is itself a pointer, then its target (whether a pointer or not) is taken as the target of this pointer.
none
no further evaluation of targets is carried out beyond that needed to find the element specified in the pointer's target.
Note

If no value is given, the application program is responsible for deciding (possibly on the basis of user input) how far to trace a chain of pointers.
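
The three legal values of evaluate describe how far a chain of pointers is followed. The Python sketch below models this with a plain dictionary instead of real TEI elements; the function name follow_pointer, the dictionary layout and the sample identifiers are all assumptions made for illustration.

```python
def follow_pointer(target_id, pointers, evaluate="all"):
    """Return the id finally selected for a pointer whose target is target_id.

    pointers maps the id of each pointer element to the id it points at;
    ids that are absent from the mapping stand for ordinary (non-pointer) elements.
    """
    if evaluate not in ("all", "one", "none"):
        raise ValueError("evaluate must be 'all', 'one' or 'none'")
    if evaluate == "none" or target_id not in pointers:
        return target_id
    if evaluate == "one":
        # Take the target of the intermediate pointer, whatever it is.
        return pointers[target_id]
    # evaluate == "all": keep following until a non-pointer is reached.
    seen = set()
    current = target_id
    while current in pointers:
        if current in seen:
            raise ValueError("pointer chain contains a cycle")
        seen.add(current)
        current = pointers[current]
    return current


chain = {"ptr1": "ptr2", "ptr2": "div1"}
for mode in ("none", "one", "all"):
    print(mode, follow_pointer("ptr1", chain, mode))
```

The cycle check is an addition of this sketch; the text above does not say how applications should treat circular chains.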

att.ranging

att.ranging provides attributes for describing numerical ranges.
Moduletei
Membersatt.dimensions [att.editLike]
AttributesAttributes
atLeastgives a minimum estimated value for the approximate measurement.
Status Optional
Datatype data.numeric
atMost gives a maximum estimated value for the approximate measurement.
Status Optional
Datatype data.numeric
min where the measurement summarizes more than one observation or a range, supplies the minimum value observed.
Status Optional
Datatype data.numeric
max where the measurement summarizes more than one observation or a range, supplies the maximum value observed.
Status Optional
Datatype data.numeric
confidence specifies the degree of statistical confidence (between zero and one) that a value falls within the range specified by min and max, or the proportion of observed values that fall within that range.
Status Optional
Datatype data.probability
Example
The MS. was lost in transmission by mail from <del rend="overstrike">
 <gap atLeast="1" atMost="2"
  extent="one or two letters" reason="illegible" unit="chars"/>
</del> Philadelphia to the Graphic office, New York.

att.sortable

att.sortable provides attributes for elements in lists or groups that are sortable, but whose sorting key cannot be derived mechanically from the element content. [9.1. Dictionary Body and Overall Structure]
Module tei
Members idno
Attributes
sortKey supplies the sort key for this element in an index, list or group which contains it.
Status Optional
Datatype data.word
David's other principal backer, Josiah
ha-Kohen <index indexName="NAMES">
 <term sortKey="Azarya_Josiah_Kohen">Josiah ha-Kohen b. Azarya</term>
</index> b. Azarya, son of one of the last gaons of Sura was David's own first
cousin.
Note

The sort key is used to determine the sequence and grouping of entries in an index. It provides a sequence of characters which, when sorted with the other values, will produce the desired order; specifics of sort key construction are application-dependent.

Dictionary order often differs from the collation sequence of machine-readable character sets; in English-language dictionaries, an entry for 4-H will often appear alphabetized under ‘fourh’, and McCoy may be alphabetized under ‘maccoy’, while A1, A4, and A5 may all appear in numeric order ‘alphabetized’ between ‘a-’ and ‘AA’. The sort key is required if the orthography of the dictionary entry does not suffice to determine its location.
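How an application might use such keys can be sketched as follows. The entry list and the fallback to the lower-cased entry text are assumptions made for illustration; as noted above, the construction of sort keys is application-dependent.

```python
# Hypothetical index entries: the display text and an optional sort key.
entries = [
    {"text": "Madison", "sortKey": None},
    {"text": "McCoy", "sortKey": "maccoy"},
    {"text": "4-H", "sortKey": "fourh"},
    {"text": "Eagle", "sortKey": None},
]


def effective_key(entry: dict) -> str:
    """Use the supplied sort key, falling back to the lower-cased text."""
    return entry["sortKey"] or entry["text"].lower()


with_keys = [e["text"] for e in sorted(entries, key=effective_key)]
without_keys = [e["text"] for e in sorted(entries, key=lambda e: e["text"].lower())]
```

With the sort keys, 4-H files under ‘fourh’ and McCoy under ‘maccoy’, so both move relative to a plain character-based sort.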

att.source

att.source provides attributes for pointing to the source of a bibliographic reference. [3.3.3. Quotation 8.3.4. Writing]
Module tei
Members att.editLike
Attributes
source provides a pointer to the bibliographical source from which a quotation or citation is drawn.
Status Optional
Datatype 1–∞ occurrences of data.pointer separated by whitespace
Example
<p>
<!-- ... -->
As Willard McCarty (<bibl xml:id="mcc_2012">2012, p.2</bibl>)
tells us, <quote source="#mcc_2012">‘Collaboration’ is a
   problematic and should be a contested term.</quote>
<!-- ... -->
</p>
Example
<p>
<!-- ... -->
 <quote source="#chicago_15_ed">Grammatical theories
   are in flux, and the more we learn, the less we
   seem to know.</quote>
<!-- ... -->
</p>
<!-- ... -->
<bibl xml:id="chicago_15_ed">
 <title level="m">The Chicago Manual of Style</title>,
<edition>15th edition</edition>.
<pubPlace>Chicago</pubPlace>:
<publisher>University of Chicago Press</publisher>
(<date>2003</date>),
<biblScope unit="page">p.147</biblScope>.

</bibl>

att.typed

att.typed provides attributes which can be used to classify or subclassify elements in any way. [1.3.1. Attribute Classes 17.1.1. Words and Above 3.5.1. Referring Strings 3.6. Simple Links and Cross-References 3.5.5. Abbreviations and Their Expansions 3.12.1. Core Tags for Verse 7.2.5. Speech Contents 4.1.1. Un-numbered Divisions 4.1.2. Numbered Divisions 4.2.1. Headings and Trailers 4.4. Virtual Divisions 13.3.2.3. Personal Relationships 11.3.1.1. Core Elements for Transcriptional Work 16.1.1. Pointers and Links 16.3. Blocks, Segments, and Anchors 12.2. Linking the Apparatus to the Text 22.4.4.2. RELAX NG Content Models 8.3. Elements Unique to Spoken Texts 23.3.1.4. Modification of Attribute and Attribute Value Lists]
Module tei
Members application change label ref text
Attributes
type characterizes the element in some sense, using any convenient classification scheme or typology.
Status Optional
Datatype data.enumerated
<div type="verse">
 <head>Night in Tarras</head>
 <lg type="stanza">
  <l>At evening tramping on the hot white road</l>
  <l></l>
 </lg>
 <lg type="stanza">
  <l>A wind sprang up from nowhere as the sky</l>
  <l></l>
 </lg>
</div>
Note

The type attribute is present on a number of elements, not all of which are members of att.typed, usually because these elements restrict the possible values for the attribute in a specific way.

subtype provides a sub-categorization of the element, if needed
Status Optional
Datatype data.enumerated
Note

The subtype attribute may be used to provide any sub-classification for the element additional to that provided by its type attribute.

Schematron

<sch:rule context="*[@subtype]">
<sch:assert test="@type">The <sch:name/> element should not be categorized in detail with @subtype
unless also categorized in general with @type</sch:assert></sch:rule>
Note

When appropriate, values from an established typology should be used. Alternatively a typology may be defined in the associated TEI header. If values are to be taken from a project-specific list, this should be defined using the <valList> element in the project-specific schema description, as described in 23.3.1.4. Modification of Attribute and Attribute Value Lists .
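The constraint expressed by the Schematron rule (no subtype without type) can also be checked outside a Schematron processor. The following is a minimal sketch using Python's standard xml.etree.ElementTree module; the function name and the sample fragment are hypothetical, and namespaces are ignored for brevity.

```python
import xml.etree.ElementTree as ET


def subtype_without_type(xml_text: str) -> list[str]:
    """Return the names of elements that carry @subtype but no @type."""
    root = ET.fromstring(xml_text)
    return [
        el.tag
        for el in root.iter()  # the root element and all its descendants
        if "subtype" in el.attrib and "type" not in el.attrib
    ]


sample = (
    "<body>"
    '<div type="verse" subtype="sonnet"/>'
    '<div subtype="sonnet"/>'
    "<lg/>"
    "</body>"
)
violations = subtype_without_type(sample)
```

Only the second div is reported, since it is sub-classified without a general classification.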

Schema teiODD_LAUDATIOCorpus_S7: Macros

data.certainty

data.certainty defines the range of attribute values expressing a degree of certainty.
Module tei
Used by
Declaration
data.certainty = "high" | "medium" | "low" | "unknown"
Note

Certainty may be expressed by one of the predefined symbolic values high, medium, or low. The value unknown should be used in cases where the encoder does not wish to assert an opinion about the matter. For more precise indication, data.probability may be used instead or in addition.

data.count

data.count defines the range of attribute values used for a non-negative integer value used as a count.
Module tei
Used by
Element:
Declaration
data.count = xsd:nonNegativeInteger
Note

Only non-negative integer values (that is, zero and the positive integers) are permitted.

data.duration.iso

data.duration.iso defines the range of attribute values available for representation of a duration in time using ISO 8601 standard formats
Module tei
Used by
Declaration
data.duration.iso = token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Example
<time dur-iso="PT0,75H">three-quarters of an hour</time>
Example
<date dur-iso="P1,5D">a day and a half</date>
Example
<date dur-iso="P14D">a fortnight</date>
Example
<time dur-iso="PT0.02S">20 ms</time>
Note

A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the last, which may have a decimal component (using either . or , as the decimal point; the latter is preferred). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator T must precede the first ‘time’ number-letter pair.

For complete details, see ISO 8601 Data elements and interchange formats — Information interchange — Representation of dates and times.
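The declaration above restricts only the characters that may occur in the value. A minimal sketch of that lexical check follows; the function name is an assumption, and re.fullmatch is used because schema patterns always apply to the whole value.

```python
import re

# The pattern from the data.duration.iso declaration, unchanged.
DURATION_ISO = re.compile(r"[0-9.,DHMPRSTWYZ/:+\-]+")


def is_duration_iso(value: str) -> bool:
    """Check a value against the character pattern of data.duration.iso."""
    return DURATION_ISO.fullmatch(value.strip()) is not None


checks = {v: is_duration_iso(v) for v in ["PT0,75H", "P1,5D", "P14D", "three hours", "HHH"]}
```

The value HHH passes this test although it is not a meaningful duration: the pattern does not enforce the ordering of number-letter pairs described in the note, so further checks are left to the application.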

data.duration.w3c

data.duration.w3c defines the range of attribute values available for representation of a duration in time using W3C datatypes.
Module tei
Used by
Declaration
data.duration.w3c = xsd:duration
Example
<time dur="PT45M">forty-five minutes</time>
Example
<date dur="P1DT12H">a day and a half</date>
Example
<date dur="P7D">a week</date>
Example
<time dur="PT0.02S">20 ms</time>
Note

A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the S number, which may have a decimal component (using . as the decimal point). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator T must precede the first ‘time’ number-letter pair.

For complete details, see the W3C specification.
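As an illustration of the number-letter pairs, the following sketch converts a subset of such durations into a Python timedelta. It is a simplification under stated assumptions: only day, hour, minute and second components are handled, because years and months have no fixed length, and negative durations are not supported. The function name to_timedelta is hypothetical.

```python
import re
from datetime import timedelta

# Days before the T separator; hours, minutes and seconds after it.
_DURATION = re.compile(
    r"P(?:(?P<days>\d+)D)?"
    r"(?:T(?:(?P<hours>\d+)H)?(?:(?P<minutes>\d+)M)?"
    r"(?:(?P<seconds>\d+(?:\.\d+)?)S)?)?"
)


def to_timedelta(value: str) -> timedelta:
    """Convert a duration made of D, H, M (minute) and S parts to a timedelta."""
    match = _DURATION.fullmatch(value)
    if match is None or value.endswith("T"):
        raise ValueError(f"unsupported duration: {value!r}")
    parts = {k: float(v) for k, v in match.groupdict().items() if v is not None}
    if not parts:
        # A bare "P" contains no number-letter pair at all.
        raise ValueError(f"unsupported duration: {value!r}")
    return timedelta(**parts)


day_and_a_half = to_timedelta("P1DT12H")
```

Values with a year or month component, such as P1Y, are rejected with a ValueError rather than approximated.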

data.enumerated

data.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities.
Module tei
Used by
Element:
Declaration
data.enumerated = data.word
Note

Attributes using this datatype must contain a single word matching the pattern defined for this datatype: for example it cannot include whitespace but may begin with digits.

Typically, the list of documented possibilities will be provided (or exemplified) by a value list in the associated attribute specification, expressed with a <valList> element.

data.language

data.language defines the range of attribute values used to identify a particular combination of human language and writing system. [6.1. Language Identification]
Module tei
Used by
Element:
Declaration
data.language = xsd:language | ""
Note

The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 5646 (which obsoleted RFC 4646) and RFC 4647; over time, other IETF documents may succeed these as the best current practice.

A ‘language tag’, per BCP 47, is assembled from a sequence of components or subtags separated by the hyphen character (-, U+002D). The tag is made of the following subtags, in the following order. Every subtag except the first is optional. If present, each occurs only once, except the fourth and fifth components (variant and extension), which are repeatable.

language
The IANA-registered code for the language. This is almost always the same as the ISO 639 2-letter language code if there is one. The list of available registered language subtags can be found at http://www.iana.org/assignments/language-subtag-registry. It is recommended that this code be written in lower case.
script
The ISO 15924 code for the script. These codes consist of 4 letters, and it is recommended they be written with an initial capital, the other three letters in lower case. The canonical list of codes is maintained by the Unicode Consortium, and is available at http://unicode.org/iso15924/iso15924-codes.html. The IETF recommends this code be omitted unless it is necessary to make a distinction you need.
region
Either an ISO 3166 country code or a UN M.49 region code that is registered with IANA (not all such codes are registered, e.g. UN codes for economic groupings or codes for countries for which there is already an ISO 3166 2-letter code are not registered). The former consist of 2 letters, and it is recommended they be written in upper case. The list of codes can be found at http://www.iso.org/iso/home/standards/country_codes/iso-3166-1_decoding_table.htm. The latter consist of 3 digits; the list of codes can be found at http://unstats.un.org/unsd/methods/m49/m49.htm.
variant
An IANA-registered variation. These codes are used to indicate additional, well-recognized variations that define a language or its dialects that are not covered by other available subtags.
extension
An extension has the format of a single letter followed by a hyphen followed by additional subtags. These exist to allow for future extension to BCP 47, but as of this writing no such extensions are in use.
private use
An extension that uses the initial subtag of the single letter x (i.e., starts with x-) has no meaning except as negotiated among the parties involved. These should be used with great care, since they interfere with the interoperability that use of RFC 4646 is intended to promote. In order for a document that makes use of these subtags to be TEI-conformant, a corresponding language element must be present in the TEI header.

There are two exceptions to the above format. First, there are language tags in the IANA registry that do not match the above syntax, but are present because they have been ‘grandfathered’ from previous specifications.

Second, an entire language tag can consist of only a private use subtag. These tags start with x-, and do not need to follow any further rules established by the IETF and endorsed by these Guidelines. Like all language tags that make use of private use subtags, the language in question must be documented in a corresponding language element in the TEI header.

Examples include

sn
Shona
zh-TW
Taiwanese
zh-Hant-HK
Chinese written in traditional script as used in Hong Kong
en-SL
English as spoken in Sierra Leone
pl
Polish
es-MX
Spanish as spoken in Mexico
es-419
Spanish as spoken in Latin America

The W3C Internationalization Activity has published a useful introduction to BCP 47, Language tags in HTML and XML.
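The order of subtags described above can be made concrete with a deliberately simplified sketch. It recognizes the language, script and region subtags by their length and character type and collects everything else (variants, extensions, private use) unanalysed. It does not consult the IANA registry and does not handle grandfathered tags or tags that consist only of a private use subtag; the function name is hypothetical.

```python
def split_language_tag(tag: str) -> dict:
    """Split a language tag into language, script, region and remaining subtags."""
    subtags = tag.split("-")
    result = {"language": subtags[0].lower(), "script": None, "region": None, "rest": []}
    for sub in subtags[1:]:
        if result["rest"]:
            # Once a variant, extension or private use subtag has started,
            # everything after it is kept as is.
            result["rest"].append(sub)
        elif (len(sub) == 4 and sub.isalpha()
              and result["script"] is None and result["region"] is None):
            result["script"] = sub.title()   # recommended form: initial capital
        elif result["region"] is None and (
            (len(sub) == 2 and sub.isalpha()) or (len(sub) == 3 and sub.isdigit())
        ):
            result["region"] = sub.upper()   # recommended form: upper case
        else:
            result["rest"].append(sub)
    return result


hong_kong = split_language_tag("zh-Hant-HK")
latin_america = split_language_tag("es-419")
```

For zh-Hant-HK this yields the language zh, the script Hant and the region HK; for es-419 the three-digit UN M.49 code is recognized as the region.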

data.name

data.name defines the range of attribute values expressed as an XML Name.
Module tei
Used by
Element:
Declaration
data.name = xsd:Name
Note

Attributes using this datatype must contain a single word which follows the rules defining a legal XML name (see http://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits.

data.numeric

data.numeric defines the range of attribute values used for numeric values.
Module tei
Used by
Declaration
data.numeric =
   xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal
Note

Any numeric value, represented as a decimal number, in floating point format, or as a ratio.

To represent a floating point number in scientific notation, ‘E notation’, a variant of ‘exponential notation’, may be used. In this format, the value is expressed as two numbers separated by the letter E. The first number, the significand (sometimes called the mantissa), is given in decimal format, while the second is an integer exponent. The value is obtained by multiplying the significand by 10 raised to the power of that integer. Thus the value represented in decimal notation as 1000.0 might be represented in scientific notation as 1E3 (or 1.0E3).

A value expressed as a ratio is represented by two integer values separated by a solidus (/) character. Thus, the value represented in decimal notation as 0.5 might be represented as a ratio by the string 1/2.
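The three representations can be read with a few lines of code. The sketch below is an assumption about how an application might interpret such values, not part of the schema: ratios are matched with the pattern from the declaration, and everything else is handed to Python's float(), which is slightly more lenient than xsd:double and xsd:decimal.

```python
import re
from fractions import Fraction

# The ratio alternative from the data.numeric declaration, with capture groups.
_RATIO = re.compile(r"(-?\d+)/(-?\d+)")


def parse_numeric(value: str) -> float:
    """Interpret a decimal, floating point ('E notation') or ratio value."""
    value = value.strip()
    ratio = _RATIO.fullmatch(value)
    if ratio:
        return float(Fraction(int(ratio.group(1)), int(ratio.group(2))))
    return float(value)


half = parse_numeric("1/2")
thousand = parse_numeric("1E3")
```

Here half is 0.5 and thousand is 1000.0. Note that 10E3 evaluates to 10000.0, since the significand is multiplied by ten to the power of the exponent.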

data.outputMeasurement

data.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display.
Module tei
Used by
Declaration
data.outputMeasurement =
   token
   {
      pattern = "[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)"
   }
Example
<figure>
 <head>The TEI Logo</head>
 <figDesc>Stylized yellow angle brackets with the letters <mentioned>TEI</mentioned> in
   between and <mentioned>text encoding initiative</mentioned> underneath, all on a white
   background.</figDesc>
 <graphic height="600px"
  url="http://www.tei-c.org/logos/TEI-600.jpg" width="600px"/>
</figure>
Note

These values map directly onto the values used by XSL-FO and CSS. For definitions of the units see those specifications; at the time of this writing the most complete list is in the CSS3 working draft.
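The pattern in the declaration can be tried out directly. The following sketch copies it unchanged and applies it to whole values; the function name is an assumption.

```python
import re

# The pattern from the data.outputMeasurement declaration, unchanged.
OUTPUT_MEASUREMENT = re.compile(
    r"[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)"
)


def is_output_measurement(value: str) -> bool:
    """Check that a value is a number directly followed by a known unit."""
    return OUTPUT_MEASUREMENT.fullmatch(value) is not None


results = {v: is_output_measurement(v) for v in ["600px", "2.5em", "50%", "600", "600 px"]}
```

A bare number such as 600 and a value with a space before the unit such as 600 px are both rejected, because the unit is mandatory and must follow the number immediately.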

data.pattern

data.pattern (regular expression pattern) defines attribute values which are expressed as a regular expression.
Module tei
Used by
Declaration
data.pattern = token
Note
A regular expression, often called a pattern, is an expression that describes a set of strings. They are usually used to give a concise description of a set, without having to list all elements. For example, the set containing the three strings Handel, Händel, and Haendel can be described by the pattern H(ä|ae?)ndel (or alternatively, it is said that the pattern H(ä|ae?)ndel matches each of the three strings).
Source: Wikipedia
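The example pattern can be verified with any regular expression engine. The sketch below uses Python's re module; the list of candidate names, including the non-matching Hendel, is made up for illustration.

```python
import re

pattern = re.compile(r"H(ä|ae?)ndel")

candidates = ["Handel", "Händel", "Haendel", "Hendel"]
# fullmatch requires the whole string to match, as schema patterns do.
matching = [name for name in candidates if pattern.fullmatch(name)]
```

The first three names match; Hendel does not, because the group requires either ä or an a optionally followed by e.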

data.pointer

data.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere.
Module tei
Used by
Element:
Declaration
data.pointer = xsd:anyURI
Note

The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax. Note that the values themselves are encoded using RFC 3987 Internationalized Resource Identifiers (IRIs) mapping to URIs. For example, https://secure.wikimedia.org/wikipedia/en/wiki/% is encoded as https://secure.wikimedia.org/wikipedia/en/wiki/%25 while http://موقع.وزارة-الاتصالات.مصر/ is encoded as http://xn--4gbrim.xn----rmckbbajlc6dj7bxne2c.xn--wgbh1c/
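The percent-encoding step in the first example can be reproduced with Python's standard urllib.parse.quote function. This is a minimal sketch for a single path segment only; the helper name is hypothetical, and the mapping of internationalized domain names is not covered.

```python
from urllib.parse import quote


def escape_path_segment(segment: str) -> str:
    """Percent-encode every character of a path segment that is not unreserved."""
    return quote(segment, safe="")


percent = escape_path_segment("%")
spaced = escape_path_segment("TEI Consortium")
```

The literal percent sign becomes %25, and the space in the second value becomes %20.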

data.probability

data.probability defines the range of attribute values expressing a probability.
Module tei
Used by
Declaration
data.probability = xsd:double { minInclusive = "0" maxInclusive = "1" }
Note

Probability is expressed as a real number between 0 and 1; 0 representing certainly false and 1 representing certainly true.

data.replacement

data.replacement defines attribute values which contain a replacement template.
Module tei
Used by
Declaration
data.replacement = text

data.temporal.iso

data.temporal.iso defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the international standard Data elements and interchange formats – Information interchange – Representation of dates and times.
Module tei
Used by
Element:
Declaration
data.temporal.iso =
   xsd:date
 | xsd:gYear
 | xsd:gMonth
 | xsd:gDay
 | xsd:gYearMonth
 | xsd:gMonthDay
 | xsd:time
 | xsd:dateTime
 | token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Note

If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used.

For all representations for which ISO 8601 describes both a basic and an extended format, these Guidelines recommend use of the extended format.

While ISO 8601 permits the use of both 00:00 and 24:00 to represent midnight, these Guidelines strongly recommend against the use of 24:00.

data.temporal.w3c

data.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes Second Edition specification.
Module tei
Used by
Declaration
data.temporal.w3c =
   xsd:date
 | xsd:gYear
 | xsd:gMonth
 | xsd:gDay
 | xsd:gYearMonth
 | xsd:gMonthDay
 | xsd:time
 | xsd:dateTime
Note

If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used.

data.text

data.text defines the range of attribute values used to express some kind of identifying string as a single sequence of unicode characters possibly including whitespace.
Module tei
Used by
Element:
Declaration
data.text = string
Note

Attributes using this datatype must contain a single ‘token’ in which whitespace and other punctuation characters are permitted.

data.truthValue

data.truthValue defines the range of attribute values used to express a truth value.
Module tei
Used by
Element:
Declaration
data.truthValue = xsd:boolean
Note

The possible values of this datatype are 1 or true, or 0 or false.

This datatype applies only for cases where uncertainty is inappropriate; if the attribute concerned may have a value other than true or false, e.g. unknown, or inapplicable, it should have the extended version of this datatype: data.xTruthValue.

data.version

data.version defines the range of attribute values which may be used to specify a TEI or Unicode version number.
Module tei
Used by
Element:
Declaration
data.version = token { pattern = "[\d]+(\.[\d]+){0,2}" }
Note

The value of this attribute follows the pattern specified by the Unicode consortium for its version number (http://unicode.org/versions/). A version number contains digits and fullstop characters only. The first number supplied identifies the major version number. A second and third number, for minor and sub-minor version numbers, may also be supplied.
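A version value of this kind can be validated and compared as sketched below. The pattern is taken from the declaration; the function name and the choice to pad missing minor and sub-minor numbers with zero are assumptions made for the comparison.

```python
import re

# The pattern from the data.version declaration, unchanged.
VERSION = re.compile(r"[\d]+(\.[\d]+){0,2}")


def parse_version(value: str) -> tuple:
    """Return (major, minor, sub-minor), padding missing parts with zero."""
    if VERSION.fullmatch(value) is None:
        raise ValueError(f"not a valid version number: {value!r}")
    parts = [int(p) for p in value.split(".")]
    return tuple(parts + [0] * (3 - len(parts)))


newer = parse_version("10.0") > parse_version("9.1")
```

Comparing the resulting tuples orders 10.0 after 9.1, which a plain string comparison would not.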

data.versionNumber

data.versionNumber defines the range of attribute values used for version numbers.
Module tei
Used by
Element:
Declaration
data.versionNumber =
   token { pattern = "[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}" }

data.word

data.word defines the range of attribute values expressed as a single word or token.
Module tei
Used by
Declaration
data.word = token { pattern = "(\p{L}|\p{N}|\p{P}|\p{S})+" }
Note

Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace.
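Python's re module does not support the \p{...} property classes used in the declaration, so the sketch below approximates the pattern with the standard unicodedata module: a value is accepted if every character belongs to a Unicode general category starting with L (letter), N (number), P (punctuation) or S (symbol). The function name is an assumption.

```python
import unicodedata


def is_word(value: str) -> bool:
    """Approximate the data.word pattern using Unicode general categories."""
    return bool(value) and all(
        unicodedata.category(ch)[0] in "LNPS" for ch in value
    )


examples = {v: is_word(v) for v in ["Azarya_Josiah_Kohen", "4-H", "two words", ""]}
```

The underscore and the hyphen count as punctuation and are accepted; the space in "two words" is a separator and causes the value to be rejected, as is the empty string.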

data.xTruthValue

data.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown.
Module tei
Used by
Declaration
data.xTruthValue = xsd:boolean | "unknown" | "inapplicable"
Note

In cases where uncertainty is inappropriate, use the datatype data.truthValue.
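The difference between the plain and the extended truth value can be sketched as a small parser. The function name and the decision to return the strings unknown and inapplicable unchanged are assumptions made for illustration.

```python
def parse_x_truth_value(value: str):
    """Map an extended truth value to True, False, 'unknown' or 'inapplicable'."""
    value = value.strip()
    if value in ("true", "1"):
        return True
    if value in ("false", "0"):
        return False
    if value in ("unknown", "inapplicable"):
        # Only the extended datatype permits these two values.
        return value
    raise ValueError(f"not a legal truth value: {value!r}")


parsed = [parse_x_truth_value(v) for v in ["true", "0", "unknown"]]
```

An attribute declared with the non-extended datatype would accept only the first two branches.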

macro.limitedContent

macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. [1.3. The TEI Class System]
Module tei
Used by
Declaration
macro.limitedContent = ( text | model.limitedPhrase | model.inter )*

macro.paraContent

macro.paraContent (paragraph content) defines the content of paragraphs and similar elements. [1.3. The TEI Class System]
Module tei
Used by
Declaration
macro.paraContent =
   ( text | model.gLike | model.phrase | model.inter | model.global | lg | l )*

macro.phraseSeq

macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements. [1.4.1. Standard Content Models]
Module tei
Used by
Declaration
macro.phraseSeq = ( text | model.gLike | model.phrase | model.global )*

macro.phraseSeq.limited

macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. [1.4.1. Standard Content Models]
Module tei
Used by
Declaration
macro.phraseSeq.limited = ( text | model.limitedPhrase | model.global )*

macro.specialPara

macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. [1.3. The TEI Class System]
Module tei
Used by
Declaration
macro.specialPara =
   (
      text
    | model.gLike
    | model.phrase
    | model.inter
    | model.divPart
    | model.global
   )*

3 Contact

Carolin Odebrecht,
Korpuslinguistik und Morphologie. Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin.
carolin.odebrecht@hu-berlin.de

LAUDATIO stands for Long-term Access and Usage of Deeply Annotated Information. The management and archiving of digital research data is an overlapping field for linguistics, library and information science (LIS) and computer science, and these disciplines cooperate in the LAUDATIO project. The project was funded by the German Research Foundation from 2011 to 2014. The departments of Corpus Linguistics and Historical Linguistics and the Computer and Media Service (CMS) at Humboldt-Universität zu Berlin, together with the National Institute for Research in Computer Science and Control (INRIA France), are project partners cooperating with the Berlin School of Library and Information Science (BSLIS).

Funded by Deutsche Forschungsgemeinschaft (DFG).

4 Licence

The Creative Commons Attribution 3.0 Unported (CC BY 3.0) Licence applies to this document.

Carolin Odebrecht. Date: 2015-06-08