The LAUDATIO-Repository provides open access to historical text corpora. With the help of the corpus metadata,
you will be able to choose the corpus of interest for your analysis or for further preparation steps.
The TEI customization provides a metadata scheme for the LAUDATIO-Repository which enables a structured and uniform display of the corpus metadata as well as faceted and free-text search over it. The customization is based on a metadata model.
For each corpus in the repository the metadata are stored in TEI XML. There are three ODDs which specify the metadata for a corpus, for the documents of the corpus and for the annotations in the documents. This customization refers to the class 'corpus', see figure 1 below.
The TEI ODD is developed in the context of the LAUDATIO-Project in cooperation with the working groups of the Search and Visualization System ANNIS and the converter framework SaltNPepper.
The metadata model refers to all kinds of historical text corpora.
The metadata model defines a corpus as the sum of its documents, which are in turn the sum of their annotations. For each concept, there is a list of attributes which represent the metadata relevant for data retrieval in the repository. For example, the attribute 'title' is assigned to each concept, but there is a difference between the title of the corpus, the title of a historical document within the corpus and the title of an annotation layer. With the help of the model, it is clear which title is meant in which situation.
For detailed information, motivation and discussion see
The TEI customization creates a TEI scheme for the LAUDATIO metadata concepts 'Corpus', 'Document' and 'PreparationStep' by mapping the metadata model onto the TEI P5 header structure. The metadata model is realized by using a subset of TEI P5 and a TEI header for each concept:
Each TEI XML file contains fileDesc, encodingDesc and revisionDesc. The concepts 'Corpus' and 'Document' contain profileDesc because they refer to the language used in the data.
The headers relate to each other in the following way: The 'Corpus Header' provides a list of documents corresponding to the 'Document Header' and a list of annotations corresponding to the 'PreparationStep Header'. Each 'Document Header' contains a list of the annotations for that document, corresponding to the 'PreparationStep Header'.
The ODD combines the four basic modules (<textstructure>, <core>, <header> and <namesdates>), but removes most of the available elements and changes several attribute concepts, e.g. by adding a TEI attribute to an element. Thus, the customization is very restricted and allows little variation in selecting elements and attributes.
The <core> module is very restricted and only contains the elements corresponding to bibliographic metadata such as author, editor, date and title and more general structural elements such as list, item, label, p and ref.
n | (number) gives a number (or other label) for an element, which is not necessarily unique within the document. Sample values include: |
role | may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place. |
n | (number) gives a number (or other label) for an element, which is not necessarily unique within the document. Sample values include: |
role | may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place. |
when | supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd. |
type | List of documents in the corpus with corresponding references. |
n | (number) gives a number (or other label) for an element, which is not necessarily unique within the document. |
target [att.pointing] | specifies the destination of the reference by supplying one or more URI References |
type | (You must specify the exact reference to LAUDATIO metadata module schema 'Corpus'.) classifies the title according to some convenient typology. |
No class or general module modifications were made. The TEI scheme is only locally restricted, which means that no inheritance relations or module relations were changed, deleted or added. The customization is documented in the section 'Formal declaration'.
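A local restriction of this kind can be pictured with a small ODD fragment. This is only a sketch under assumptions, not the LAUDATIO ODD itself: the ident value is hypothetical, the infrastructure module tei and the element list for textstructure are assumed, and the element lists for the header and namesdates modules are abbreviated to a few elements named in this documentation. The value list for type on extent follows the declaration of extent in the element reference.

```xml
<schemaSpec xmlns="http://www.tei-c.org/ns/1.0" ident="laudatio_corpus_sketch" start="TEI">
  <!-- Infrastructure module providing classes and datatypes (assumed) -->
  <moduleRef key="tei"/>
  <!-- Keep only selected elements from each module; all lists except core are abbreviated or assumed -->
  <moduleRef key="textstructure" include="TEI text body"/>
  <moduleRef key="core" include="author editor date title list item label p ref"/>
  <moduleRef key="header" include="teiHeader fileDesc titleStmt extent publicationStmt sourceDesc encodingDesc profileDesc revisionDesc change"/>
  <moduleRef key="namesdates" include="persName forename surname orgName affiliation"/>

  <!-- Local change to one element: type on extent becomes required with a closed value list -->
  <elementSpec ident="extent" module="header" mode="change">
    <attList>
      <attDef ident="type" mode="replace" usage="req">
        <valList type="closed">
          <valItem ident="Tokens"/>
          <valItem ident="Words"/>
        </valList>
      </attDef>
    </attList>
  </elementSpec>
</schemaSpec>
```

The elementSpec changes only the one element it names, so class membership and module relations stay as TEI defines them.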
All encoded metadata refer to information about the whole corpus, e.g. the editors and annotators of the corpus, the preliminary annotation guidelines used for all formats of the corpus, the revision history, and the available formats and their licenses.
The LAUDATIO metadata scheme for the object 'Corpus' contains many more elements and modules than a standard TEI header, which is usually used for the purpose of text encoding only. For validity, fileDesc, profileDesc, encodingDesc and revisionDesc are required.
The fileDesc contains the following elements. Their basic meaning is left untouched, but they refer to the object 'Corpus': the element author refers to an annotator of the corpus, and the element title refers to the title of the entire corpus. Elements used within teiHeader are:
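The four required parts can be sketched as a header skeleton. The comments only name the kind of content this documentation describes for each part; the skeleton is not valid as it stands, and the order of encodingDesc and profileDesc shown here is an assumption.

```xml
<teiHeader xmlns="http://www.tei-c.org/ns/1.0">
  <fileDesc>
    <!-- titleStmt, extent, publicationStmt, sourceDesc: metadata about the whole corpus -->
  </fileDesc>
  <encodingDesc>
    <!-- one encodingDesc per corpus format: appInfo, projectDesc, editorialDecl, tagsDecl -->
  </encodingDesc>
  <profileDesc>
    <!-- language used in the corpus data -->
  </profileDesc>
  <revisionDesc>
    <!-- change elements recording the revision history -->
  </revisionDesc>
</teiHeader>
```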
type | (You must indicate what type of extent you refer to) characterizes the element in some sense, using any convenient classification scheme or typology. |
n [att.global] | (number) gives a number (or other label) for an element, which is not necessarily unique within the document. |
type | (You must indicate what type of extent you refer to) characterizes the element in some sense, using any convenient classification scheme or typology. |
status | supplies a code identifying the current availability of the text. |
when | supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd. |
type | List of documents in the corpus with corresponding references. |
corresp | |
n | (number) gives a number (or other label) for an element, which is not necessarily unique within the document. |
style | contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text |
ident | (identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global xml:lang attribute. |
ident | supplies an identifier for the application, independent of its version number or display name. |
version | supplies a version number for the application, independent of its identifier or display name. |
target [att.pointing] | specifies the destination of the reference by supplying one or more URI References |
name | specifies the full formal name of the namespace concerned. |
rend | (rendition) indicates how the element in question was rendered or presented in the source text. |
gi | (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent namespace element. |
n [att.global] | (number) gives a number (or other label) for an element, which is not necessarily unique within the document. |
when [att.datable.w3c] | supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd. |
who [att.ascribed] | indicates the person, or group of people, to whom the element content is ascribed. |
<affiliation> contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor. [15.2.2. The Participant Description] | |
Module | namesdates |
Member of | |
Contained by | |
May contain | |
Declaration | element affiliation { macro.phraseSeq } |
Example | <affiliation>Junior project officer for the US <name type="org">National Endowment for the Humanities</name> </affiliation> <affiliation notAfter="1960-01-01" notBefore="1957-02-28">Paid up member of the <orgName>Australian Journalists Association</orgName> </affiliation> |
Note | If included, the name of an organization may be tagged using either the <name> element as above, or the more specific orgName element. |
<appInfo> (application information) records information about an application which has edited the TEI file. [2.3.10. The Application Information Element] | |
Module | header |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) |
Member of | |
Contained by | header: encodingDesc |
May contain | header: application |
Declaration | element appInfo { att.global.attributes, model.applicationLike+ } |
Example | <appInfo> <application ident="Xaira" version="1.24"> <label>XAIRA Indexer</label> <ptr target="#P1"/> </application> </appInfo> |
<application> provides information about an application which has acted upon the document. [2.3.10. The Application Information Element] | |||||||||||||
Module | header | ||||||||||||
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) att.datable (@calendar, @period) (att.datable.w3c (@when, @notBefore, @notAfter, @from, @to)) (att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso)) (att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)) | ||||||||||||
Member of | |||||||||||||
Contained by | header: appInfo | ||||||||||||
May contain | |||||||||||||
Declaration | element application { att.global.attributes, att.typed.attributes, att.datable.attributes, attribute ident { data.name }, attribute version { data.versionNumber }, ( model.labelLike+, ( model.ptrLike* | model.pLike* ) ) } | ||||||||||||
Example | <appInfo> <application ident="ImageMarkupTool1" notAfter="2006-06-01" version="1.5"> <label>Image Markup Tool</label> <ptr target="#P1"/> <ptr target="#P2"/> </application> </appInfo> This example shows an appInfo element documenting the fact that version 1.5 of the Image Markup Tool1 application has an interest in two parts of a document which was last saved on June 6 2006. The parts concerned are accessible at the URLs given as target for the two <ptr> elements. |
<author> in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement] | |||||||||||||||||||||
Module | core | ||||||||||||||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) | ||||||||||||||||||||
Member of | |||||||||||||||||||||
Contained by | header: titleStmt | ||||||||||||||||||||
May contain | |||||||||||||||||||||
Declaration | element author { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute n { data.text }?, attribute role { list { ( "Annotator" | "Infrastructure" | "Transcription" )+ } }?, macro.phraseSeq } | ||||||||||||||||||||
Example | <author>British Broadcasting Corporation</author> <author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author> <author>Anonymous</author> <author>Bill and Melinda Gates Foundation</author> <author> <persName>Beaumont, Francis</persName> and <persName>John Fletcher</persName> </author> <author> <orgName key="BBC">British Broadcasting Corporation</orgName>: Radio 3 Network </author> | ||||||||||||||||||||
Note | Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use a generally recognized name authority file to supply the content for this element. The attributes key or ref may also be used to reference canonical information about the author(s) intended from any appropriate authority, such as a library catalogue or online resource. In the case of a broadcast, use this element for the name of the company or network responsible for making the broadcast. Where an author is unknown or unspecified, this element may contain text such as Unknown or Anonymous. When the appropriate TEI modules are in use, it may also contain detailed tagging of the names used for people, organizations or places, in particular where multiple names are given. |
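For LAUDATIO, the declaration above restricts role on author to the values 'Annotator', 'Infrastructure' and 'Transcription', given as a whitespace-separated list. A minimal sketch of an annotator entry follows; the person and institution are hypothetical placeholders, and the nested structure mirrors the fileDesc example in this documentation.

```xml
<author n="1" role="Annotator Transcription">
  <persName>
    <forename>Erika</forename>
    <surname>Mustermann</surname>
  </persName>
  <affiliation>
    <orgName type="Institution">Example University</orgName>
  </affiliation>
</author>
```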
<authority> (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | header |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) |
Member of | |
Contained by | header: publicationStmt |
May contain | |
Declaration | element authority { att.global.attributes, macro.phraseSeq.limited } |
Example | <authority>John Smith</authority> |
<availability> supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.] | |||||||||
Module | header | ||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) | ||||||||
Member of | |||||||||
Contained by | header: publicationStmt | ||||||||
May contain | core: p | ||||||||
Declaration | element availability { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute status { "free" | "unknown" | "restricted" }, ( model.availabilityPart | model.pLike )+ } | ||||||||
Example | <availability status="restricted"> <p>Available for academic research purposes only.</p> </availability> <availability status="free"> <p>In the public domain</p> </availability> <availability status="restricted"> <p>Available under licence from the publishers.</p> </availability> | ||||||||
Example | <availability> <licence target="http://opensource.org/licenses/MIT"> <p>The MIT License applies to this document.</p> <p>Copyright (C) 2011 by The University of Victoria</p> <p>Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:</p> <p>The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.</p> <p>THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.</p> </licence> </availability> | ||||||||
Note | A consistent format should be adopted |
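In this customization the declaration lists status without an optionality marker and with the three values 'free', 'unknown' and 'restricted', and the only listed content is p. A sketch of an entry that fits these constraints, with wording adapted from the corpus example in this documentation:

```xml
<availability status="free">
  <p>All corpus data is licensed under a Creative Commons Attribution 3.0 Unported License.</p>
</availability>
```

The generic TEI example with licence and without status would therefore probably not validate against the LAUDATIO scheme.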
<change> documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.5. The Revision Description 2.4.1. Creation 11.7. Changes] | |||||||
Module | header | ||||||
Attributes | Attributes att.ascribed (@who) att.datable (@calendar, @period) (att.datable.w3c (@when, @notBefore, @notAfter, @from, @to)) (att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso)) (att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)) att.docStatus (@status) att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) | ||||||
Contained by | header: revisionDesc | ||||||
May contain | |||||||
Declaration | element change { att.ascribed.attributes, att.datable.attributes, att.docStatus.attributes, att.global.attributes, att.typed.attributes, attribute target { list { data.pointer+ } }?, macro.specialPara } | ||||||
Example | <titleStmt> <title> ... </title> <editor xml:id="LDB">Lou Burnard</editor> <respStmt xml:id="BZ"> <resp>copy editing</resp> <name>Brett Zamir</name> </respStmt> </titleStmt> <!-- ... --> <revisionDesc status="published"> <change status="public" when="2008-02-02" who="#BZ">Finished chapter 23</change> <change status="draft" when="2008-01-02" who="#BZ">Finished chapter 2</change> <change n="P2.2" when="1991-12-21" who="#LDB">Added examples to section 3</change> <change when="1991-11-11" who="#MSM">Deleted chapter 10</change> </revisionDesc> | ||||||
Example | <profileDesc> <creation> <listChange> <change xml:id="DRAFT1">First draft in pencil</change> <change notBefore="1880-12-09" xml:id="DRAFT2">First revision, mostly using green ink</change> <change notBefore="1881-02-13" xml:id="DRAFT3">Final corrections as supplied to printer.</change> </listChange> </creation> </profileDesc> | ||||||
Note | The who attribute may be used to point to any other element, but will typically specify a <respStmt> or <person> element elsewhere in the header, identifying the person responsible for the change and their role in making it. It is recommended that changes be recorded with the most recent first. The status attribute may be used to indicate the status of a document following the change documented. |
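A revision history for a corpus could look like the sketch below, with the most recent change first as recommended. All values are hypothetical; in particular, the pointer given in who is a placeholder, since this section does not specify what it should reference in a LAUDATIO header.

```xml
<revisionDesc>
  <!-- most recent change first; dates, pointers and descriptions are invented for illustration -->
  <change n="2" when="2012-05-31" who="#ed1">Added a normalization layer to all documents.</change>
  <change n="1" when="2011-10-01" who="#ed1">Initial corpus version.</change>
</revisionDesc>
```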
<date> contains a date in any format. [3.5.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.5. The Revision Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 15.2.3. The Setting Description 13.3.6. Dates and Times] | |||||||||
Module | core | ||||||||
Attributes | Attributes att.typed (subtype, @type) | ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | |||||||||
Declaration | element date { att.typed.attribute.type, attribute when { data.temporal.iso }, ( text | model.gLike | model.phrase | model.global )* } | ||||||||
Example | <date when="1980-02">early February 1980</date> | ||||||||
Example | Given on the <date when="1977-06-12">Twelfth Day of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the Republic the Two Hundredth and first and of the University the Eighty-Sixth.</date> | ||||||||
Example | <date when="1990-09">September 1990</date> |
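The declaration above lists when without an optionality marker, so in this customization a date is expected to carry a normalized value. The corpus example in this documentation uses date for releases, which can be sketched as follows; the type value 'CorpusRelease' is taken from that example.

```xml
<date type="CorpusRelease" when="2011">First corpus release.</date>
<date type="CorpusRelease" when="2012-05-31">Second corpus release.</date>
```

Both a year alone and a full yyyy-mm-dd value appear in that example.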
<editor> contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.11.2.2. Titles, Authors, and Editors] | |||||||||||||||||||||
Module | core | ||||||||||||||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) | ||||||||||||||||||||
Member of | |||||||||||||||||||||
Contained by | header: titleStmt | ||||||||||||||||||||
May contain | |||||||||||||||||||||
Declaration | element editor { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute n { data.text }?, attribute role { list { "CorpusEditor"+ } }?, macro.phraseSeq } | ||||||||||||||||||||
Example | <editor>Eric Johnson</editor> <editor role="illustrator">John Tenniel</editor> | ||||||||||||||||||||
Note | A consistent format should be adopted. Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use generally recognized authority lists for the exact form of personal names. |
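The declaration above allows only the value 'CorpusEditor' for role, so the generic example with role="illustrator" would not fit the LAUDATIO scheme. A sketch of a conforming entry, with a hypothetical person and institution:

```xml
<editor n="1" role="CorpusEditor">
  <persName>
    <forename>Max</forename>
    <surname>Mustermann</surname>
  </persName>
  <affiliation>
    <orgName type="Institution">Example University</orgName>
  </affiliation>
</editor>
```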
<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements] | |
Module | header |
Attributes | Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | header: encodingDesc |
May contain | core: p header: normalization segmentation |
Declaration | element editorialDecl { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( model.pLike | model.editorialDeclPart )+ } |
Example | <editorialDecl> <normalization> <p>All words converted to Modern American spelling using Websters 9th Collegiate dictionary </p> </normalization> <quotation marks="all"> <p>All opening quotation marks converted to “ all closing quotation marks converted to &cdq;.</p> </quotation> </editorialDecl> |
<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. For LAUDATIO: The element encodingDesc refers to the different formats of the corpus. For each format, a single encodingDesc contains information about what kind of annotation, including keys and values, and what kind of formats were used. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components] | |
Module | header |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) |
Member of | |
Contained by | header: teiHeader |
May contain | core: p header: appInfo editorialDecl projectDesc tagsDecl |
Declaration | element encodingDesc { att.global.attributes, ( ( model.encodingDescPart | model.pLike )+ ) } |
Example | A corpus is available in EXMARaLDA format. The annotations were applied with the help of the EXMARaLDA Partitur Editor in a certain version, see appInfo. Information about the minimal segmentation in the format and the normalizations of the primary data is given in segmentation and normalization. The tagsDecl lists the annotation keys and values which are available in EXMARaLDA. Each namespace has @rend with a fixed set of values which classify the annotations according to their basic functions, such as 'Transcription', 'Syntactical' and 'Morphological'. This enables searching for certain annotations in more than one corpus in the LAUDATIO-Repository. <encodingDesc n="1"> <appInfo> <application ident="EXMARaLDA" version="1.5.1"> <label>EXMARaLDA Partitur Editor</label> </application> </appInfo> <projectDesc> <p> <ref target="http://korpling.german.hu-berlin.de/ridges/"/>The RIDGES project (Register in Diachronic German Science) is an investigation into the development of the German scientific language in the early modern and modern periods, ranging from the mid 16th to the late 19th century. </p> </projectDesc> <editorialDecl> <segmentation> <p>Multiple segmentations on the basis of "dipl", "clean" and "norm".</p> </segmentation> <normalization> <p>Two normalization are provided by the layer "clean" containing normalizations of special characters and by the layer "norm" containing normalizations of orthography.</p> </normalization> </editorialDecl> <tagsDecl> <namespace name="dipl" rend="Transcription" xml:id="dipl"> <tagUsage gi="NA">The diplomatic transcription of the word form as found on the manuscript. Line-breaks are marked as in the text, usually as 'U+2E17'.</tagUsage> </namespace> <namespace name="pos" rend="Lexical" xml:id="pos"> <tagUsage gi="ADJA">Attributives Adjektiv.</tagUsage> <tagUsage gi="ADJD">Adverbiales oder prädikatives Adjektiv.</tagUsage> <tagUsage gi="ADV">Adverb.</tagUsage> </namespace> </tagsDecl> </encodingDesc> |
<extent> describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 10.7.1. Object Description] | |||||||||||
Module | header | ||||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) att.typed (type, @subtype) | ||||||||||
Contained by | header: fileDesc | ||||||||||
May contain | |||||||||||
Declaration | element extent { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, att.typed.attribute.subtype, attribute type { "Tokens" | "Words" }, macro.phraseSeq } | ||||||||||
Example | <extent>3200 sentences</extent> <extent>between 10 and 20 Mb</extent> <extent>ten 3.5 inch high density diskettes</extent> | ||||||||||
Example | The <measure> element may be used to supply normalised or machine tractable versions of the size or sizes concerned. <extent> <measure quantity="4.2" unit="MiB">About four megabytes</measure> <measure quantity="245" unit="pages">245 pages of source material</measure> </extent> |
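For LAUDATIO, the declaration above restricts type on extent to 'Tokens' or 'Words' and lists it without an optionality marker, so the generic examples without type would probably not validate. A sketch of conforming entries; the token count is the one from the corpus example in this documentation, the word count is a hypothetical value.

```xml
<!-- size of the corpus counted in tokens -->
<extent type="Tokens">60720</extent>

<!-- alternative: size counted in words (hypothetical figure) -->
<extent type="Words">52000</extent>
```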
<fileDesc> (file description) [2.2. The File Description 2.1.1. The TEI Header and Its Components] | |
Module | header |
Attributes | Attributes att.global.responsibility (@cert, @resp) |
Contained by | header: teiHeader |
May contain | header: extent publicationStmt sourceDesc titleStmt |
Declaration | element fileDesc { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( ( titleStmt, editionStmt?, extent?, publicationStmt, seriesStmt?, notesStmt? ), sourceDesc+ ) } |
Example | The fileDesc refers to the whole corpus. Each corpus has one or more editors and annotators and may have some technical supporters, see editor and author. All project and institution references can be made in affiliation for persons or in publicationStmt in general. All documents of a corpus are listed in sourceDesc with a technical short name. <fileDesc> <titleStmt> <title type="Corpus">RIDGES Herbology Version 2.0</title> <editor n="1" role="CorpusEditor"> <persName> <forename>Anke</forename> <surname>Lüdeling</surname> </persName> <affiliation> <orgName type="Department">Institut für deutsche Sprache und Linguistik</orgName> <orgName type="Institution">Humboldt-Universität zu Berlin</orgName> </affiliation> </editor> <author n="1" role="Infrastructure"> <persName> <forename>Carolin</forename> <surname>Odebrecht</surname> </persName> <affiliation> <orgName type="Department">Institut für deutsche Sprache und Linguistik</orgName> <orgName type="Institution">Humboldt-Universität zu Berlin</orgName> </affiliation> </author> </titleStmt> <extent type="Tokens">60720</extent> <publicationStmt> <authority>Humboldt-Universität zu Berlin</authority> <idno>RIDGES Herbology Project.</idno> <availability status="free"> <p>Open Source Project. Open Source Project. All corpus data generated by the RIDGES project is licensed under a Creative Commons Attribution 3.0 Unported License.</p> </availability> <date type="CorpusRelease" when="2011">First corpus release.</date> <date type="CorpusRelease" when="2012-05-31">Second corpus release.</date> </publicationStmt> <sourceDesc> <list type="CorpusDocument"> <item corresp="deutsche.pflanzennamen.1870" n="1"/> <item corresp="noch.einige.worte.1840" n="2"/> <item corresp="flora.7.1821" n="3"/> </list> </sourceDesc> </fileDesc> |
Note | The major source of information for those seeking to create a catalogue entry or bibliographic citation for an electronic file. As such, it provides a title and statements of responsibility together with details of the publication or distribution of the file, of any series to which it belongs, and detailed bibliographic notes for matters not addressed elsewhere in the header. It also contains a full bibliographic description for the source or sources from which the electronic text was derived. |
<forename> contains a forename, given or baptismal name. [13.2.1. Personal Names] | |
Module | namesdates |
Attributes | Attributes att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | |
May contain | |
Declaration | element forename { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, macro.phraseSeq } |
Example | <persName> <roleName>Ex-President</roleName> <forename>George</forename> <surname>Bush</surname> </persName> |
<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.11.2.4. Imprint, Size of a Document, and Reprint Information] | |||||||
Module | header | ||||||
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.sortable (@sortKey) | ||||||
Member of | |||||||
Contained by | |||||||
May contain | header: idno | ||||||
Declaration | element idno { att.global.attributes, att.sortable.attributes, attribute type { data.enumerated }?, ( text | model.gLike | idno )* } | ||||||
Example | <idno type="ISBN">978-1-906964-22-1</idno> <idno type="ISSN">0143-3385</idno> <idno type="DOI">10.1000/123</idno> <idno type="URI">http://www.worldcat.org/oclc/185922478</idno> <idno type="URI">http://authority.nzetc.org/463/</idno> <idno type="LT">Thomason Tract E.537(17)</idno> <idno type="Wing">C695</idno> <idno type="oldCat"> <g ref="#sym"/>345 </idno> In the last case, the identifier includes a non-Unicode character which is defined elsewhere by means of a <glyph> or <char> element referenced here as | ||||||
Note | idno should be used for labels which identify an object or concept in a formal cataloguing system such as a database or an RDF store, or in a distributed system such as the World Wide Web. Some suggested values for type on idno are |
<item> contains one component of a list. [3.7. Lists 2.5. The Revision Description] | |||||||||||||
Module | core | ||||||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) | ||||||||||||
Contained by | core: list | ||||||||||||
May contain | |||||||||||||
Declaration | element item { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute n { data.text }, attribute corresp { text }, macro.specialPara } | ||||||||||||
Example | <list rend="numbered"> <head>Here begin the chapter headings of Book IV</head> <item n="4.1">The death of Queen Clotild.</item> <item n="4.2">How King Lothar wanted to appropriate one third of the Church revenues.</item> <item n="4.3">The wives and children of Lothar.</item> <item n="4.4">The Counts of the Bretons.</item> <item n="4.5">Saint Gall the Bishop.</item> <item n="4.6">The priest Cato.</item> <item> ...</item> </list> | ||||||||||||
Note | May contain simple prose or a sequence of chunks. Whatever string of characters is used to label a list item in the copy text may be used as the value of the global n attribute, but it is not required that numbering be recorded explicitly. In ordered lists, the n attribute on the item element is by definition synonymous with the use of the label element to record the enumerator of the list item. In glossary lists, however, the term being defined should be given with the label element, not n. |
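In the LAUDATIO scheme the declaration above lists both n and corresp on item without an optionality marker. The corpus example in this documentation uses item to list the documents of a corpus by their technical short names, which can be sketched as:

```xml
<sourceDesc>
  <list type="CorpusDocument">
    <!-- corresp holds the technical short name of a document, n its position in the list -->
    <item corresp="deutsche.pflanzennamen.1870" n="1"/>
    <item corresp="noch.einige.worte.1840" n="2"/>
  </list>
</sourceDesc>
```

The generic chapter-heading example above carries no corresp and would probably not validate against this customization.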
<label> contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.7. Lists] | |
Module | core |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.typed (@type, @subtype) att.placement (@place) |
Member of | |
Contained by | |
May contain | |
Declaration | element label { att.global.attributes, att.typed.attributes, att.placement.attributes, macro.phraseSeq } |
Example | Labels are commonly used for the headwords in glossary lists; note the use of the global xml:lang attribute to set the default language of the glossary list to Middle English, and identify the glosses and headings as modern English or Latin: <list type="gloss" xml:lang="enm"> <head xml:lang="en">Vocabulary</head> <headLabel xml:lang="en">Middle English</headLabel> <headItem xml:lang="en">New English</headItem> <label>nu</label> <item xml:lang="en">now</item> <label>lhude</label> <item xml:lang="en">loudly</item> <label>bloweth</label> <item xml:lang="en">blooms</item> <label>med</label> <item xml:lang="en">meadow</item> <label>wude</label> <item xml:lang="en">wood</item> <label>awe</label> <item xml:lang="en">ewe</item> <label>lhouth</label> <item xml:lang="en">lows</item> <label>sterteth</label> <item xml:lang="en">bounds, frisks (cf. <cit> <ref>Chaucer, K.T.644</ref> <quote>a courser, <term>sterting</term>as the fyr</quote> </cit> </item> <label>verteth</label> <item xml:lang="la">pedit</item> <label>murie</label> <item xml:lang="en">merrily</item> <label>swik</label> <item xml:lang="en">cease</item> <label>naver</label> <item xml:lang="en">never</item> </list> |
Example | Labels may also be used to record explicitly the numbers or letters which mark list items in ordered lists, as in this extract from Gibbon's Autobiography. In this usage the label element is synonymous with the n attribute on the item element: I will add two facts, which have seldom occurred in the composition of six, or at least of five quartos. <list rend="runon" type="ordered"> <label>(1)</label> <item>My first rough manuscript, without any intermediate copy, has been sent to the press.</item> <label>(2) </label> <item>Not a sheet has been seen by any human eyes, excepting those of the author and the printer: the faults and the merits are exclusively my own.</item> </list> |
Example | Labels may also be used for other structured list items, as in this extract from the journal of Edward Gibbon: <list type="gloss"> <label>March 1757.</label> <item>I wrote some critical observations upon Plautus.</item> <label>March 8th.</label> <item>I wrote a long dissertation upon some lines of Virgil.</item> <label>June.</label> <item>I saw Mademoiselle Curchod — <quote xml:lang="la">Omnia vincit amor, et nos cedamus amori.</quote> </item> <label>August.</label> <item>I went to Crassy, and staid two days.</item> </list> Note that the label might also appear within the item rather than as its sibling. Though syntactically valid, this usage is not recommended TEI practice. |
Example | Labels may also be used to represent a label or heading attached to a paragraph or sequence of paragraphs not treated as a structural division, or to a group of verse lines. Note that, in this case, the label element appears within the p or <lg> element, rather than as a preceding sibling of it. <p>[...] <lb/>& n’entrer en mauuais & mal-heu- <lb/>ré meſnage. Or des que le conſente- <lb/>ment des parties y eſt le mariage eſt <lb/> arreſté, quoy que de faict il ne ſoit <label place="margin">Puiſſance maritale entre les Romains.</label> <lb/> conſommé. Depuis la conſomma- <lb/>tion du mariage la femme eſt ſoubs <lb/> la puiſſance du mary, s’il n’eſt eſcla- <lb/>ue ou enfant de famille : car en ce <lb/> cas, la femme, qui a eſpouſé vn en- <lb/>fant de famille, eſt ſous la puiſſance [...]</p> In this example the text of the label appears in the right hand margin of the original source, next to the paragraph it describes, but approximately in the middle of it. If so desired the type attribute may be used to distinguish different categories of label. |
<language> characterizes a single language or sublanguage used within a text. [2.4.2. Language Usage] | |||||||||||||||||
Module | header | ||||||||||||||||
Attributes | Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp) | ||||||||||||||||
Contained by | header: langUsage | ||||||||||||||||
May contain | |||||||||||||||||
Declaration | element language { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute style { "Language" | "LanguageType" | "LanguageArea" }, attribute ident { data.language }, macro.phraseSeq.limited } | ||||||||||||||||
Example | <langUsage> <language ident="en-US" usage="75">modern American English</language> <language ident="i-az-Arab" usage="20">Azerbaijani in Arabic script</language> <language ident="x-lap" usage="05">Pig Latin</language> </langUsage> | ||||||||||||||||
Note | Particularly for sublanguages, an informal prose characterization should be supplied as content for the element. |
<langUsage> (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. Language Usage 2.4. The Profile Description 15.3.2. Declarable Elements] | |
Module | header |
Attributes | Attributes att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | header: profileDesc |
May contain | header: language |
Declaration | element langUsage { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, language+ } |
Example | <langUsage> <language ident="fr-CA" usage="60">Québecois</language> <language ident="en-CA" usage="20">Canadian business English</language> <language ident="en-GB" usage="20">British English</language> </langUsage> |
<list> contains any sequence of items organized as a list. [3.7. Lists] | |||||||||||||||
Module | core | ||||||||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) att.typed (type, @subtype) | ||||||||||||||
Member of | |||||||||||||||
Contained by | header: change revisionDesc sourceDesc tagUsage | ||||||||||||||
May contain | |||||||||||||||
Declaration | element list { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, att.typed.attribute.subtype, attribute type { "CorpusDocument" }, ( ( ( model.divTop ) | ( model.global ) )*, ( ( item, model.global* )+ | ( headLabel?, headItem?, ( label, model.global*, item, model.global* )+ ) ), ( ( model.divBottom ), model.global* )* ) } | ||||||||||||||
Schematron | <sch:rule context="tei:list[@type='gloss']"> <sch:assert test="tei:label">The content of a "gloss" list should include a sequence of one or more pairs of a label element followed by an item element</sch:assert></sch:rule> | ||||||||||||||
Example | <list rend="numbered"> <item>a butcher</item> <item>a baker</item> <item>a candlestick maker, with <list rend="bulleted"> <item>rings on his fingers</item> <item>bells on his toes</item> </list> </item> </list> | ||||||||||||||
Example | <list rend="bulleted" type="syllogism"> <item>All Cretans are liars.</item> <item>Epimenides is a Cretan.</item> <item>ERGO Epimenides is a liar.</item> </list> | ||||||||||||||
Example | <list rend="simple" type="litany"> <item>God save us from drought.</item> <item>God save us from pestilence.</item> <item>God save us from wickedness in high places.</item> <item>Praise be to God.</item> </list> | ||||||||||||||
Example | The following example treats the short numbered clauses of Anglo-Saxon legal codes as lists of items. The text is from an ordinance of King Athelstan (924–939): <div1 type="section"> <head>Athelstan's Ordinance</head> <list rend="numbered"> <item n="1">Concerning thieves. First, that no thief is to be spared who is caught with the stolen goods, [if he is] over twelve years and [if the value of the goods is] over eightpence. <list rend="numbered"> <item n="1.1">And if anyone does spare one, he is to pay for the thief with his wergild — and the thief is to be no nearer a settlement on that account — or to clear himself by an oath of that amount.</item> <item n="1.2">If, however, he [the thief] wishes to defend himself or to escape, he is not to be spared [whether younger or older than twelve].</item> <item n="1.3">If a thief is put into prison, he is to be in prison 40 days, and he may then be redeemed with 120 shillings; and the kindred are to stand surety for him that he will desist for ever.</item> <item n="1.4">And if he steals after that, they are to pay for him with his wergild, or to bring him back there.</item> <item n="1.5">And if he steals after that, they are to pay for him with his wergild, whether to the king or to him to whom it rightly belongs; and everyone of those who supported him is to pay 120 shillings to the king as a fine.</item> </list> </item> <item n="2">Concerning lordless men. And we pronounced about these lordless men, from whom no justice can be obtained, that one should order their kindred to fetch back such a person to justice and to find him a lord in public meeting. 
<list rend="numbered"> <item n="2.1">And if they then will not, or cannot, produce him on that appointed day, he is then to be a fugitive afterwards, and he who encounters him is to strike him down as a thief.</item> <item n="2.2">And he who harbours him after that, is to pay for him with his wergild or to clear himself by an oath of that amount.</item> </list> </item> <item n="3">Concerning the refusal of justice. The lord who refuses justice and upholds his guilty man, so that the king is appealed to, is to repay the value of the goods and 120 shillings to the king; and he who appeals to the king before he demands justice as often as he ought, is to pay the same fine as the other would have done, if he had refused him justice. <list rend="numbered"> <item n="3.1">And the lord who is an accessory to a theft by his slave, and it becomes known about him, is to forfeit the slave and be liable to his wergild on the first occasion; if he does it more often, he is to be liable to pay all that he owns.</item> <item n="3.2">And likewise any of the king's treasurers or of our reeves, who has been an accessory of thieves who have committed theft, is to be liable to the same.</item> </list> </item> <item n="4">Concerning treachery to a lord. And we have pronounced concerning treachery to a lord, that he [who is accused] is to forfeit his life if he cannot deny it or is afterwards convicted at the three-fold ordeal.</item> </list> </div1> Note that nested lists have been used so the tagging mirrors the structure indicated by the two-level numbering of the clauses. The clauses could have been treated as a one-level list with irregular numbering, if desired. | ||||||||||||||
Example | <p>These decrees, most blessed Pope Hadrian, we propounded in the public council ... and they confirmed them in our hand in your stead with the sign of the Holy Cross, and afterwards inscribed with a careful pen on the paper of this page, affixing thus the sign of the Holy Cross. <list rend="simple"> <item>I, Eanbald, by the grace of God archbishop of the holy church of York, have subscribed to the pious and catholic validity of this document with the sign of the Holy Cross.</item> <item>I, Ælfwold, king of the people across the Humber, consenting have subscribed with the sign of the Holy Cross.</item> <item>I, Tilberht, prelate of the church of Hexham, rejoicing have subscribed with the sign of the Holy Cross.</item> <item>I, Higbald, bishop of the church of Lindisfarne, obeying have subscribed with the sign of the Holy Cross.</item> <item>I, Ethelbert, bishop of Candida Casa, suppliant, have subscribed with the sign of the Holy Cross.</item> <item>I, Ealdwulf, bishop of the church of Mayo, have subscribed with devout will.</item> <item>I, Æthelwine, bishop, have subscribed through delegates.</item> <item>I, Sicga, patrician, have subscribed with serene mind with the sign of the Holy Cross.</item> </list> </p> | ||||||||||||||
Note | May contain an optional heading followed by a series of items, or a series of label and item pairs, the latter being optionally preceded by one or two specialized headings. |
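The generic examples above come from the TEI Guidelines and do not carry the type value that this customization's declaration makes mandatory. A minimal sketch of a list that would fit the customized declaration, assuming it is used to enumerate the documents of a corpus: the item texts are placeholders, sourceDesc is one of the parents listed under "Contained by", and any attributes the customization may require on item are not visible in this section and are therefore omitted.

```xml
<!-- Hypothetical sketch: document names are placeholders, not real corpus data -->
<sourceDesc>
  <list type="CorpusDocument">
    <item>Example Document A</item>
    <item>Example Document B</item>
  </list>
</sourceDesc>
```

Because "CorpusDocument" is the only value permitted by the declaration, values such as "gloss" or "syllogism" from the generic examples would presumably be rejected by a schema generated from this ODD.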
<namespace> supplies the formal name of the namespace to which the elements documented by its children belong. [2.3.4. The Tagging Declaration] | |||||||||||||||||||||
Module | header | ||||||||||||||||||||
Attributes | Attributes att.global (n, xml:lang, xml:base, xml:space, @xml:id) att.global.responsibility (@cert, @resp) | ||||||||||||||||||||
Contained by | header: tagsDecl | ||||||||||||||||||||
May contain | header: tagUsage | ||||||||||||||||||||
Declaration | element namespace { att.global.attribute.xmlid, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute rend { list { ( "Lexical" | "Syntactical" | "Morphological" | "MarkUp" | "Graphical" | "Transcription" | "Other" | "Meta" )+ } }, attribute corresp { text }?, attribute name { data.text }, tagUsage+ } | ||||||||||||||||||||
Example | <namespace name="http://www.tei-c.org/ns/1.0"> <tagUsage gi="hi" occurs="28" render="#it" withId="2"> Used only to mark English words italicized in the copy text </tagUsage> </namespace> |
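The declaration above makes rend and name mandatory on namespace and restricts rend to one or more of the listed category values, which the generic example does not show. A hedged sketch of how a namespace entry might look under this customization: the namespace name "example-layer" and the element names "lemma" and "pos" are hypothetical, and the optional corresp attribute is left out because its intended target is not described in this section.

```xml
<!-- Hypothetical sketch: name and gi values are illustrative assumptions -->
<tagsDecl>
  <namespace name="example-layer" rend="Lexical Morphological">
    <tagUsage gi="lemma">Placeholder description of the lemma annotation.</tagUsage>
    <tagUsage gi="pos">Placeholder description of the part-of-speech annotation.</tagUsage>
  </namespace>
</tagsDecl>
```

The rend value is a whitespace-separated list, so a single namespace can be assigned to several categories at once; at least one tagUsage child is required by the content model.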
<normalization> indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] | |
Module | header |
Attributes | Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | header: editorialDecl |
May contain | core: p |
Declaration | element normalization { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, model.pLike+ } |
Example | <editorialDecl> <normalization method="markup"> <p>Where both upper- and lower-case i, j, u, v, and vv have been normalized, to modern 20th century typographical practice, the <gi>choice</gi> element has been used to enclose <gi>orig</gi> and <gi>reg</gi> elements giving the original and new values respectively. ... </p> </normalization> <normalization method="silent"> <p>Spacing between words and following punctuation has been regularized to zero spaces; spacing between words has been regularized to one space.</p> </normalization> <normalization source="http://www.dict.sztaki.hu/webster"> <p>Spelling converted throughout to Modern American usage, based on Websters 9th Collegiate dictionary.</p> </normalization> </editorialDecl> |
<orgName> (organization name) contains an organizational name. [13.2.2. Organizational Names] | |
Module | namesdates |
Attributes | Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.typed (subtype, @type) |
Member of | |
Contained by | |
May contain | |
Declaration | element orgName { att.global.attribute.n, att.typed.attribute.type, macro.phraseSeq } |
Example | About a year back, a question of considerable interest was agitated in the <orgName key="PAS1" type="voluntary"> <placeName key="PEN">Pennsyla.</placeName> Abolition Society </orgName>.... |
<p> (paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents] | |
Module | core |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declaring (@decls) att.fragmentable (@part) |
Member of | |
Contained by | |
May contain | |
Declaration | element p { att.global.attributes, att.declaring.attributes, att.fragmentable.attributes, macro.paraContent } |
Example | <p>Hallgerd was outside. <q>There is blood on your axe,</q> she said. <q>What have you done?</q> </p> <p> <q>I have now arranged that you can be married a second time,</q> replied Thjostolf. </p> <p> <q>Then you must mean that Thorvald is dead,</q> she said. </p> <p> <q>Yes,</q> said Thjostolf. <q>And now you must think up some plan for me.</q> </p> |
<persName> (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc. [13.2.1. Personal Names] | |
Module | namesdates |
Member of | |
Contained by | |
May contain | |
Declaration | element persName { macro.phraseSeq } |
Example | <persName> <forename>Edward</forename> <forename>George</forename> <surname type="linked">Bulwer-Lytton</surname>, <roleName>Baron Lytton of <placeName>Knebworth</placeName> </roleName> </persName> |
<profileDesc> (text-profile description) contains the langUsage element, which specifies the language or languages used in the document. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components] | |
Module | header |
Attributes | Attributes att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | header: teiHeader |
May contain | header: langUsage |
Declaration | element profileDesc { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( model.profileDescPart* ) } |
Example | <profileDesc> <langUsage> <language ident="de" style="Language">Early New High German, Modern German</language> <language ident="de" style="LanguageType">UpperGerman</language> <language ident="de" style="LanguageArea">Bavarian</language> </langUsage> </profileDesc> |
Note | Although the content model permits it, it is rarely meaningful to supply multiple occurrences for any of the child elements of profileDesc unless these are documenting multiple texts. In earlier versions of these Guidelines, it was required that the <creation> element appear first. |
<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 15.3.2. Declarable Elements] | |
Module | header |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declarable (@default) |
Member of | |
Contained by | header: encodingDesc |
May contain | core: p |
Declaration | element projectDesc { att.global.attributes, att.declarable.attributes, model.pLike+ } |
Example | <projectDesc> <p>Texts collected for use in the Claremont Shakespeare Clinic, June 1990</p> </projectDesc> |
<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description] | |
Module | header |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) |
Contained by | header: fileDesc |
May contain | header: authority availability idno |
Declaration | element publicationStmt { att.global.attributes, ( ( ( model.publicationStmtPart.agency ), model.publicationStmtPart.detail* )+ | model.pLike+ ) } |
Example | <publicationStmt> <publisher>C. Muquardt </publisher> <pubPlace>Bruxelles & Leipzig</pubPlace> <date when="1846"/> </publicationStmt> |
Example | <publicationStmt> <publisher>Chadwyck Healey</publisher> <pubPlace>Cambridge</pubPlace> <availability> <p>Available under licence only</p> </availability> <date when="1992">1992</date> </publicationStmt> |
Note | Where a publication statement contains several members of the |
<ref> (reference) defines a reference to another location, possibly modified by additional text or comment. [3.6. Simple Links and Cross-References 16.1. Links] | |
Module | core |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.pointing (@targetLang, @target, @evaluate) att.internetMedia (@mimeType) att.typed (@type, @subtype) att.declaring (@decls) att.cReferencing (@cRef) |
Member of | |
Contained by | |
May contain | |
Declaration | element ref { att.global.attributes, att.pointing.attributes, att.internetMedia.attributes, att.typed.attributes, att.declaring.attributes, att.cReferencing.attributes, macro.paraContent } |
Schematron | <s:report test="@target and @cRef">Only one of the attributes @target and @cRef may be supplied on <s:name/></s:report> |
Example | See especially <ref target="http://www.natcorp.ox.ac.uk/Texts/A02.xml#s2">the second sentence</ref> |
Example | See also <ref target="#locution">s.v. <term>locution</term> </ref>. |
Note | The target and cRef attributes are mutually exclusive. |
<revisionDesc> (revision description) summarizes the revision history for a file. [2.5. The Revision Description 2.1.1. The TEI Header and Its Components] | |
Module | header |
Attributes | Attributes att.global (xml:id, xml:lang, @n, @xml:base, @xml:space) att.global.rendition (style, rendition, @rend) att.global.responsibility (@cert, @resp) |
Contained by | header: teiHeader |
May contain | |
Declaration | element revisionDesc { att.global.attribute.n, att.global.attribute.xmlbase, att.global.attribute.xmlspace, att.global.rendition.attribute.rend, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( list | listChange | change+ ) } |
Example | <revisionDesc status="embargoed"> <change when="1991-11-11" who="#LB"> deleted chapter 10 </change> </revisionDesc> |
Note | If present on this element, the status attribute should indicate the current status of the document. The same attribute may appear on any change to record the status at the time of that change. Conventionally change elements should be given in reverse date order, with the most recent change at the start of the list. |
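The ordering convention in the note can be illustrated with a short sketch. It assumes that the change element in this customization keeps the standard TEI when attribute, which cannot be confirmed from this section; the dates and descriptions are placeholders.

```xml
<!-- Hypothetical sketch: most recent change first; dates and texts are placeholders -->
<revisionDesc>
  <change when="2014-03-02">Corrected the list of documents.</change>
  <change when="2013-11-20">Added annotation metadata.</change>
  <change when="2013-09-01">Created the header.</change>
</revisionDesc>
```

The content model also allows a single list or listChange in place of the sequence of change elements.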
<segmentation> describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] | |
Module | header |
Attributes | Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | header: editorialDecl |
May contain | core: p |
Declaration | element segmentation { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, model.pLike+ } |
Example | <segmentation> <p> <gi>s</gi> elements mark orthographic sentences and are numbered sequentially within their parent <gi>div</gi> element </p> </segmentation> |
Example | <p> <gi>seg</gi> elements are used to mark functional constituents of various types within each <gi>s</gi>; the typology used is defined by a <gi>taxonomy</gi> element in the corpus header <gi>classDecl</gi> </p> |
<sourceDesc> (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. The Source Description] | |
Module | header |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declarable (@default) |
Contained by | header: fileDesc |
May contain | |
Declaration | element sourceDesc { att.global.attributes, att.declarable.attributes, ( model.pLike+ | ( model.biblLike | model.sourceDescPart | model.listLike )+ ) } |
Example | <sourceDesc> <bibl> <title level="a">The Interesting story of the Children in the Wood</title>. In <author>Victor E Neuberg</author>, <title>The Penny Histories</title>. <publisher>OUP</publisher> <date>1968</date>. </bibl> </sourceDesc> |
Example | <sourceDesc> <p>Born digital: no previous source exists.</p> </sourceDesc> |
<surname> contains a family (inherited) name, as opposed to a given, baptismal, or nick name. [13.2.1. Personal Names] | |
Module | namesdates |
Attributes | Attributes att.global.responsibility (@cert, @resp) |
Member of | |
Contained by | |
May contain | |
Declaration | element surname { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, macro.phraseSeq } |
Example | <surname type="combine">St John Stevas</surname> |
<tagsDecl> (tagging declaration) provides detailed information about the tagging applied to a document. [2.3.4. The Tagging Declaration 2.3. The Encoding Description] | |||||||||
Module | header | ||||||||
Attributes | Attributes att.global.responsibility (@cert, @resp) | ||||||||
Member of | |||||||||
Contained by | header: encodingDesc | ||||||||
May contain | header: namespace | ||||||||
Declaration | element tagsDecl { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute partial { data.truthValue }?, ( rendition*, namespace* ) } | ||||||||
Example | <tagsDecl> <rendition xml:id="rend-it">to be rendered in italic font</rendition> <namespace name="http://www.tei-c.org/ns/1.0"> <tagUsage gi="hi" occurs="467" render="#rend-it"/> <tagUsage gi="title" occurs="45" render="#rend-it"/> </namespace> <namespace name="http://docbook.org/ns/docbook"> <tagUsage gi="para" occurs="10"/> </namespace> </tagsDecl> |
<tagUsage> supplies information about the usage of a specific element within a text. [2.3.4. The Tagging Declaration] | |||||||||||||||||||||||||
Module | header | ||||||||||||||||||||||||
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) | ||||||||||||||||||||||||
Contained by | header: namespace | ||||||||||||||||||||||||
May contain | |||||||||||||||||||||||||
Declaration | element tagUsage { att.global.attributes, attribute gi { data.text }, attribute occurs { data.count }?, attribute withId { data.count }?, attribute render { list { data.pointer+ } }?, macro.limitedContent } | ||||||||||||||||||||||||
Example | <tagsDecl> <rendition xml:id="it">Render using a slant or italic variant on the current font</rendition> <!-- ... --> <namespace name="http://www.tei-c.org/ns/1.0"> <tagUsage gi="hi" occurs="28" render="#it" withId="2"> Used to mark English words italicized in the copy text.</tagUsage> <tagUsage gi="foreign" render="#it">Used to mark non-English words in the copy text.</tagUsage> <!-- ... --> </namespace> </tagsDecl> |
<TEI> (TEI document) contains a single TEI-conformant document, containing a single TEI header, a single text, one or more members of the model.resourceLike class, or a combination of these. A series of TEI elements may be combined together to form a <teiCorpus> element. [4. Default Text Structure 15.1. Varieties of Composite Text] | |||||||||
Module | textstructure | ||||||||
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) | ||||||||
Contained by | — | ||||||||
May contain | |||||||||
Declaration | element TEI { att.global.attributes, attribute version { data.version }?, ( teiHeader, ( ( model.resourceLike+, text? ) | text ) ) } | ||||||||
Schematron | <s:ns prefix="tei" uri="http://www.tei-c.org/ns/1.0"/> <s:ns prefix="xs" uri="http://www.w3.org/2001/XMLSchema"/> | ||||||||
Schematron | <s:ns prefix="rng" uri="http://relaxng.org/ns/structure/1.0"/> | ||||||||
Example | <TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0"> <teiHeader> <fileDesc> <titleStmt> <title>The shortest TEI Document Imaginable</title> </titleStmt> <publicationStmt> <p>First published as part of TEI P2, this is the P5 version using a name space.</p> </publicationStmt> <sourceDesc> <p>No source: this is an original work.</p> </sourceDesc> </fileDesc> </teiHeader> <text> <body> <p>This is about the shortest TEI document imaginable.</p> </body> </text> </TEI> | ||||||||
Example | <TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0"> <teiHeader> <fileDesc> <titleStmt> <title>A TEI Document containing four page images </title> </titleStmt> <publicationStmt> <p>Unpublished demonstration file.</p> </publicationStmt> <sourceDesc> <p>No source: this is an original work.</p> </sourceDesc> </fileDesc> </teiHeader> <facsimile> <graphic url="page1.png"/> <graphic url="page2.png"/> <graphic url="page3.png"/> <graphic url="page4.png"/> </facsimile> </TEI> | ||||||||
Note | This element is required. It is customary to specify the TEI namespace |
<teiHeader> (TEI header) supplies the descriptive and declarative information making up an electronic title page for every TEI-conformant document. [2.1.1. The TEI Header and Its Components 15.1. Varieties of Composite Text] | |||||||||
Module | header | ||||||||
Attributes | Attributes att.global (xml:id, xml:lang, xml:base, xml:space, @n) att.global.responsibility (@cert, @resp) | ||||||||
Contained by | textstructure: TEI | ||||||||
May contain | header: encodingDesc fileDesc profileDesc revisionDesc | ||||||||
Declaration | element teiHeader { att.global.attribute.n, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute type { "CorpusHeader" | "DocumentHeader" | "PreparationHeader" }, ( fileDesc, model.teiHeaderPart*, revisionDesc? ) } | ||||||||
Example | <teiHeader> <fileDesc> <titleStmt> <title>Shakespeare: the first folio (1623) in electronic form</title> <author>Shakespeare, William (1564–1616)</author> <respStmt> <resp>Originally prepared by</resp> <name>Trevor Howard-Hill</name> </respStmt> <respStmt> <resp>Revised and edited by</resp> <name>Christine Avern-Carr</name> </respStmt> </titleStmt> <publicationStmt> <distributor>Oxford Text Archive</distributor> <address> <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine> </address> <idno type="OTA">119</idno> <availability> <p>Freely available on a non-commercial basis.</p> </availability> <date when="1968">1968</date> </publicationStmt> <sourceDesc> <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile, 1968)</bibl> </sourceDesc> </fileDesc> <encodingDesc> <projectDesc> <p>Originally prepared for use in the production of a series of old-spelling concordances in 1968, this text was extensively checked and revised for use during the editing of the new Oxford Shakespeare (Wells and Taylor, 1989).</p> </projectDesc> <editorialDecl> <correction> <p>Turned letters are silently corrected.</p> </correction> <normalization> <p>Original spelling and typography is retained, except that long s and ligatured forms are not encoded.</p> </normalization> </editorialDecl> <refsDecl xml:id="ASLREF"> <cRefPattern matchPattern="(\S+) ([^.]+)\.(.*)" replacementPattern="#xpath(//div1[@n='$1']/div2/[@n='$2']//lb[@n='$3'])"> <p>A reference is created by assembling the following, in the reverse order as that listed here: <list> <item>the <att>n</att> value of the preceding <gi>lb</gi> </item> <item>a period</item> <item>the <att>n</att> value of the ancestor <gi>div2</gi> </item> <item>a space</item> <item>the <att>n</att> value of the parent <gi>div1</gi> </item> </list> </p> </cRefPattern> </refsDecl> </encodingDesc> <revisionDesc> <list> <item> <date when="1989-04-12">12 Apr 89</date> Last checked by CAC</item> <item> <date 
when="1989-03-01">1 Mar 89</date> LB made new file</item> </list> </revisionDesc> </teiHeader> | ||||||||
Note | One of the few elements unconditionally required in any TEI document. |
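The generic example above predates this customization and lacks the type attribute that the customized declaration requires. The following schematic, in the spirit of a sketch rather than a complete header, shows the four components listed under "May contain" together with one of the three permitted type values; the comments stand in for content whose exact requirements are defined elsewhere in the customization.

```xml
<!-- Hypothetical schematic: comments mark content not spelled out here -->
<teiHeader type="CorpusHeader">
  <fileDesc>
    <!-- titleStmt, publicationStmt and sourceDesc for the corpus -->
  </fileDesc>
  <encodingDesc>
    <!-- projectDesc, editorialDecl, tagsDecl as applicable -->
  </encodingDesc>
  <profileDesc>
    <!-- langUsage with one or more language elements -->
  </profileDesc>
  <revisionDesc>
    <!-- change elements, most recent first -->
  </revisionDesc>
</teiHeader>
```

A document or preparation-step header would use type="DocumentHeader" or type="PreparationHeader" instead. According to the content model, fileDesc comes first and revisionDesc, if present, comes last.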
<text> contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 15.1. Varieties of Composite Text] | |
Module | textstructure |
Attributes | Attributes att.global (@xml:id, @n, @xml:lang, @xml:base, @xml:space) (att.global.rendition (@rend, @style, @rendition)) (att.global.responsibility (@cert, @resp)) att.declaring (@decls) att.typed (@type, @subtype) |
Contained by | textstructure: TEI |
May contain | Empty element |
Declaration | element text { att.global.attributes, att.declaring.attributes, att.typed.attributes, ( model.global*, ( front, model.global* )?, ( body | group ), model.global*, ( back, model.global* )? ) } |
Example | <text> <front> <docTitle> <titlePart>Autumn Haze</titlePart> </docTitle> </front> <body> <l>Is it a dragonfly or a maple leaf</l> <l>That settles softly down upon the water?</l> </body> </text> |
Example | The body of a text may be replaced by a group of nested texts, as in the following schematic: <text> <front> <!-- front matter for the whole group --> </front> <group> <text> <!-- first text --> </text> <text> <!-- second text --> </text> </group> </text> |
Note | This element should not be used to represent a text which is inserted at an arbitrary point within the structure of another, for example as in an embedded or quoted narrative; the <floatingText> is provided for this purpose. |
<title> contains a title for any kind of work. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement] | |||||||||||||
Module | core | ||||||||||||
Attributes | Attributes att.global (xml:id, n, xml:lang, @xml:base, @xml:space) att.global.rendition (style, rendition, @rend) att.global.responsibility (@cert, @resp) | ||||||||||||
Member of | |||||||||||||
Contained by | |||||||||||||
May contain | |||||||||||||
Declaration | element title { att.global.attribute.xmlbase, att.global.attribute.xmlspace, att.global.rendition.attribute.rend, att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, attribute type { "Corpus" }?, macro.paraContent } | ||||||||||||
Example | <title>Information Technology and the Research Process: Proceedings of a conference held at Cranfield Institute of Technology, UK, 18–21 July 1989</title> | ||||||||||||
Example | <title>Hardy's Tess of the D'Urbervilles: a machine readable edition</title> | ||||||||||||
Example | <title type="full"> <title type="main">Synthèse</title> <title type="sub">an international journal for epistemology, methodology and history of science</title> </title> | ||||||||||||
Note | The attributes key and ref, inherited from the class att.canonical may be used to indicate the canonical form for the title; the former, by supplying (for example) the identifier of a record in some external library system; the latter by pointing to an XML element somewhere containing the canonical form of the title. |
<titleStmt> (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description] | |
Module | header |
Attributes | Attributes att.global.responsibility (@cert, @resp) |
Contained by | header: fileDesc |
May contain | |
Declaration | element titleStmt { att.global.responsibility.attribute.cert, att.global.responsibility.attribute.resp, ( title+, model.respLike* ) } |
Example | <titleStmt> <title>Capgrave's Life of St. John Norbert: a machine-readable transcription</title> <respStmt> <resp>compiled by</resp> <name>P.J. Lucas</name> </respStmt> </titleStmt> |
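The content model `( title+, model.respLike* )` can be checked mechanically. The following is a minimal sketch, not part of the schema: it assumes un-namespaced element names and takes `author` and `editor` as the members of model.respLike, as listed for this customization. The function name `title_stmt_is_valid` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Members of model.respLike in this customization (assumption taken from the class listing).
RESP_LIKE = {"author", "editor"}


def title_stmt_is_valid(xml_text: str) -> bool:
    """Return True if the children follow ( title+, model.respLike* )."""
    root = ET.fromstring(xml_text)
    tags = [child.tag for child in root]

    # Count the leading run of <title> elements; at least one is required.
    n_titles = 0
    while n_titles < len(tags) and tags[n_titles] == "title":
        n_titles += 1
    if n_titles == 0:
        return False

    # Everything after the titles must belong to model.respLike.
    return all(tag in RESP_LIKE for tag in tags[n_titles:])
```

Under these assumptions a `titleStmt` whose only non-title child is `respStmt` would be rejected, because `respStmt` is not among the listed members of model.respLike.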
model.addressLike groups elements used to represent a postal or email address. [1. The TEI Infrastructure] | |
Module | tei |
Used by | |
Members | affiliation |
model.applicationLike groups elements used to record application-specific information about a document in its header. | |
Module | tei |
Used by | |
Members | application |
model.dateLike groups elements containing temporal expressions. [3.5.4. Dates and Times 13.3.6. Dates and Times] | |
Module | tei |
Used by | |
Members | date |
model.divBottom groups elements appearing at the end of a text division. [4.2. Elements Common to All Divisions] | |
Module | tei |
Used by | |
Members | model.divBottomPart model.divWrapper |
model.divPart groups paragraph-level elements appearing directly within divisions. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Members | model.lLike model.pLike [p] |
Note | Note that this element class does not include members of the model.inter class, which can appear either within or between paragraph-level items. |
model.divTop groups elements appearing at the beginning of a text division. [4.2. Elements Common to All Divisions] | |
Module | tei |
Used by | |
Members | model.divTopPart [model.headLike] model.divWrapper |
model.divTopPart groups elements which can occur only at the beginning of a text division. [4.6. Title Pages] | |
Module | tei |
Used by | |
Members | model.headLike |
model.editorialDeclPart groups elements which may be used inside editorialDecl and appear multiple times. | |
Module | tei |
Used by | |
Members | normalization segmentation |
model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. [3.3. Highlighting and Quotation] | |
Module | tei |
Used by | |
Members | title |
model.encodingDescPart groups elements which may be used inside encodingDesc and appear multiple times. | |
Module | tei |
Used by | |
Members | appInfo editorialDecl projectDesc tagsDecl |
model.global groups elements which may appear at any point within a TEI text. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Members | model.global.edit model.global.meta model.milestoneLike model.noteLike |
model.highlighted groups phrase-level elements which are typographically distinct. [3.3. Highlighting and Quotation] | |
Module | tei |
Used by | |
Members | model.emphLike [title] model.hiLike |
model.inter groups elements which can appear either within or between paragraph-like elements. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Members | model.biblLike model.egLike model.labelLike [label] model.listLike [list] model.oddDecl model.qLike [model.quoteLike] model.stageLike |
model.labelLike groups elements used to gloss or explain other parts of a document. | |
Module | tei |
Used by | |
Members | label |
model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Members | model.emphLike [title] model.hiLike model.pPart.data [model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno] ] model.pPart.editorial model.pPart.msdesc model.phrase.xml model.ptrLike [ref] |
model.listLike groups list-like elements. [3.7. Lists] | |
Module | tei |
Used by | |
Members | list |
model.nameLike groups elements which name or refer to a person, place, or organization. | |
Module | tei |
Used by | |
Members | model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno |
Note | A superset of the naming elements that may appear in datelines, addresses, statements of responsibility, etc. |
model.nameLike.agent groups elements which contain names of individuals or corporate bodies. [3.5. Names, Numbers, Dates, Abbreviations, and Addresses] | |
Module | tei |
Used by | |
Members | orgName persName |
Note | This class is used in the content model of elements which reference names of people or organizations. |
model.persNamePart groups elements which form part of a personal name. [13.2.1. Personal Names] | |
Module | namesdates |
Used by | |
Members | forename surname |
model.phrase groups elements which can occur at the level of individual words or phrases. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Members | model.graphicLike model.highlighted [model.emphLike [title] model.hiLike] model.lPart model.pPart.data [model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno] ] model.pPart.edit [model.pPart.editorial model.pPart.transcriptional] model.pPart.msdesc model.phrase.xml model.ptrLike [ref] model.segLike model.specDescLike |
Note | This class of elements can occur within paragraphs, list items, lines of verse, etc. |
model.placeStateLike groups elements which describe changing states of a place. | |
Module | tei |
Used by | |
Members | model.placeNamePart |
model.pLike groups paragraph-like elements. | |
Module | tei |
Used by | |
Members | p |
model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data. [3.5. Names, Numbers, Dates, Abbreviations, and Addresses] | |
Module | tei |
Used by | |
Members | model.addressLike [affiliation] model.dateLike [date] model.measureLike model.nameLike [model.nameLike.agent [orgName persName] model.offsetLike model.persNamePart [forename surname] model.placeStateLike [model.placeNamePart] idno] |
model.pPart.edit groups phrase-level elements for simple editorial correction and transcription. [3.4. Simple Editorial Changes] | |
Module | tei |
Used by | |
Members | model.pPart.editorial model.pPart.transcriptional |
model.profileDescPart groups elements which may be used inside profileDesc and appear multiple times. | |
Module | tei |
Used by | |
Members | langUsage |
model.ptrLike groups elements used for purposes of location and reference. [3.6. Simple Links and Cross-References] | |
Module | tei |
Used by | |
Members | ref |
model.publicationStmtPart.agency groups the child elements of a publicationStmt element of the TEI header that indicate an authorising agent. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | tei |
Used by | |
Members | authority |
Note | The ‘agency’ child elements, while not required, are required if one of the ‘detail’ child elements is to be used. It is not valid to have a ‘detail’ child element without a preceding ‘agency’ child element. See also model.publicationStmtPart.detail. |
model.publicationStmtPart.detail groups the agency-specific child elements of the publicationStmt element of the TEI header. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | tei |
Used by | |
Members | availability date idno |
Note | A ‘detail’ child element may not occur unless an ‘agency’ child element precedes it. See also model.publicationStmtPart.agency. |
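The ordering rule stated in the two notes above (no 'detail' child without a preceding 'agency' child) might be checked as follows. This is a sketch under assumptions: it receives the child element names of a `publicationStmt` as a plain list, uses the class members listed for this customization, and checks only this one rule rather than the full content model. The function name is hypothetical.

```python
# Class members as listed for this customization.
AGENCY = {"authority"}                    # model.publicationStmtPart.agency
DETAIL = {"availability", "date", "idno"}  # model.publicationStmtPart.detail


def publication_stmt_order_ok(child_names):
    """Return False if a 'detail' child occurs before any 'agency' child."""
    seen_agency = False
    for name in child_names:
        if name in AGENCY:
            seen_agency = True
        elif name in DETAIL and not seen_agency:
            return False
    return True
```

For example, `["authority", "idno", "date"]` passes, while `["idno", "authority"]` does not.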
model.qLike groups elements related to highlighting which can appear either within or between chunk-level elements. [3.3. Highlighting and Quotation] | |
Module | tei |
Used by | |
Members | model.quoteLike |
model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element. | |
Module | tei |
Used by | |
Members | author editor |
model.teiHeaderPart groups high level elements which may appear more than once in a TEI header. | |
Module | tei |
Used by | |
Members | encodingDesc profileDesc |
att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual. [3.3.3. Quotation 8.3. Elements Unique to Spoken Texts] | |||||||||||
Module | tei | ||||||||||
Members | change | ||||||||||
Attributes | Attributes
|
att.canonical provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced. | |||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||
Members | att.naming [att.personal] | ||||||||||||||||||||||
Attributes | Attributes
|
att.cReferencing provides an attribute which may be used to supply a canonical reference as a means of identifying the target of a pointer. | |||||||||
Module | tei | ||||||||
Members | ref | ||||||||
Attributes | Attributes
|
att.datable provides attributes for normalization of elements that contain dates, times, or datable events. [3.5.4. Dates and Times 13.3.6. Dates and Times] | |||||||||||||||||
Module | tei | ||||||||||||||||
Members | application change | ||||||||||||||||
Attributes | Attributes att.datable.w3c (@when, @notBefore, @notAfter, @from, @to) att.datable.iso (@when-iso, @notBefore-iso, @notAfter-iso, @from-iso, @to-iso) att.datable.custom (@when-custom, @notBefore-custom, @notAfter-custom, @from-custom, @to-custom, @datingPoint, @datingMethod)
| ||||||||||||||||
Note | This ‘superclass’ provides attributes that can be used to provide normalized values of temporal information. By default, the attributes from the att.datable.w3c class are provided. If the module for names & dates is loaded, this class also provides attributes from the att.datable.iso and att.datable.custom classes. In general, the possible values of attributes restricted to the W3C datatypes form a subset of those values available via the ISO 8601 standard. However, the greater expressiveness of the ISO datatypes may not be needed, and there exists much greater software support for the W3C datatypes. |
att.datable.custom provides attributes for normalization of elements that contain datable events to a custom dating system (i.e. other than the Gregorian used by W3 and ISO). [13.3.6. Dates and Times] | |||||||||||||||||||||||||||||||||||||||||||||||||||
Module | namesdates | ||||||||||||||||||||||||||||||||||||||||||||||||||
Members | att.datable [application change] | ||||||||||||||||||||||||||||||||||||||||||||||||||
Attributes | Attributes
|
att.datable.iso provides attributes for normalization of elements that contain datable events using the ISO 8601 standard. [3.5.4. Dates and Times 13.3.6. Dates and Times] | |||||||||||||||||||||||||||||||||||
Module | namesdates | ||||||||||||||||||||||||||||||||||
Members | att.datable [application change] | ||||||||||||||||||||||||||||||||||
Attributes | Attributes
| ||||||||||||||||||||||||||||||||||
Note | The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by ISO 8601, using the Gregorian calendar. | ||||||||||||||||||||||||||||||||||
Note | If both when-iso and dur-iso are specified, the values should be interpreted as indicating a span of time by its starting time (or date) and duration. That is, <date dur-iso="P8D" when-iso="2007-06-01"/> indicates the same time period as <date when-iso="2007-06-01/P8D"/> In providing a ‘regularized’ form, no claim is made that the form in the source text is incorrect; the regularized form is simply that chosen as the main form for purposes of unifying variant forms under a single heading. |
att.datable.w3c provides attributes for normalization of elements that contain datable events conforming to the W3C XML Schema Part 2: Datatypes Second Edition. [3.5.4. Dates and Times 13.3.6. Dates and Times] | |||||||||||||||||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||||||||||||||||
Members | att.datable [application change] | ||||||||||||||||||||||||||||||||||||
Attributes | Attributes
| ||||||||||||||||||||||||||||||||||||
Example | <date from="1863-05-28" to="1863-06-01">28 May through 1 June 1863</date> | ||||||||||||||||||||||||||||||||||||
Note | The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by XML Schema Part 2: Datatypes Second Edition, using the Gregorian calendar. The most commonly encountered format for the date portion of a temporal attribute is yyyy-mm-dd. Note that this format does not currently permit use of the value 0000 to represent the year 1 BCE; instead the value -0001 should be used. |
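The restriction on the year 0000 can be illustrated with a small lexical check. This is a simplified sketch, not a full xsd:date validator: it ignores time zone suffixes and does not check the day against the length of the month. The function name `is_w3c_date` is hypothetical.

```python
import re

# Optional minus sign, at least four year digits, two-digit month and day.
_DATE = re.compile(r"(-?)(\d{4,})-(\d{2})-(\d{2})")


def is_w3c_date(value: str) -> bool:
    """Rough lexical check for a date such as 1863-05-28 or -0001-01-01."""
    match = _DATE.fullmatch(value)
    if match is None:
        return False
    _sign, year, month, day = match.groups()
    if int(year) == 0:
        # 0000 is not a permitted year; 1 BCE is written -0001.
        return False
    # Simplification: days are only range-checked, not matched to the month.
    return 1 <= int(month) <= 12 and 1 <= int(day) <= 31
```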
att.declarable provides attributes for those elements in the TEI header which may be independently selected by means of the special purpose decls attribute. [15.3. Associating Contextual Information with a Text] | |||||||||
Module | tei | ||||||||
Members | projectDesc sourceDesc | ||||||||
Attributes | Attributes
| ||||||||
Note | The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text. Only one element of a particular type may have a default attribute with a value of true. |
att.declaring provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. [15.3. Associating Contextual Information with a Text] | |||||||
Module | tei | ||||||
Members | p ref text | ||||||
Attributes | Attributes
| ||||||
Note | The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text. |
att.dimensions provides attributes for describing the size of physical objects. | |||||||||||||||||||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||||||||||||||||||
Members | att.editLike | ||||||||||||||||||||||||||||||||||||||
Attributes | Attributes att.ranging (@atLeast, @atMost, @min, @max, @confidence)
|
att.docStatus provides attributes for use on metadata elements describing the status of a document. | |||||||||
Module | tei | ||||||||
Members | change | ||||||||
Attributes | Attributes
| ||||||||
Example | <revisionDesc status="published"> <change status="published" when="2010-10-21"/> <change status="cleared" when="2010-10-02"/> <change status="embargoed" when="2010-08-02"/> <change status="frozen" when="2010-05-01" who="#MSM"/> <change status="draft" when="2010-03-01" who="#LB"/> </revisionDesc> |
att.fragmentable groups structural elements which may be fragmented, usually as a consequence of some overlapping hierarchy. | |||||||||||
Module | tei | ||||||||||
Members | p | ||||||||||
Attributes | Attributes
|
att.global provides attributes common to all elements in the TEI encoding scheme. [1.3.1.1. Global Attributes] | |||||||||||||||||||||||||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||||||||||||||||||||||||
Members | TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text | ||||||||||||||||||||||||||||||||||||||||||||
Attributes | Attributes att.global.rendition (@rend, @style, @rendition) att.global.responsibility (@cert, @resp)
|
att.global.rendition provides rendering attributes common to all elements in the TEI encoding scheme. [1.3.1.1.3. Rendition Indicators] | |||||||||||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||||||||||
Members | att.global [TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text] | ||||||||||||||||||||||||||||||
Attributes | Attributes
|
att.global.responsibility provides attributes indicating the agency responsible for some aspect of the text, the markup or something asserted by the markup, and the degree of certainty associated with it. [3.4. Simple Editorial Changes 11.3.2.2. Hand, Responsibility, and Certainty Attributes 17.3. Spans and Interpretations 13.1.1. Linking Names and Their Referents] | |||||||||||||||
Module | tei | ||||||||||||||
Members | att.global [TEI appInfo application authority change encodingDesc idno label p projectDesc publicationStmt ref sourceDesc tagUsage text] | ||||||||||||||
Attributes | Attributes
| ||||||||||||||
Example | Blessed are the <choice> <sic>placemakers</sic> <corr cert="high" resp="#editor">peacemakers</corr> </choice>: for they shall be called the children of God. | ||||||||||||||
Example | <!-- in the <text> ... --><lg> <!-- ... --> <l>Punkes, Panders, baſe extortionizing sla<choice> <sic>n</sic> <corr resp="#JENS1_transcriber">u</corr> </choice>es,</l> <!-- ... --> </lg> <!-- in the <teiHeader> ... --> <!-- ... --> <respStmt xml:id="JENS1_transcriber"> <resp when="2014">Transcriber</resp> <name>Janelle Jenstad</name> </respStmt> |
att.internetMedia provides attributes for specifying the type of a computer resource using a standard taxonomy. | |||||||
Module | tei | ||||||
Members | ref | ||||||
Attributes | Attributes
| ||||||
Example | In this example mimeType is used to indicate that the URL points to a TEI XML file encoded in UTF-8. <ref mimeType="application/tei+xml; charset=UTF-8" target="http://sourceforge.net/p/tei/code/HEAD/tree/trunk/P5/Source/guidelines-en.xml"/> | ||||||
Note | This attribute class provides an attribute for describing a computer resource, typically available over the internet, using a value taken from a standard taxonomy. At present only a single taxonomy is supported, the Multipurpose Internet Mail Extensions (MIME) Media Type system. This typology of media types is defined by the Internet Engineering Task Force in RFC 2046. The list of types is maintained by the Internet Assigned Numbers Authority (IANA). The mimeType attribute must have a value taken from this list. |
att.naming provides attributes common to elements which refer to named persons, places, organizations etc. [3.5.1. Referring Strings 13.3.5. Names and Nyms] | |||||||||||||||
Module | tei | ||||||||||||||
Members | att.personal | ||||||||||||||
Attributes | Attributes att.canonical (@key, @ref)
|
att.placement provides attributes for describing where on the source page or object a textual element appears. [3.4.3. Additions, Deletions, and Omissions 11.3.1.4. Additions and Deletions] | |||||||||||||
Module | tei | ||||||||||||
Members | label | ||||||||||||
Attributes | Attributes
|
att.pointing defines a set of attributes used by all elements which point to other elements by means of one or more URI references. [1.3.1.1.2. Language Indicators 3.6. Simple Links and Cross-References] | |||||||||||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||||||||||
Members | ref | ||||||||||||||||||||||||||||||
Attributes | Attributes
|
att.ranging provides attributes for describing numerical ranges. | |||||||||||||||||||||||||||||||
Module | tei | ||||||||||||||||||||||||||||||
Members | att.dimensions [att.editLike] | ||||||||||||||||||||||||||||||
Attributes | Attributes
| ||||||||||||||||||||||||||||||
Example | The MS. was lost in transmission by mail from <del rend="overstrike"> <gap atLeast="1" atMost="2" extent="one or two letters" reason="illegible" unit="chars"/> </del> Philadelphia to the Graphic office, New York. |
att.sortable provides attributes for elements in lists or groups that are sortable, but whose sorting key cannot be derived mechanically from the element content. [9.1. Dictionary Body and Overall Structure] | |||||||||||
Module | tei | ||||||||||
Members | idno | ||||||||||
Attributes | Attributes
|
att.source provides attributes for pointing to the source of a bibliographic reference. [3.3.3. Quotation 8.3.4. Writing] | |||||||
Module | tei | ||||||
Members | att.editLike | ||||||
Attributes | Attributes
| ||||||
Example | <p> <!-- ... --> As Willard McCarty (<bibl xml:id="mcc_2012">2012, p.2</bibl>) tells us, <quote source="#mcc_2012">‘Collaboration’ is a problematic and should be a contested term.</quote> <!-- ... --> </p> | ||||||
Example | <p> <!-- ... --> <quote source="#chicago_15_ed">Grammatical theories are in flux, and the more we learn, the less we seem to know.</quote> <!-- ... --> </p> <!-- ... --> <bibl xml:id="chicago_15_ed"> <title level="m">The Chicago Manual of Style</title>, <edition>15th edition</edition>. <pubPlace>Chicago</pubPlace>: <publisher>University of Chicago Press</publisher> (<date>2003</date>), <biblScope unit="page">p.147</biblScope>. </bibl> |
att.typed provides attributes which can be used to classify or subclassify elements in any way. [1.3.1. Attribute Classes 17.1.1. Words and Above 3.5.1. Referring Strings 3.6. Simple Links and Cross-References 3.5.5. Abbreviations and Their Expansions 3.12.1. Core Tags for Verse 7.2.5. Speech Contents 4.1.1. Un-numbered Divisions 4.1.2. Numbered Divisions 4.2.1. Headings and Trailers 4.4. Virtual Divisions 13.3.2.3. Personal Relationships 11.3.1.1. Core Elements for Transcriptional Work 16.1.1. Pointers and Links 16.3. Blocks, Segments, and Anchors 12.2. Linking the Apparatus to the Text 22.4.4.2. RELAX NG Content Models 8.3. Elements Unique to Spoken Texts 23.3.1.4. Modification of Attribute and Attribute Value Lists] | |||||||||||||||||||
Module | tei | ||||||||||||||||||
Members | application change label ref text | ||||||||||||||||||
Attributes | Attributes
| ||||||||||||||||||
Schematron | <sch:rule context="*[@subtype]"> <sch:assert test="@type">The <sch:name/> element should not be categorized in detail with @subtype unless also categorized in general with @type</sch:assert></sch:rule> | ||||||||||||||||||
Note | When appropriate, values from an established typology should be used. Alternatively a typology may be defined in the associated TEI header. If values are to be taken from a project-specific list, this should be defined using the <valList> element in the project-specific schema description, as described in 23.3.1.4. Modification of Attribute and Attribute Value Lists . |
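The Schematron constraint above (no `subtype` without `type`) can be approximated outside a Schematron processor. The sketch below uses only the Python standard library and reports the names of offending elements; the function name is hypothetical, and the check is an illustration rather than a replacement for schema validation.

```python
import xml.etree.ElementTree as ET


def subtype_without_type(xml_text: str) -> list:
    """Return the tags of all elements that carry @subtype but no @type."""
    root = ET.fromstring(xml_text)
    return [
        element.tag
        for element in root.iter()
        if "subtype" in element.attrib and "type" not in element.attrib
    ]
```

An empty result means the document satisfies the rule.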
data.certainty defines the range of attribute values expressing a degree of certainty. | |
Module | tei |
Used by | |
Declaration | data.certainty = "high" | "medium" | "low" | "unknown" |
Note | Certainty may be expressed by one of the predefined symbolic values high, medium, or low. The value unknown should be used in cases where the encoder does not wish to assert an opinion about the matter. For more precise indication, data.probability may be used instead or in addition. |
data.count defines the range of attribute values used for a non-negative integer value used as a count. | |
Module | tei |
Used by | |
Declaration | data.count = xsd:nonNegativeInteger |
Note | Only non-negative integer values (zero and the positive integers) are permitted. |
data.duration.iso defines the range of attribute values available for representation of a duration in time using ISO 8601 standard formats | |
Module | tei |
Used by | |
Declaration | data.duration.iso = token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" } |
Example | <time dur-iso="PT0,75H">three-quarters of an hour</time> |
Example | <date dur-iso="P1,5D">a day and a half</date> |
Example | <date dur-iso="P14D">a fortnight</date> |
Example | <time dur-iso="PT0.02S">20 ms</time> |
Note | A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the last, which may have a decimal component (using either . or , as the decimal point). For complete details, see ISO 8601 Data elements and interchange formats – Information interchange – Representation of dates and times. |
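The declared pattern for data.duration.iso only restricts the characters that may appear; it does not enforce the structure of a duration. The following sketch applies the pattern as a whole-string match, which is how schema patterns are interpreted. The function name is hypothetical.

```python
import re

# Pattern copied from the declaration of data.duration.iso.
DURATION_ISO = re.compile(r"[0-9.,DHMPRSTWYZ/:+\-]+")


def matches_duration_iso(value: str) -> bool:
    """True if the whole value consists of characters allowed by the pattern."""
    return DURATION_ISO.fullmatch(value) is not None
```

All four examples above (`PT0,75H`, `P1,5D`, `P14D`, `PT0.02S`) match. A string such as `ZZZ` matches as well, so the pattern is a necessary but not a sufficient condition for a well-formed ISO 8601 duration.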
data.duration.w3c defines the range of attribute values available for representation of a duration in time using W3C datatypes. | |
Module | tei |
Used by | |
Declaration | data.duration.w3c = xsd:duration |
Example | <time dur="PT45M">forty-five minutes</time> |
Example | <date dur="P1DT12H">a day and a half</date> |
Example | <date dur="P7D">a week</date> |
Example | <time dur="PT0.02S">20 ms</time> |
Note | A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the S (second) number, which may have a decimal component (using . as the decimal point). For complete details, see the W3C specification. |
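To show how the number-letter pairs combine, the sketch below converts a W3C duration into a Python `timedelta`. It is deliberately limited: it assumes durations without year or month components (which have no fixed length) and without a leading minus sign, and the function name is hypothetical.

```python
import re
from datetime import timedelta

# Days, then an optional time part introduced by T (hours, minutes, seconds).
_DAY_TIME = re.compile(
    r"P(?:(?P<days>\d+)D)?"
    r"(?:T(?:(?P<hours>\d+)H)?(?:(?P<minutes>\d+)M)?"
    r"(?:(?P<seconds>\d+(?:\.\d+)?)S)?)?"
)


def parse_day_time_duration(value: str) -> timedelta:
    """Convert a duration such as P1DT12H or PT0.02S into a timedelta."""
    match = _DAY_TIME.fullmatch(value)
    # Reject a bare 'P' and a 'T' that is not followed by any time component.
    if match is None or value == "P" or value.endswith("T"):
        raise ValueError(f"unsupported or malformed duration: {value!r}")
    parts = {
        unit: float(number)
        for unit, number in match.groupdict().items()
        if number is not None
    }
    return timedelta(**parts)
```

With this sketch, `P1DT12H` yields 36 hours and `P7D` yields one week.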
data.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities. | |
Module | tei |
Used by | |
Declaration | data.enumerated = data.word |
Note | Attributes using this datatype must contain a single word matching the pattern defined for this datatype: for example it cannot include whitespace but may begin with digits. Typically, the list of documented possibilities will be provided (or exemplified) by a value list in the associated attribute specification, expressed with a <valList> element. |
data.language defines the range of attribute values used to identify a particular combination of human language and writing system. [6.1. Language Identification] | |
Module | tei |
Used by | Element:
|
Declaration | data.language = xsd:language | "" |
Note | The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 4646 and RFC 4647; over time, other IETF documents may succeed these as the best current practice. A ‘language tag’, per BCP 47, is assembled from a sequence of components or subtags separated by the hyphen character (-, U+002D). The tag is made of the following subtags, in the following order: language, script, region, variant, extension, and private use. Every subtag except the first is optional. If present, each occurs only once, except the fourth and fifth components (variant and extension), which are repeatable.
There are two exceptions to the above format. First, there are language tags in the IANA registry that do not match the above syntax, but are present because they have been ‘grandfathered’ from previous specifications. Second, an entire language tag can consist of only a private use subtag. These tags start with x-.
The W3C Internationalization Activity has published a useful introduction to BCP 47, Language tags in HTML and XML. |
data.name defines the range of attribute values expressed as an XML Name. | |
Module | tei |
Used by | Element:
|
Declaration | data.name = xsd:Name |
Note | Attributes using this datatype must contain a single word which follows the rules defining a legal XML name (see http://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits. |
data.numeric defines the range of attribute values used for numeric values. | |
Module | tei |
Used by | |
Declaration | data.numeric = xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal |
Note | Any numeric value, represented as a decimal number, in floating point format, or as a ratio. To represent a floating point number, expressed in scientific notation, ‘E notation’, a variant of ‘exponential notation’, may be used. In this format, the value is expressed as two numbers separated by the letter E. The first number, the significand (sometimes called the mantissa) is given in decimal format, while the second is an integer. The value is obtained by multiplying the significand by 10 raised to the power given by the integer. Thus the value represented in decimal notation as 1000.0 might be represented in scientific notation as 1E3. A value expressed as a ratio is represented by two integer values separated by a solidus (/) character. Thus, the value represented in decimal notation as 0.5 might be represented as a ratio by the string 1/2. |
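The three notations accepted by data.numeric can be reduced to a single floating point value. The sketch below is an illustration only: the function name is hypothetical, and Python's `float()` is somewhat more permissive than xsd:double and xsd:decimal (it tolerates surrounding whitespace, for instance).

```python
import re
from fractions import Fraction

# Ratio form from the declaration: two integers separated by a solidus.
_RATIO = re.compile(r"-?\d+/-?\d+")


def parse_numeric(value: str) -> float:
    """Interpret a decimal, E-notation, or ratio string as a float."""
    if _RATIO.fullmatch(value):
        numerator, denominator = value.split("/")
        # Fraction raises ZeroDivisionError for a zero denominator.
        return float(Fraction(int(numerator), int(denominator)))
    # Decimal and E notation are both handled by float().
    return float(value)
```

Here `parse_numeric("1/2")` gives 0.5 and `parse_numeric("1E3")` gives 1000.0.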
data.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display. | |
Module | tei |
Used by | |
Declaration | data.outputMeasurement = token { pattern = "[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)" } |
Example | <figure> <head>The TEI Logo</head> <figDesc>Stylized yellow angle brackets with the letters <mentioned>TEI</mentioned> in between and <mentioned>text encoding initiative</mentioned> underneath, all on a white background.</figDesc> <graphic height="600px" url="http://www.tei-c.org/logos/TEI-600.jpg" width="600px"/> </figure> |
Note | These values map directly onto the values used by XSL-FO and CSS. For definitions of the units see those specifications; at the time of this writing the most complete list is in the CSS3 working draft. |
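The declared pattern splits naturally into a signed number and a unit. The sketch below reuses the pattern from the declaration with named groups added for convenience; the function name and the choice to return `None` for non-matching input are assumptions made for illustration.

```python
import re

# Same pattern as the declaration, with named groups for number and unit.
_MEASURE = re.compile(
    r"(?P<number>[\-+]?\d+(\.\d+)?)"
    r"(?P<unit>%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)"
)


def parse_output_measurement(value: str):
    """Return (number, unit) for a valid measurement, otherwise None."""
    match = _MEASURE.fullmatch(value)
    if match is None:
        return None
    return float(match.group("number")), match.group("unit")
```

A value such as `600px`, as used on the `graphic` element in the example above, is split into the number 600.0 and the unit `px`; a bare number without a unit is rejected.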
data.pattern (regular expression pattern) defines attribute values which are expressed as a regular expression. | |
Module | tei |
Used by | |
Declaration | data.pattern = token |
Note | A regular expression, often called a pattern, is an expression that describes a set of strings. Patterns are usually used to give a concise description of a set without having to list all of its elements. For example, the set containing the three strings Handel, Händel, and Haendel can be described by the pattern H(ä|ae?)ndel (or alternatively, it is said that the pattern H(ä|ae?)ndel matches each of the three strings). (Source: Wikipedia) |
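The example pattern from the note can be tried directly. The sketch below uses Python's `re` module, whose syntax agrees with the schema pattern language for this simple expression; the extra string `Hendel` is added here only as a counterexample.

```python
import re

HANDEL = re.compile(r"H(ä|ae?)ndel")

candidates = ["Handel", "Händel", "Haendel", "Hendel"]

# fullmatch is used because schema patterns must match the whole value.
matching = [name for name in candidates if HANDEL.fullmatch(name)]
print(matching)
```

The three spellings named in the note match; `Hendel` does not.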
data.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere. | |
Module | tei |
Used by | |
Declaration | data.pointer = xsd:anyURI |
Note | The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax. Note that the values themselves are encoded using RFC 3987 Internationalized Resource Identifiers (IRIs) mapping to URIs. For example, |
data.probability defines the range of attribute values expressing a probability. | |
Module | tei |
Used by | |
Declaration | data.probability = xsd:double { minInclusive = "0" maxInclusive = "1" } |
Note | Probability is expressed as a real number between 0 and 1; 0 representing certainly false and 1 representing certainly true. |
data.replacement defines attribute values which contain a replacement template. | |
Module | tei |
Used by | |
Declaration | data.replacement = text |
data.temporal.iso defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the international standard Data elements and interchange formats – Information interchange – Representation of dates and times. | |
Module | tei |
Used by | Element:
|
Declaration | data.temporal.iso = xsd:date | xsd:gYear | xsd:gMonth | xsd:gDay | xsd:gYearMonth | xsd:gMonthDay | xsd:time | xsd:dateTime | token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" } |
Note | If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used. For all representations for which ISO 8601 describes both a basic and an extended format, these Guidelines recommend use of the extended format. While ISO 8601 permits the use of both |
data.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes Second Edition specification. | |
Module | tei |
Used by | |
Declaration | data.temporal.w3c = xsd:date | xsd:gYear | xsd:gMonth | xsd:gDay | xsd:gYearMonth | xsd:gMonthDay | xsd:time | xsd:dateTime |
Note | If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used. |
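The recommendation in the note can be illustrated with a short Python sketch. It assumes that the values are dateTime strings that `datetime.fromisoformat` accepts (explicit numeric offsets such as `+02:00`); the sample values are invented for illustration.

```python
from datetime import datetime

# Two dateTime values with time zone indicators: they name the same instant.
berlin = datetime.fromisoformat("2013-05-01T12:00:00+02:00")
utc = datetime.fromisoformat("2013-05-01T10:00:00+00:00")
same_instant = berlin == utc

# A value without a time zone indicator cannot be ordered against one with it.
local = datetime.fromisoformat("2013-05-01T12:00:00")
try:
    local < utc
    ordering_defined = True
except TypeError:
    ordering_defined = False
```

With time zone indicators the comparison is well defined; without one, Python refuses to order the two values, which reflects the ambiguity the note warns about.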
data.text defines the range of attribute values used to express some kind of identifying string as a single sequence of Unicode characters possibly including whitespace. | |
Module | tei |
Used by | |
Declaration | data.text = string |
Note | Attributes using this datatype must contain a single ‘token’ in which whitespace and other punctuation characters are permitted. |
data.truthValue defines the range of attribute values used to express a truth value. | |
Module | tei |
Used by | Element:
|
Declaration | data.truthValue = xsd:boolean |
Note | The possible values of this datatype are 1 or true, or 0 or false. This datatype applies only for cases where uncertainty is inappropriate; if the attribute concerned may have a value other than true or false, e.g. unknown, or inapplicable, it should have the extended version of this datatype: data.xTruthValue. |
data.version defines the range of attribute values which may be used to specify a TEI or Unicode version number. | |
Module | tei |
Used by | Element:
|
Declaration | data.version = token { pattern = "[\d]+(\.[\d]+){0,2}" } |
Note | The value of this attribute follows the pattern specified by the Unicode consortium for its version number (http://unicode.org/versions/). A version number contains digits and fullstop characters only. The first number supplied identifies the major version number. A second and third number, for minor and sub-minor version numbers, may also be supplied. |
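A minimal Python sketch of the data.version pattern follows. The helper name `is_version` is hypothetical, and the value is assumed to be whitespace-normalized already, as the `token` datatype implies. `re.fullmatch` stands in for the implicit anchoring of schema patterns.

```python
import re

# Major number, optionally followed by minor and sub-minor numbers.
VERSION = re.compile(r"[\d]+(\.[\d]+){0,2}")

def is_version(value: str) -> bool:
    """Return True if value has one to three dot-separated numeric parts."""
    return VERSION.fullmatch(value) is not None
```

Under this pattern `5`, `5.2` and `5.2.0` are accepted, while `5.2.0.1`, `5.` and `v5` are rejected.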
data.versionNumber defines the range of attribute values used for version numbers. | |
Module | tei |
Used by | Element:
|
Declaration | data.versionNumber = token { pattern = "[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}" } |
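The data.versionNumber pattern has no accompanying note, so a small Python sketch may help to show what it admits: up to four dot-separated parts, each beginning with digits and optionally followed by lowercase letters and further digits. The helper name `is_version_number` is hypothetical and the sample values are invented.

```python
import re

VERSION_NUMBER = re.compile(r"[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}")

def is_version_number(value: str) -> bool:
    """Return True if the whole value matches the data.versionNumber pattern."""
    return VERSION_NUMBER.fullmatch(value) is not None

samples = ["1", "2.1a", "1.0rc2", "1.2.3.4", "1.2.3.4.5", "1.0-beta", "1.0RC2"]
accepted = [s for s in samples if is_version_number(s)]
```

Values with a fifth part, a hyphen, or uppercase letters fall outside the pattern.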
data.word defines the range of attribute values expressed as a single word or token. | |
Module | tei |
Used by | |
Declaration | data.word = token { pattern = "(\p{L}|\p{N}|\p{P}|\p{S})+" } |
Note | Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace. |
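Python's `re` module does not support the `\p{L}` style of Unicode property classes, so the sketch below approximates the data.word pattern with `unicodedata` instead. The function name `is_word` is hypothetical, and the mapping from the pattern's classes to the first letter of Python's general category codes is an assumption of this sketch.

```python
import unicodedata

# First letters of the general categories allowed by the pattern:
# L (letters), N (numbers), P (punctuation), S (symbols).
ALLOWED_MAJOR_CATEGORIES = ("L", "N", "P", "S")

def is_word(value: str) -> bool:
    """Return True if value is non-empty and every character is a letter,
    number, punctuation character, or symbol."""
    return bool(value) and all(
        unicodedata.category(ch)[0] in ALLOWED_MAJOR_CATEGORIES for ch in value
    )
```

Whitespace and control characters belong to other categories, which is why a value such as `two words` is rejected. Combining marks (category M) are rejected as well, so decomposed forms of accented letters do not pass.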
data.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown. | |
Module | tei |
Used by | |
Declaration | data.xTruthValue = xsd:boolean | "unknown" | "inapplicable" |
Note | In cases where uncertainty is inappropriate, use the datatype data.truthValue. |
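One possible way to read data.xTruthValue values in an application is sketched below in Python. The function name `parse_x_truth_value` and the choice to represent both `unknown` and `inapplicable` as `None` are assumptions made for illustration; an application that needs to tell the two apart would keep the original string.

```python
from typing import Optional

# Lexical forms of xsd:boolean.
BOOLEAN_LEXICAL = {"true": True, "1": True, "false": False, "0": False}
# Additional values permitted by data.xTruthValue.
EXTENDED_VALUES = {"unknown", "inapplicable"}

def parse_x_truth_value(value: str) -> Optional[bool]:
    """Return True or False for boolean values and None for the extended
    values; raise ValueError for anything else."""
    if value in BOOLEAN_LEXICAL:
        return BOOLEAN_LEXICAL[value]
    if value in EXTENDED_VALUES:
        return None
    raise ValueError(f"not a valid data.xTruthValue: {value!r}")
```

Restricting the accepted set to `BOOLEAN_LEXICAL` alone gives the stricter data.truthValue behaviour.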
macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Declaration | macro.limitedContent = ( text | model.limitedPhrase | model.inter )* |
macro.paraContent (paragraph content) defines the content of paragraphs and similar elements. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Declaration | macro.paraContent = ( text | model.gLike | model.phrase | model.inter | model.global | lg | l )* |
macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements. [1.4.1. Standard Content Models] | |
Module | tei |
Used by | |
Declaration | macro.phraseSeq = ( text | model.gLike | model.phrase | model.global )* |
macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. [1.4.1. Standard Content Models] | |
Module | tei |
Used by | |
Declaration | macro.phraseSeq.limited = ( text | model.limitedPhrase | model.global )* |
macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. [1.3. The TEI Class System] | |
Module | tei |
Used by | |
Declaration | macro.specialPara = ( text | model.gLike | model.phrase | model.inter | model.divPart | model.global )* |
Carolin Odebrecht,
Korpuslinguistik und Morphologie. Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin.
carolin.odebrecht@hu-berlin.de
LAUDATIO: Long-term Access and Usage of Deeply Annotated Information. The management and archiving of digital research data is a field in which linguistics, library and information science (LIS), and computer science overlap, and these disciplines cooperate in the LAUDATIO project. The project is funded by the German Research Foundation from 2011 to 2014. The project partners are the departments of Corpus Linguistics and Historical Linguistics and the Computer and Media Service (CMS) at Humboldt-Universität zu Berlin and the National Institute for Research in Computer Science and Control (INRIA France), cooperating with the Berlin School of Library and Information Science (BSLIS).
Funded by Deutsche Forschungsgemeinschaft (DFG).
The Creative Commons Attribution 3.0 Unported (CC BY 3.0) Licence applies to this document.