egXML
egXML
egXML
content descendants must be in the
namespaces
'http://relaxng.org/ns/compatibility/annotations/1.0', 'http://relaxng.org/ns/structure/1.0', 'http://www.tei-c.org/ns/1.0'
egXML
egXML
datatype descendants must be in the
namespaces
'http://relaxng.org/ns/structure/1.0', 'http://www.tei-c.org/ns/1.0'
(anchored) indicates whether the copy text shows the exact place of reference for the note.
(target end) points to the end of the span to which the note is attached, if the note is not embedded in the text at that point.
\S+
indicates the person, or group of people, to whom the element content is ascribed.
\S+
indicates the person, or group of people, to whom a speech act or action is directed.
\S+
provides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind.
(reference) provides an explicit means of locating a full definition or identity for the entity being named by means of one or more URIs.
\S+
gives a minimum estimated value for the approximate measurement.
(\-?[\d]+/\-?[\d]+)
gives a maximum estimated value for the approximate measurement.
(\-?[\d]+/\-?[\d]+)
where the measurement summarizes more than one observation or a range, supplies the minimum value observed.
(\-?[\d]+/\-?[\d]+)
where the measurement summarizes more than one observation or a range, supplies the maximum value observed.
(\-?[\d]+/\-?[\d]+)
specifies the degree of statistical confidence (between zero and one) that a value falls within the range specified by @min
and @max
, or the proportion of observed values that fall within that range.
names the unit used for the measurement
Suggested values include: 1] cm (centimetres); 2] mm (millimetres); 3] in (inches); 4] line; 5] char (characters)
cm
(centimetres)
mm
(millimetres)
in
(inches)
line
lines of text
char
(characters) characters of text
[^\p{C}\p{Z}]+
specifies the length in the units specified
(\-?[\d]+/\-?[\d]+)
indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.
characterizes the precision of the values specified by the other attributes.
high
medium
low
unknown
where the measurement summarizes more than one observation, specifies the applicability of this measurement.
Sample values include: 1] all; 2] most; 3] range
[^\p{C}\p{Z}]+
points to a <handNote>
element describing the hand considered responsible for the content of the element concerned.
\S+
categorizes the cause of the damage, if it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
[^\p{C}\p{Z}]+
provides a coded representation of the degree of damage, either as a number between 0 (undamaged) and 1 (very extensively damaged), or as one of the codes high, medium, low, or unknown. The <damage>
element with the @degree
attribute should only be used where the text may be read with some confidence; text supplied from other sources should be tagged as <supplied>
.
high
medium
low
unknown
assigns an arbitrary number to each stretch of damage regarded as forming part of the same physical phenomenon.
indicates whether or not the element bearing this attribute should be considered to mark the end of an orthographic token in the same way as whitespace.
[^\p{C}\p{Z}]+
(canonical reference) specifies the destination of the pointer by supplying a canonical reference expressed using the scheme defined in a <refsDecl>
element in the TEI header
supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.
indicates the ending point of the period in standard form, e.g. yyyy-mm-dd.
The @when attribute cannot be used with any other att.datable.w3c attributes.
The @from and @notBefore attributes cannot be used together.
The @to and @notAfter attributes cannot be used together.
indicates one or more systems or calendars to which the date represented by the content of this element belongs.
\S+
@calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this element has no textual content.
supplies pointers to one or more definitions of named periods of time (typically <category>
s or <calendar>
s) within which the datable item is understood to have occurred.
\S+
provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. If @valueDatcat
is present in the immediate context, this attribute takes on role (a), while @valueDatcat
performs role (b).
\S+
provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with @datcat
is needed.
\S+
provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the @datcat
attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element.
\S+
indicates whether or not this element is selected by default when its parent is selected.
true
This element is selected if its parent is selected
false
This element can only be selected explicitly, unless it is the only one of its kind, in which case it is selected if its parent is selected.
(declarations) identifies one or more declarable elements within the header, which are understood to apply to the element bearing this attribute and its content.
\S+
specifies whether or not its parent element is fragmented in some way, typically by some other overlapping structure: for example a speech which is divided between two or more verse stanzas, a paragraph which is split across a page division, a verse line which is divided between two speakers.
Y
(yes) the element is fragmented in some (unspecified) respect
N
(no) the element is not fragmented, or no claim is made as to its completeness
I
(initial) this is the initial part of a fragmented element
M
(medial) this is a medial part of a fragmented element
F
(final) this is the final part of a fragmented element
(organization) specifies how the content of the division is organized.
composite
no claim is made about the sequence in which the immediate contents of this division are to be processed, or their inter-relationships.
uniform
the immediate contents of this element are regarded as forming a logical unit, to be processed in sequence.
indicates whether this division is a sample of the original source and if so, from which part.
initial
division lacks material present at end in source.
medial
division lacks material at start and end.
final
division lacks material at start.
unknown
position of sampled material within original unknown.
complete
division is not a sample.
describes the status of a document either currently or, when associated with a dated element, at the time indicated.
Sample values include: 1] approved; 2] candidate; 3] cleared; 4] deprecated; 5] draft; 6] embargoed; 7] expired; 8] frozen; 9] galley; 10] proposed; 11] published; 12] recommendation; 13] submitted; 14] unfinished; 15] withdrawn
[^\p{C}\p{Z}]+
(duration) indicates the length of this element in time.
(certainty) signifies the degree of certainty associated with the intervention or interpretation.
high
medium
low
unknown
(responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber.
\S+
indicates the nature of the evidence supporting the reliability or accuracy of the intervention or interpretation.
Suggested values include: 1] internal; 2] external; 3] conjecture
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or scholar on the basis of their expertise.
[^\p{C}\p{Z}]+
indicates whether this is an instant revision or not.
unknown
inapplicable
(rendition) indicates how the element in question was rendered or presented in the source text.
[^\p{C}\p{Z}]+
contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text
points to a description of the rendering or presentation used for this element in the source text.
\S+
specifies the source from which some aspect of this element is drawn.
\S+
When used on a schema description element (like
), the @source attribute
should have only 1 value. (This one has .)
(identifier) provides a unique identifier for the element bearing the attribute.
(number) gives a number (or other label) for an element, which is not necessarily unique within the document.
(language) indicates the language of the element content using a tag generated according to BCP 47.
provides a base URI reference with which applications can resolve relative URI references into absolute URI references.
\S+
signals an intention about how white space should be managed by applications.
default
signals that the application's default white-space processing modes are acceptable
preserve
indicates the intent that applications preserve all white space
gives a name or other identifier for the scribe believed to be responsible for this hand.
points to a full description of the scribe concerned, typically supplied by a <person>
element elsewhere in the description.
\S+
characterizes the particular script or writing style used by this hand, for example secretary, copperplate, Chancery, Italian, etc.
points to a full description of the script or writing style used by this hand, typically supplied by a <scriptNote>
element elsewhere in the description.
\S+
describes the tint or type of ink, e.g. brown, or other writing medium, e.g. pencil
[^\p{C}\p{Z}]+
specifies how widely this hand is used in the manuscript.
sole
only this hand is used throughout the manuscript
major
this hand is used through most of the manuscript
minor
this hand is used occasionally in the manuscript
(MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type
[^\p{C}\p{Z}]+
Where the media are displayed, indicates the display width
[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|ch|rem|vw|vh|vmin|vmax)
Where the media are displayed, indicates the display height
[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|ch|rem|vw|vh|vmin|vmax)
Where the media are displayed, indicates a scale factor to be applied when generating the desired display size
(\-?[\d]+/\-?[\d]+)
(uniform resource locator) specifies the URL from which the media concerned may be obtained.
\S+
indicates what kind of phenomenon is being noted in the passage.
Sample values include: 1] image; 2] character; 3] theme; 4] allusion
[^\p{C}\p{Z}]+
(subtype) provides a sub-categorization of the phenomenon is being noted in the passage, if needed
[^\p{C}\p{Z}]+
(instances) points to instances of the analysis or interpretation represented by the current element.
\S+
(unit) indicates the units used for the measurement, usually using the standard symbol for the desired units.
Suggested values include: 1] m (metre); 2] kg (kilogram); 3] s (second); 4] Hz (hertz); 5] Pa (pascal); 6] Ω (ohm); 7] L (litre); 8] t (tonne); 9] ha (hectare); 10] Å (ångström); 11] mL (millilitre); 12] cm (centimetre); 13] dB (decibel); 14] kbit (kilobit); 15] Kibit (kibibit); 16] kB (kilobyte); 17] KiB (kibibyte); 18] MB (megabyte); 19] MiB (mebibyte)
m
(metre) SI base unit of length
kg
(kilogram) SI base unit of mass
s
(second) SI base unit of time
Hz
(hertz) SI unit of frequency
Pa
(pascal) SI unit of pressure or stress
Ω
(ohm) SI unit of electric resistance
L
(litre) 1 dm³
t
(tonne) 10³ kg
ha
(hectare) 1 hm²
Å
(ångström) 10⁻¹⁰ m
mL
(millilitre)
cm
(centimetre)
dB
(decibel) see remarks, below
kbit
(kilobit) 10³ or 1000 bits
Kibit
(kibibit) 2¹⁰ or 1024 bits
kB
(kilobyte) 10³ or 1000 bytes
KiB
(kibibyte) 2¹⁰ or 1024 bytes
MB
(megabyte) 10⁶ or 1 000 000 bytes
MiB
(mebibyte) 2²⁰ or 1 048 576 bytes
[^\p{C}\p{Z}]+
points to a unique identifier stored in the @xml:id
of a <unitDef>
element that defines a unit of measure.
\S+
(quantity) specifies the number of the specified units that comprise the measurement
(\-?[\d]+/\-?[\d]+)
(commodity) indicates the substance that is being measured
[^\p{C}\p{Z}]+
The @unit attribute may be unnecessary when @unitRef is present.
may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
[^\p{C}\p{Z}]+
(reference to the canonical name) provides a means of locating the canonical form (nym) of the names associated with the object named by the element bearing it.
\S+
names the notation used for the content of the element.
[^\p{C}\p{Z}]+
specifies where this item is placed.
Suggested values include: 1] top; 2] bottom; 3] margin; 4] opposite; 5] overleaf; 6] above; 7] right; 8] below; 9] left; 10] end; 11] inline; 12] inspace
top
at the top of the page
bottom
at the foot of the page
margin
in the margin (left, right, or both)
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
right
to the right, e.g. to the right of a vertical line of text, or to the right of a figure
below
below the line
left
to the left, e.g. to the left of a vertical line of text, or to the left of a figure
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
[^\p{C}\p{Z}]+
characterizes the element in some sense, using any convenient classification scheme or typology.
[^\p{C}\p{Z}]+
(subtype) provides a sub-categorization of the element, if needed
[^\p{C}\p{Z}]+
The element should not be categorized in detail with @subtype unless also categorized in general with @type
specifies the language of the content to be found at the destination referenced by @target
, using a language tag generated according to BCP 47.
@targetLang should only be used on if @target is specified.
specifies the destination of the reference by supplying one or more URI References
\S+
(evaluate) specifies the intended meaning when the target of a pointer is itself a pointer.
all
if the element pointed to is itself a pointer, then the target of that pointer will be taken, and so on, until an element is found which is not a pointer.
one
if the element pointed to is itself a pointer, then its target (whether a pointer or not) is taken as the target of this pointer.
none
no further evaluation of targets is carried out beyond that needed to find the element specified in the pointer's target.
optionally specifies the identifiers of the elements within which all elements indicated by the contents of this element lie.
\S+
\S+
\S+
(target function) describes the function of each of the values of the @target
attribute of the enclosed <link>
, <join>
, or <alt>
tags.
[^\p{C}\p{Z}]+
[^\p{C}\p{Z}]+
[^\p{C}\p{Z}]+
points at one or more sets of zero or more elements each.
\S+
supplies an XPath selection pattern using the syntax defined in which identifies a set of nodes, selected within the context identified by the @target
attribute if this is supplied, or within the context of the parent element if it is not.
(function) characterizes the function of the segment.
[^\p{C}\p{Z}]+
supplies the sort key for this element in an index, list or group which contains it.
[^\p{C}\p{Z}]+
(edition) supplies a sigil or other arbitrary identifier for the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text.
[^\p{C}\p{Z}]+
(edition reference) provides a pointer to the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text.
\S+
indicates the end of a span initiated by the element bearing this attribute.
\S+
The element indicated by @spanTo () must follow the current element
identifies the language used to describe the rendition.
css
Cascading Stylesheet Language
xslfo
Extensible Stylesheet Language Formatting Objects
free
Informal free text description
other
A user-defined rendition description language
supplies a version number for the style language provided in @scheme
.
[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}
@schemeVersion can only be used if @scheme is specified.
indicates the location within a temporal alignment at which this element begins.
\S+
indicates the location within a temporal alignment at which this element ends.
\S+
indicates the effect of the intervention, for example in the case of a deletion, strikeouts which include too much or too little text, or in the case of an addition, an insertion which duplicates some of the text already present.
Sample values include: 1] duplicate; 2] duplicate-partial; 3] excessStart; 4] excessEnd; 5] shortStart; 6] shortEnd; 7] partial; 8] unremarkable
[^\p{C}\p{Z}]+
documents the presumed cause for the intervention.
[^\p{C}\p{Z}]+
(sequence) assigns a sequence number related to the order in which the encoded features carrying this attribute are believed to have occurred.
identifies the unit of information conveyed by the element, e.g. columns, pages, volume, entry.
Suggested values include: 1] volume (volume); 2] issue; 3] page (page); 4] line; 5] chapter (chapter); 6] part; 7] column; 8] entry
volume
(volume) the element contains a volume number.
issue
the element contains an issue number, or volume and issue numbers.
page
(page) the element contains a page number or page range.
line
the element contains a line number or line range.
chapter
(chapter) the element contains a chapter indication (number and/or title)
part
the element identifies a part of a book or collection.
column
the element identifies a column.
entry
the element identifies an entry number or label in a list of entries.
[^\p{C}\p{Z}]+
specifies the starting point of the range of units indicated by the @unit
attribute.
[^\p{C}\p{Z}]+
specifies the end-point of the range of units indicated by the @unit
attribute.
[^\p{C}\p{Z}]+
A @formula
is provided to describe a mathematical calculation such as a conversion between measurement systems.
indicates one or more locations by pointing to a <place>
element or other canonical description.
\S+
indicates whether the pronunciation or orthography applies to all or part of a word.
Suggested values include: 1] full (full form); 2] pref (prefix); 3] suff (suffix); 4] inf (infix); 5] part (partial)
full
(full form)
pref
(prefix)
suff
(suffix)
inf
(infix)
part
(partial)
[^\p{C}\p{Z}]+
indicates whether the name component is given in full, as an abbreviation or simply as an initial.
yes
(yes) the name component is spelled out in full.
abb
(abbreviated) the name component is given in an abbreviated form.
init
(initial letter) the name component is indicated only by one initial.
(sort) specifies the sort order of the name component in relation to others within the name.
(duration) indicates the length of this element in time.
[0-9.,DHMPRSTWYZ/:+\-]+
provides a conventional name for the kind of section changing at this milestone.
Suggested values include: 1] page; 2] column; 3] line; 4] book; 5] poem; 6] canto; 7] speaker; 8] stanza; 9] act; 10] scene; 11] section; 12] absent; 13] unnumbered
page
physical page breaks (synonymous with the <pb>
element).
column
column breaks.
line
line breaks (synonymous with the <lb>
element).
book
any units termed book, liber, etc.
poem
individual poems in a collection.
canto
cantos or other major sections of a poem.
speaker
changes of speaker or narrator.
stanza
stanzas within a poem, book, or canto.
act
acts within a play.
scene
scenes within a play or act.
section
sections of any kind.
absent
passages not present in the reference edition.
unnumbered
passages present in the text, but not to be included as part of the reference.
[^\p{C}\p{Z}]+
(paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents]
Abstract model violation: Paragraphs may not occur inside other paragraphs or ab elements.
Abstract model violation: Lines may not contain higher-level structural elements such as div, p, or ab, unless p is a child of figure or note, or is a descendant of floatingText.
(foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text. [3.3.2.1. Foreign Words or Expressions]
(emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect. [3.3.2.2. Emphatic Words and Phrases 3.3.2. Emphasis, Foreign Words, and Unusual Language]
(highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made. [3.3.2.2. Emphatic Words and Phrases 3.3.2. Emphasis, Foreign Words, and Unusual Language]
identifies any word or phrase which is regarded as linguistically distinct, for example as archaic, technical, dialectal, non-preferred, etc., or as forming part of a sublanguage. [3.3.2.3. Other Linguistically Distinct Material]
specifies the sublanguage or register to which the word or phrase is being assigned
[^\p{C}\p{Z}]+
specifies how the phrase is distinct diachronically
specifies how the phrase is distinct diatopically
specifies how the phrase is distinct diastratically
(speech or thought) indicates passages thought or spoken aloud, whether explicitly indicated in the source or not, whether directly or indirectly reported, whether by real people or fictional characters. [3.3.3. Quotation]
may be used to indicate whether the quoted matter is regarded as having been vocalized or signed.
unknown
inapplicable
may be used to indicate whether the quoted matter is regarded as direct or indirect speech.
unknown
inapplicable
(quotation) contains a phrase or passage attributed by the narrator or author to some agency external to the text. [3.3.3. Quotation 4.3.1. Grouped Texts]
(quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used. [3.3.3. Quotation]
(type) may be used to indicate whether the offset passage is spoken or thought, or to characterize it more finely.
Suggested values include: 1] spoken (spoken); 2] thought (thought); 3] written (written); 4] soCalled (so called); 5] foreign (foreign); 6] distinct (distinct); 7] term; 8] emph (emph); 9] mentioned (mentioned)
spoken
(spoken) representation of speech
thought
(thought) representation of thought, e.g. internal monologue
written
(written) quotation from a written source
soCalled
(so called) authorial distance
foreign
(foreign)
distinct
(distinct) linguistically distinct
term
technical term
emph
(emph) rhetorically emphasized
mentioned
(mentioned) refering to itself, not its normal referent
[^\p{C}\p{Z}]+
(cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example. [3.3.3. Quotation 4.3.1. Grouped Texts 9.3.5.1. Examples]
marks words or phrases mentioned, not used. [3.3.3. Quotation]
(so called) contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics. [3.3.3. Quotation]
(description) contains a short description of the purpose, function, or use of its parent element, or when the parent is a documentation element, describes or defines the object being documented. [22.4.1. Description of Components]
Information about a
deprecation should only be present in a specification element
that is being deprecated: that is, only an element that has a
@validUntil attribute should have a child <desc
type="deprecationInfo">.
characterizes the element in some sense, using any convenient classification scheme or typology.
Suggested values include: 1] deprecationInfo (deprecation information)
deprecationInfo
(deprecation
information) This element describes why or how its parent element is being deprecated, typically including recommendations for alternate encoding.
[^\p{C}\p{Z}]+
(gloss) identifies a phrase or word used to provide a gloss or definition for some other word or phrase. [3.4.1. Terms and Glosses 22.4.1. Description of Components]
(term) contains a single-word, multi-word, or symbolic designation which is regarded as a technical term. [3.4.1. Terms and Glosses]
(ruby container) contains a passage of base text along with its associated ruby gloss(es). [3.4.2. Ruby Annotations]
(ruby base) contains the base text annotated by a ruby gloss. [3.4.2. Ruby Annotations]
(ruby text) contains a ruby text, an annotation closely associated with a passage of the main text. [3.4.2. Ruby Annotations]
supplies a pointer to the base being glossed by this ruby text.
\S+
When target= is
present, neither from= nor to= should be.
points to the starting point of the span of text being glossed by this ruby text.
\S+
When from= is present, the to=
attribute of is required.
points to the ending point of the span of text being glossed.
\S+
When to= is present, the from=
attribute of is required.
(Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate. [3.5.1. Apparent Errors]
(correction) contains the correct form of a passage apparently erroneous in the copy text. [3.5.1. Apparent Errors]
(choice) groups a number of alternative encodings for the same point in a text. [3.5. Simple Editorial Changes]
(regularization) contains a reading which has been regularized or normalized in some sense. [3.5.2. Regularization and
Normalization 12. Critical Apparatus]
(original form) contains a reading which is marked as following the original, rather than being normalized or corrected. [3.5.2. Regularization and
Normalization 12. Critical Apparatus]
(gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. [3.5.3. Additions, Deletions, and Omissions]
(reason) gives the reason for omission
Suggested values include: 1] cancelled (cancelled); 2] deleted (deleted); 3] editorial (editorial); 4] illegible (illegible); 5] inaudible (inaudible); 6] irrelevant (irrelevant); 7] sampling (sampling)
cancelled
(cancelled)
deleted
(deleted)
editorial
(editorial) for features omitted from transcription due to editorial policy
illegible
(illegible)
inaudible
(inaudible)
irrelevant
(irrelevant)
sampling
(sampling)
[^\p{C}\p{Z}]+
(agent) in the case of text omitted because of damage, categorizes the cause of the damage, if it can be identified.
Sample values include: 1] rubbing (rubbing); 2] mildew (mildew); 3] smoke (smoke)
[^\p{C}\p{Z}]+
(deliberately marked omission) indicates a purposeful marking in the source document signalling that content has been omitted, and may also supply or describe the omitted content. [3.5.3. Additions, Deletions, and Omissions]
(addition) contains letters, words, or phrases inserted in the source text by an author, scribe, or a previous annotator or corrector. [3.5.3. Additions, Deletions, and Omissions]
(deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, or a previous annotator or corrector. [3.5.3. Additions, Deletions, and Omissions]
(unclear) contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source. [11.3.3.1. Damage, Illegibility, and Supplied Text 3.5.3. Additions, Deletions, and Omissions]
indicates why the material is hard to transcribe.
Suggested values include: 1] illegible (illegible); 2] inaudible (inaudible); 3] faded (faded); 4] background_noise (background noise); 5] eccentric_ductus (eccentric ductus)
illegible
(illegible)
inaudible
(inaudible)
faded
(faded)
background_noise
(background noise)
eccentric_ductus
(eccentric ductus) indicates illegibility due to an unusual, awkward, or incompetent execution of a glyph or glyphs
[^\p{C}\p{Z}]+
Where the difficulty in transcription arises from damage, categorizes the cause of the damage, if it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
[^\p{C}\p{Z}]+
(name, proper noun) contains a proper noun or noun phrase. [3.6.1. Referring Strings]
(referencing string) contains a general purpose name or referring string. [13.2.1. Personal Names 3.6.1. Referring Strings]
(electronic mail address) contains an email address identifying a location to which email messages can be delivered. [3.6.2. Addresses]
(address) contains a postal address, for example of a publisher, an organization, or an individual. [3.6.2. Addresses 2.2.4. Publication, Distribution, Licensing, etc. 3.12.2.4. Imprint, Size of a Document, and Reprint Information]
(address line) contains one line of a postal address. [3.6.2. Addresses 2.2.4. Publication, Distribution, Licensing, etc. 3.12.2.4. Imprint, Size of a Document, and Reprint Information]
contains a full street address including any name or number identifying a building as well as the name of the street or route on which it is located. [3.6.2. Addresses]
(postal code) contains a numerical or alphanumeric code used as part of a postal address to simplify sorting or delivery of mail. [3.6.2. Addresses]
(postal box or post office box) contains a number or other identifier for some postal delivery point other than a street address. [3.6.2. Addresses]
(number) contains a number, written in any form. [3.6.3. Numbers and
Measures]
indicates the type of numeric value.
Suggested values include: 1] cardinal; 2] ordinal; 3] fraction; 4] percentage
cardinal
absolute number, e.g. 21, 21.5
ordinal
ordinal number, e.g. 21st
fraction
fraction, e.g. one half or three-quarters
percentage
a percentage
[^\p{C}\p{Z}]+
supplies the value of the number in standard form.
(\-?[\d]+/\-?[\d]+)
(measure) contains a word or phrase referring to some quantity of an object or commodity, usually comprising a number, a unit, and a commodity name. [3.6.3. Numbers and
Measures]
specifies the type of measurement in any convenient typology.
[^\p{C}\p{Z}]+
(measure group) contains a group of dimensional specifications which relate to the same object, for example the height and width of a manuscript page. [10.3.4. Dimensions]
contains a symbol, a word or a phrase referring to a unit of measurement in any kind of formal or informal system. [3.6.3. Numbers and
Measures]
(date) contains a date in any format. [3.6.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.6. The Revision Description 3.12.2.4. Imprint, Size of a Document, and Reprint Information 15.2.3. The Setting Description 13.4. Dates]
(time) contains a phrase defining a time of day in any format. [3.6.4. Dates and Times]
(abbreviation) contains an abbreviation of any sort. [3.6.5. Abbreviations and Their Expansions]
(type) allows the encoder to classify the abbreviation according to some convenient typology.
Sample values include: 1] suspension (suspension); 2] contraction (contraction); 3] brevigraph; 4] superscription (superscription); 5] acronym (acronym); 6] title (title); 7] organization (organization); 8] geographic (geographic)
[^\p{C}\p{Z}]+
(expansion) contains the expansion of an abbreviation. [3.6.5. Abbreviations and Their Expansions]
(pointer) defines a pointer to another location. [3.7. Simple Links and Cross-References 16.1. Links]
Only one of the
attributes @target and @cRef may be supplied on .
(reference) defines a reference to another location, possibly modified by additional text or comment. [3.7. Simple Links and Cross-References 16.1. Links]
Only one of the
attributes @target' and @cRef' may be supplied on
(list) contains any sequence of items organized as a list. [3.8. Lists]
The content of a "gloss" list should include a sequence of one or more pairs of a label element followed by an item element
(type) describes the nature of the items in the list.
Suggested values include: 1] gloss (gloss); 2] index (index); 3] instructions (instructions); 4] litany (litany); 5] syllogism (syllogism)
gloss
(gloss) each list item glosses some term or concept, which is given by a <label>
element preceding the list item.
index
(index) each list item is an entry in an index such as the alphabetical topical index at the back of a print volume.
instructions
(instructions) each list item is a step in a sequence of instructions, as in a recipe.
litany
(litany) each list item is one of a sequence of petitions, supplications or invocations, typically in a religious ritual.
syllogism
(syllogism) each list item is part of an argument consisting of two or more propositions and a final conclusion derived from them.
[^\p{C}\p{Z}]+
(item) contains one component of a list. [3.8. Lists 2.6. The Revision Description]
(label) contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.8. Lists]
(heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc. [4.2.1. Headings and Trailers]
(heading for list labels) contains the heading for the label or term column in a glossary list or similar structured list. [3.8. Lists]
(heading for list items) contains the heading for the item or gloss column in a glossary list or similar structured list. [3.8. Lists]
(note) contains a note or annotation. [3.9.1. Notes and Simple Annotation 2.2.6. The Notes Statement 3.12.2.8. Notes and Statement of Language 9.3.5.4. Notes within Entries]
(note group) contains a group of notes [3.9.1.1. Encoding Grouped Notes]
(index entry) marks a location to be indexed for whatever purpose. [3.9.2. Index Entries]
a single word which follows the rules defining a legal XML name (see ), supplying a name to specify which index (of several) the index entry belongs to.
indicates the location of any form of external media such as an audio or video clip etc. [3.10. Graphics and Other Non-textual Components]
(MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type
[^\p{C}\p{Z}]+
(graphic) indicates the location of a graphic or illustration, either forming part of a text, or providing an image of it. [3.10. Graphics and Other Non-textual Components 11.1. Digital Facsimiles]
provides encoded binary data representing an inline graphic, audio, video or other object. [3.10. Graphics and Other Non-textual Components]
The encoding used to encode the binary data. If not specified, this is assumed to be Base64.
[^\p{C}\p{Z}]+
(milestone) marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element. [3.11.3. Milestone
Elements]
(gathering beginning) marks the beginning of a new gathering or quire in a transcribed codex. [3.11.3. Milestone
Elements]
(page beginning) marks the beginning of a new page in a paginated document. [3.11.3. Milestone
Elements]
(line beginning) marks the beginning of a new (typographic) line in some edition or version of a text. [3.11.3. Milestone
Elements 7.2.5. Speech Contents]
(column beginning) marks the beginning of a new column of a text on a multi-column page. [3.11.3. Milestone
Elements]
(analytic level) contains bibliographic elements describing an item (e.g. an article or poem) published within a monograph or journal and not as an independent publication. [3.12.2.1. Analytic, Monographic, and Series Levels]
(monographic level) contains bibliographic elements describing an item (e.g. a book or journal) published as an independent item (i.e. as a separate physical object). [3.12.2.1. Analytic, Monographic, and Series Levels]
(series information) contains information about the series in which a book or other bibliographic item has appeared. [3.12.2.1. Analytic, Monographic, and Series Levels]
(author) in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement]
contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.12.2.2. Titles, Authors, and Editors]
(statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.2. The Edition Statement 2.2.5. The Series Statement]
(responsibility) contains a phrase describing the nature of a person's intellectual responsibility, or an organization's role in the production or distribution of a work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.2. The Edition Statement 2.2.5. The Series Statement]
(title) contains a title for any kind of work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement]
classifies the title according to some convenient typology.
Sample values include: 1] main; 2] sub (subordinate); 3] alt (alternate); 4] short; 5] desc (descriptive)
[^\p{C}\p{Z}]+
indicates the bibliographic level for a title, that is, whether it identifies an article, book, journal, series, or unpublished material.
a
(analytic) the title applies to an analytic item, such as an article, poem, or other work published as part of a larger item.
m
(monographic) the title applies to a monograph such as a book or other item considered to be a distinct publication, including single volumes of multi-volume works
j
(journal) the title applies to any serial or periodical publication such as a journal, magazine, or newspaper
s
(series) the title applies to a series of otherwise distinct publications such as a collection
u
(unpublished) the title applies to any unpublished material (including theses and dissertations unless published by a commercial press)
contains the formalized descriptive title for a meeting or conference, for use in a bibliographic description for an item derived from such a meeting, or as a heading or preamble to publications emanating from it. [3.12.2.2. Titles, Authors, and Editors]
groups information relating to the publication or distribution of a bibliographic item. [3.12.2.4. Imprint, Size of a Document, and Reprint Information]
(publisher) provides the name of the organization responsible for the publication or distribution of a bibliographic item. [3.12.2.4. Imprint, Size of a Document, and Reprint Information 2.2.4. Publication, Distribution, Licensing, etc.]
(scope of bibliographic reference) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work. [3.12.2.5. Scopes and Ranges in Bibliographic Citations]
(cited range) defines the range of cited content, often represented by pages or other units [3.12.2.5. Scopes and Ranges in Bibliographic Citations]
(publication place) contains the name of the place where a bibliographic item was published. [3.12.2.4. Imprint, Size of a Document, and Reprint Information]
(bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements]
(structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements appear and in a specified order. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements]
(citation list) contains a list of bibliographic citations of any kind. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements]
contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it. [3.12.2.7. Related Items]
If the @target attribute on is used, the
relatedItem element must be empty
A relatedItem element should have either a 'target' attribute
or a child element to indicate the related bibliographic item
points to the related bibliographic element by means of an absolute or relative URI reference
\S+
(verse line) contains a single, possibly incomplete, line of verse. [3.13.1. Core Tags for Verse 3.13. Passages of Verse or Drama 7.2.5. Speech Contents]
Abstract model violation: Lines may not contain lines or lg elements.
(line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc. [3.13.1. Core Tags for Verse 3.13. Passages of Verse or Drama 7.2.5. Speech Contents]
An lg element
must contain at least one child l, lg, or gap element.
Abstract model violation: Lines may not contain line groups.
(speech) contains an individual speech in a performance text, or a passage presented as such in a prose or verse text. [3.13.2. Core Tags for Drama 3.13. Passages of Verse or Drama 7.2.2. Speeches and Speakers]
contains a specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment. [3.13.2. Core Tags for Drama]
(stage direction) contains any kind of stage direction within a dramatic text or fragment. [3.13.2. Core Tags for Drama 3.13. Passages of Verse or Drama 7.2.4. Stage Directions]
indicates the kind of stage direction.
Suggested values include: 1] setting; 2] entrance; 3] exit; 4] business; 5] novelistic; 6] delivery; 7] modifier; 8] location; 9] mixed
setting
describes a setting.
entrance
describes an entrance.
exit
describes an exit.
business
describes stage business.
novelistic
is a narrative, motivating stage direction.
delivery
describes how a character speaks.
modifier
gives some detail about a character.
location
describes a location.
mixed
more than one of the above
[^\p{C}\p{Z}]+
(TEI corpus) contains the whole of a TEI encoded corpus, comprising a single corpus header and one or more <TEI>
elements, each containing a single text header and a text. [4. Default Text Structure 15.1. Varieties of Composite Text]
(version) specifies the version number of the TEI Guidelines against which this document is valid.
[\d]+(\.[\d]+){0,2}
(automatically generated text division) indicates the location at which a textual division generated automatically by a text-processing application is to appear. [3.9.2. Index Entries]
specifies what type of generated text division (e.g. index, table of contents, etc.) is to appear.
Sample values include: 1] index; 2] toc; 3] figlist; 4] tablist
[^\p{C}\p{Z}]+
(text language) describes the languages and writing systems identified within the bibliographic work being described, rather than its description. [3.12.2.4. Imprint, Size of a Document, and Reprint Information 10.6.6. Languages and Writing Systems]
(main language) supplies a code which identifies the chief language used in the bibliographic work.
(other languages) one or more codes identifying any other languages used in the bibliographic work.
(use) supplies an XPath selection pattern using the syntax defined in . The XPath pattern is relative to the context given in @match
, which will either be a sibling attribute in the case of `<citeStructure>` or on the parent `<citeStructure>` in the case of `<citeData>`.
specifies a regular expression against which the values of other attributes can be matched.
specifies a replacement pattern, that is, the skeleton of a relative or absolute URI containing references to groups in the @matchPattern
which, once subpattern substitution has been performed, complete the URI.
(TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources. [2.1.1. The TEI Header and Its Components 15.1. Varieties of Composite Text]
(file description) contains a full bibliographic description of an electronic file. [2.2. The File Description 2.1.1. The TEI Header and Its Components]
(title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description]
(sponsor) specifies the name of a sponsoring organization or institution. [2.2.1. The Title Statement]
(funding body) specifies the name of an individual, institution, or organization responsible for the funding of a project or text. [2.2.1. The Title Statement]
(principal researcher) supplies the name of the principal researcher responsible for the creation of an electronic text. [2.2.1. The Title Statement]
(edition statement) groups information relating to one edition of a text. [2.2.2. The Edition Statement 2.2. The File Description]
(edition) describes the particularities of one edition of a text. [2.2.2. The Edition Statement]
(extent) describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.12.2.4. Imprint, Size of a Document, and Reprint Information 10.7.1. Object Description]
(publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description]
(distributor) supplies the name of a person or other agency responsible for the distribution of a text. [2.2.4. Publication, Distribution, Licensing, etc.]
(release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. Publication, Distribution, Licensing, etc.]
(identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [13.3.1. Basic Principles 2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.12.2.4. Imprint, Size of a Document, and Reprint Information]
categorizes the identifier, for example as an ISBN, Social Security number, etc.
Suggested values include: 1] ISBN; 2] ISSN; 3] DOI; 4] URI; 5] VIAF; 6] ESTC; 7] OCLC
ISBN
International Standard Book Number: a 13- or (if assigned prior to 2007) 10-digit identifying number assigned by the publishing industry to a published book or similar item, registered with the International ISBN Agency.
ISSN
International Standard Serial Number: an eight-digit number to uniquely identify a serial publication.
DOI
Digital Object Identifier: a unique string of letters and numbers assigned to an electronic document.
URI
Uniform Resource Identifier: a string of characters to uniquely identify a resource, following the syntax of RFC 3986.
VIAF
A data number in the Virtual Internet Authority File assigned to link different names in catalogs around the world for the same entity.
ESTC
English Short-Title Catalogue number: an identifying number assigned to a document in English printed in the British Isles or North America before 1801.
OCLC
OCLC control number (record number) for the union catalog record in WorldCat, a union catalog for member libraries in the Online Computer Library Center global cooperative.
[^\p{C}\p{Z}]+
(availability) supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.]
(status) supplies a code identifying the current availability of the text.
free
(free) the text is freely available.
unknown
(unknown) the status of the text is unknown.
restricted
(restricted) the text is not freely available.
contains information about a licence or other legal agreement applicable to the text. [2.2.4. Publication, Distribution, Licensing, etc.]
(series statement) groups information about the series, if any, to which a publication belongs. [2.2.5. The Series Statement 2.2. The File Description]
(notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description. [2.2.6. The Notes Statement 2.2. The File Description]
(source description) describes the source(s) from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. The Source Description]
(fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file description are present. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2. The File Description 2.2.7. The Source Description 15.3.2. Declarable Elements]
(encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components]
(schema reference) describes or points to a related customization or schema file [2.3.10. The Schema Specification]
the identifier used for the customization or schema
(project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 15.3.2. Declarable Elements]
(sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the creation of a corpus or collection. [2.3.2. The Sampling Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements]
(editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements]
(correction principles) states how and under what circumstances corrections have been made in the text. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
indicates the degree of correction applied to the text.
high
the text has been thoroughly checked and proofread.
medium
the text has been checked at least once.
low
the text has not been checked.
unknown
the correction status of the text is unknown.
indicates the method adopted to indicate corrections within the text.
silent
corrections have been made silently
markup
corrections have been represented using markup
(normalization) indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
indicates the method adopted to indicate normalizations within the text.
silent
normalization made silently
markup
normalization represented using markup
(quotation) specifies editorial practice adopted with respect to quotation marks in the original. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
On , either the @marks attribute should be used, or a paragraph of description provided
(quotation marks) indicates whether or not quotation marks have been retained as content within the text.
none
no quotation marks have been retained
some
some quotation marks have been retained
all
all quotation marks have been retained
(hyphenation) summarizes the way in which hyphenation in a source text has been treated in an encoded version of it. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
(end-of-line) indicates whether or not end-of-line hyphenation has been retained in a text.
all
all end-of-line hyphenation has been retained, even though the lineation of the original may not have been.
some
end-of-line hyphenation has been retained in some cases.
hard
all soft end-of-line hyphenation has been removed: any remaining end-of-line hyphenation should be retained.
none
all end-of-line hyphenation has been removed: any remaining hyphenation occurred within the line.
(segmentation) describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
(standard values) specifies the format used when standardized date or number values are supplied. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
(interpretation) describes the scope of any analytic or interpretive information added to the text in addition to the transcription. [2.3.3. The Editorial Practices Declaration]
specifies editorial practice adopted with respect to punctuation marks in the original. [2.3.3. The Editorial Practices Declaration 3.2. Treatment of Punctuation]
indicates whether or not punctation marks have been retained as content within the text.
none
no punctuation marks have been retained
some
some punctuation marks have been retained
all
all punctuation marks have been retained
indicates the positioning of punctuation marks that are associated with marked up text as being encoded within the element surrounding the text or immediately before or after it.
internal
punctuation marks found at the start or end of a marked up text component are included within its surrounding element;
external
punctuation marks found at the start or end of a marked up text component appear immediately before or after the surrounding element
(tagging declaration) provides detailed information about the tagging applied to a document. [2.3.4. The Tagging Declaration 2.3. The Encoding Description]
indicates whether the element types listed exhaustively include all those found within <text>
, or represent only a subset.
(element usage) documents the usage of a specific element within a specified document. [2.3.4. The Tagging Declaration]
(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent <namespace>
element.
specifies the number of occurrences of this element within the text.
(with unique identifier) specifies the number of occurrences of this element within the text which bear a distinct value for the global @xml:id
attribute.
(namespace) supplies the formal name of the namespace to which the elements documented by its children belong. [2.3.4. The Tagging Declaration]
specifies the full formal name of the namespace concerned.
\S*
(rendition) supplies information about the rendition or appearance of one or more elements in the source text. [2.3.4. The Tagging Declaration]
where CSS is used, provides a way of defining pseudo-elements, that is, styling rules applicable to specific sub-portions of an element.
Sample values include: 1] first-line; 2] first-letter; 3] before; 4] after
[^\p{C}\p{Z}]+
contains a selector or series of selectors specifying the elements to which the contained style description applies, expressed in the language specified in the @scheme
attribute.
(style definition language declaration) specifies the name of the formal language in which style or renditional information is supplied elsewhere in the document. The specific version of the scheme may also be supplied. [2.3.5. The Default Style Definition Language Declaration]
(references declaration) specifies how canonical references are constructed for this text. [2.3.6.3. Milestone Method 2.3. The Encoding Description 2.3.6. The Reference System Declaration]
(citation structure) declares a structure and method for citing the current document. [3.11.4. Declaring Reference Systems 16.2.5.4. Citation Structures]
(delimiter) supplies a delimiting string preceding the structural component.
.+
(match) supplies an XPath selection pattern using the syntax defined in which identifies a set of nodes which are citable structural components. The expression may be absolute (beginning with `/`) or relative. @match
on a <citeStructure>
without a <citeStructure>
parent must be an absolute XPath. If it is relative, its context is set by the @match
of the parent <citeStructure>
.
An XPath in @match on the outer must start with '/'.
An XPath in @match must not start with '/' except on the outer .
(unit) describes the structural unit indicated by the <citeStructure>
.
Sample values include: 1] book; 2] chapter; 3] entry; 4] poem; 5] letter; 6] line; 7] section; 8] verse; 9] volume
[^\p{C}\p{Z}]+
(citation data) specifies how information may be extracted from citation structures. [3.11.4. Declaring Reference Systems 16.2.5.4. Citation Structures]
(property) A URI indicating a property definition.
\S+
(canonical reference pattern) specifies an expression and replacement pattern for transforming a canonical reference into a URI. [2.3.6.3. Milestone Method 2.3.6. The Reference System Declaration 2.3.6.2. Search-and-Replace Method]
(prefix definition) defines a prefixing scheme used in teidata.pointer values, showing how abbreviated URIs using the scheme may be expanded into full URIs. [16.2.3. Using Abbreviated Pointers]
supplies a name which functions as the prefix for an abbreviated pointing scheme such as a private URI scheme. The prefix constitutes the text preceding the first colon.
[a-z][a-z0-9\+\.\-]*
(list of prefix definitions) contains a list of definitions of prefixing schemes used in teidata.pointer values, showing how abbreviated URIs using each scheme may be expanded into full URIs. [16.2.3. Using Abbreviated Pointers]
(reference state) specifies one component of a canonical reference defined by the milestone method. [2.3.6.3. Milestone Method 2.3.6. The Reference System Declaration]
specifies the fixed length of the reference component.
(delimiter) supplies a delimiting string following the reference component.
(classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text. [2.3.7. The Classification Declaration 2.3. The Encoding Description]
(taxonomy) defines a typology either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy. [2.3.7. The Classification Declaration]
(category) contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy. [2.3.7. The Classification Declaration]
(category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal <textDesc>
. [2.3.7. The Classification Declaration]
(geographic coordinates declaration) documents the notation and the datum used for geographic coordinates expressed as content of the <geo>
element elsewhere within the document. [2.3.8. The Geographic Coordinates Declaration]
supplies a commonly used code name for the datum employed.
Suggested values include: 1] WGS84 (World Geodetic System); 2] MGRS (Military Grid Reference System); 3] OSGB36 (ordnance survey great britain); 4] ED50 (European Datum coordinate system)
WGS84
(World Geodetic System) a pair of numbers to be interpreted as latitude followed by longitude according to the World Geodetic System.
MGRS
(Military Grid Reference System) the values supplied are geospatial entity object codes, based on
OSGB36
(ordnance survey great britain) the value supplied is to be interpreted as a British National Grid Reference.
ED50
(European Datum coordinate system) the value supplied is to be interpreted as latitude followed by longitude according to the European Datum coordinate system.
[^\p{C}\p{Z}]+
(unit declarations) provides information about units of measurement that are not members of the International System of Units. [2.3.9. The Unit Declaration]
(unit definition) contains descriptive information related to a specific unit of measurement. [2.3.9. The Unit Declaration]
defines how to calculate one unit of measure in terms of another. [2.3.9. The Unit Declaration]
indicates a source unit of measure that is to be converted into another unit indicated in @toUnit
.
\S+
the target unit of measurement for a conversion from a source unit referenced in @fromUnit
.
\S+
(application information) records information about an application which has edited the TEI file. [2.3.11. The Application Information Element]
provides information about an application which has acted upon the document. [2.3.11. The Application Information Element]
supplies an identifier for the application, independent of its version number or display name.
supplies a version number for the application, independent of its identifier or display name.
[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}
(text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components]
(note on hand) describes a particular style or hand distinguished within a manuscript. [10.7.2. Writing, Decoration, and Other Notations]
contains a summary or formal abstract prefixed to an existing source document by the encoder. [2.4.4. Abstracts]
(creation) contains information about the creation of a text. [2.4.1. Creation 2.4. The Profile Description]
(language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. Language Usage 2.4. The Profile Description 15.3.2. Declarable Elements]
(language) characterizes a single language or sublanguage used within a text. [2.4.2. Language Usage]
(identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global @xml:lang
attribute.
specifies the approximate percentage (by volume) of the text which uses this language.
(text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc. [2.4.3. The Text Classification]
(keywords) contains a list of keywords or phrases identifying the topic or nature of a text. [2.4.3. The Text Classification]
identifies the controlled vocabulary within which the set of keywords concerned is defined, for example by a <taxonomy>
element, or by some other resource.
\S+
(classification code) contains the classification code used for this text in some standard classification system. [2.4.3. The Text Classification]
identifies the classification system in use, as defined by, e.g. a <taxonomy>
element, or some other resource.
\S+
(category reference) specifies one or more defined categories within some taxonomy or text typology. [2.4.3. The Text Classification]
identifies the classification scheme within which the set of categories concerned is defined, for example by a <taxonomy>
element, or by some other resource.
\S+
(calendar description) contains a description of the calendar system used in any dating expression found in the text. [2.4. The Profile Description 2.4.5. Calendar Description]
(calendar) describes a calendar or dating system used in a dating formula in the text. [2.4.5. Calendar Description]
(correspondence description) contains a description of the actions related to one act of correspondence. [2.4.6. Correspondence Description]
(correspondence action) contains a structured description of the place, the name of a person/organization and the date related to the sending/receiving of a message or any other action related to the correspondence. [2.4.6. Correspondence Description]
describes the nature of the action.
Suggested values include: 1] sent; 2] received; 3] transmitted; 4] redirected; 5] forwarded
sent
information concerning the sending or dispatch of a message.
received
information concerning the receipt of a message.
transmitted
information concerning the transmission of a message, i.e. between the dispatch and the next receipt, redirect or forwarding.
redirected
information concerning the redirection of an unread message.
forwarded
information concerning the forwarding of a message.
[^\p{C}\p{Z}]+
(correspondence context) provides references to preceding or following correspondence related to this piece of correspondence. [2.4.6. Correspondence Description]
(non-TEI metadata) provides a container element into which metadata in non-TEI formats may be placed. [2.5. Non-TEI Metadata]
(revision description) summarizes the revision history for a file. [2.6. The Revision Description 2.1.1. The TEI Header and Its Components]
(change) documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.6. The Revision Description 2.4.1. Creation 11.7. Identifying Changes and Revisions]
(target) points to one or more elements that belong to this change.
\S+
describes a particular script distinguished within the description of a manuscript or similar resource. [10.7.2. Writing, Decoration, and Other Notations]
groups a number of change descriptions associated with either the creation of a source text or the revision of an encoded text. [2.6. The Revision Description 11.7. Identifying Changes and Revisions]
indicates whether the ordering of its child <change>
elements is to be considered significant or not
(TEI document) contains a single TEI-conformant document, combining a single TEI header with one or more members of the model.resource class. Multiple <TEI>
elements may be combined within a <TEI>
(or <teiCorpus>
) element. [4. Default Text Structure 15.1. Varieties of Composite Text]
specifies the version number of the TEI Guidelines against which this document is valid.
[\d]+(\.[\d]+){0,2}
(text) contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 15.1. Varieties of Composite Text]
(text body) contains the whole body of a single unitary text, excluding any front or back matter. [4. Default Text Structure]
(group) contains the body of a composite text, grouping together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose, for example the collected works of an author, a sequence of prose essays, etc. [4. Default Text Structure 4.3.1. Grouped Texts 15.1. Varieties of Composite Text]
(floating text) contains a single text of any kind, whether unitary or composite, which interrupts the text containing it at any point and after which the surrounding text resumes. [4.3.2. Floating Texts]
(text division) contains a subdivision of the front, body, or back of a text. [4.1. Divisions of the Body]
Abstract model violation: Lines may not contain higher-level structural elements such as div, unless div is a descendant of floatingText.
Abstract model violation: p and ab may not contain higher-level structural elements such as div, unless div is a descendant of floatingText.
(level-1 text division) contains a first-level subdivision of the front, body, or back of a text. [4.1.2. Numbered Divisions]
(level-2 text division) contains a second-level subdivision of the front, body, or back of a text. [4.1.2. Numbered Divisions]
(level-3 text division) contains a third-level subdivision of the front, body, or back of a text. [4.1.2. Numbered Divisions]
(level-4 text division) contains a fourth-level subdivision of the front, body, or back of a text. [4.1.2. Numbered Divisions]
(level-5 text division) contains a fifth-level subdivision of the front, body, or back of a text. [4.1.2. Numbered Divisions]
(level-6 text division) contains a sixth-level subdivision of the front, body, or back of a text. [4.1.2. Numbered Divisions]
(level-7 text division) contains the smallest possible subdivision of the front, body or back of a text, larger than a paragraph. [4.1.2. Numbered Divisions]
contains a closing title or footer appearing at the end of a division of a text. [4.2.4. Content of Textual Divisions 4.2. Elements Common to All Divisions]
(byline) contains the primary statement of responsibility given for a work on its title page or at the head or end of the work. [4.2.2. Openers and Closers 4.5. Front Matter]
(dateline) contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer. [4.2.2. Openers and Closers]
(argument) contains a formal list or prose description of the topics addressed by a subdivision of a text. [4.2. Elements Common to All Divisions 4.6. Title Pages]
(epigraph) contains a quotation, anonymous or attributed, appearing at the start or end of a section or on a title page. [4.2.3. Arguments, Epigraphs, and Postscripts 4.2. Elements Common to All Divisions 4.6. Title Pages]
(opener) groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter. [4.2. Elements Common to All Divisions]
(closer) groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter. [4.2.2. Openers and Closers 4.2. Elements Common to All Divisions]
(salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc. [4.2.2. Openers and Closers]
(signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text. [4.2.2. Openers and Closers]
contains a postscript, e.g. to a letter. [4.2. Elements Common to All Divisions]
(title page) contains the title page of a text, appearing within the front or back matter. [4.6. Title Pages]
classifies the title page according to any convenient typology.
[^\p{C}\p{Z}]+
(document title) contains the title of a document, including all its constituents, as given on a title page. [4.6. Title Pages]
(title part) contains a subsection or division of the title of a work, as indicated on a title page. [4.6. Title Pages]
(type) specifies the role of this subdivision of the title.
Suggested values include: 1] main (main); 2] sub (subordinate); 3] alt (alternate); 4] short (short); 5] desc (descriptive)
main
(main) main title of the work
sub
(subordinate) subtitle of the work
alt
(alternate) alternative title of the work
short
(short) abbreviated form of title
desc
(descriptive) descriptive paraphrase of the work
[^\p{C}\p{Z}]+
(document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline). [4.6. Title Pages]
(imprimatur) contains a formal statement authorizing the publication of a work, sometimes required to appear on a title page or its verso. [4.6. Title Pages]
(document edition) contains an edition statement as presented on a title page of a document. [4.6. Title Pages]
(document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page. [4.6. Title Pages]
(document date) contains the date of a document, as given on a title page or in a dateline. [4.6. Title Pages]
(when) gives the value of the date in standard form, i.e. YYYY-MM-DD.
(front matter) contains any prefatory matter (headers, abstracts, title page, prefaces, dedications, etc.) found at the start of a document, before the main body. [4.6. Title Pages 4. Default Text Structure]
(back matter) contains any appendixes, etc. following the main part of a text. [4.7. Back Matter 4. Default Text Structure]
provides the name of the character or glyph property being defined.
provides the value of the character or glyph property being defined.
specifies the version number of the Unicode Standard in which this property name is defined.
Suggested values include: 1] 1.0.1; 2] 1.1; 3] 2.0; 4] 2.1; 5] 3.0; 6] 3.1; 7] 3.2; 8] 4.0; 9] 4.1; 10] 5.0; 11] 5.1; 12] 5.2; 13] 6.0; 14] 6.1; 15] 6.2; 16] 6.3; 17] 7.0; 18] 8.0; 19] 9.0; 20] 10.0; 21] 11.0; 22] 12.0; 23] 12.1; 24] unassigned
1.0.1
1.1
2.0
2.1
3.0
3.1
3.2
4.0
4.1
5.0
5.1
5.2
6.0
6.1
6.2
6.3
7.0
8.0
9.0
10.0
11.0
12.0
12.1
unassigned
[^\p{C}\p{Z}]+
(character or glyph) represents a glyph, or a non-standard character. [5. Characters, Glyphs, and Writing Modes]
points to a description of the character or glyph intended.
\S+
(character declarations) provides information about nonstandard characters and glyphs. [5.2. Markup Constructs for Representation of Characters and Glyphs]
(character) provides descriptive information about a character. [5.2. Markup Constructs for Representation of Characters and Glyphs]
(character glyph) provides descriptive information about a character glyph. [5.2. Markup Constructs for Representation of Characters and Glyphs]
(locally defined property) provides a locally defined character (or glyph) property. [5.2.1. Character Properties]
(character mapping) contains one or more characters which are related to the parent character or glyph in some respect, as specified by the @type
attribute. [5.2. Markup Constructs for Representation of Characters and Glyphs]
(unihan property) holds the name and value of a normative or informative Unihan character (or glyph) property as part of its attributes. [5.2.1. Character Properties]
specifies the normalized name of a unicode han database (Unihan) property
kZVariant
kAccountingNumeric
kBigFive
kCCCII
kCNS1986
kCNS1992
kCangjie
kCantonese
kCheungBauer
kCheungBauerIndex
kCihaiT
kCompatibilityVariant
kCowles
kDaeJaweon
kDefinition
kEACC
kFenn
kFennIndex
kFourCornerCode
kFrequency
kGB0
kGB1
kGB3
kGB5
kGB7
kGB8
kGSR
kGradeLevel
kHDZRadBreak
kHKGlyph
kHKSCS
kHanYu
kHangul
kHanyuPinlu
kHanyuPinyin
kIBMJapan
kIICore
kIRGDaeJaweon
kIRGDaiKanwaZiten
kIRGHanyuDaZidian
kIRGKangXi
kIRG_GSource
kIRG_HSource
kIRG_JSource
kIRG_KPSource
kIRG_KSource
kIRG_MSource
kIRG_TSource
kIRG_USource
kIRG_VSource
kJIS0213
kJa
kJapaneseKun
kJapaneseOn
kJinmeiyoKanji
kJis0
kJis1
kJoyoKanji
kKPS0
kKPS1
kKSC0
kKSC1
kKangXi
kKarlgren
kKorean
kKoreanEducationHanja
kKoreanName
kLau
kMainlandTelegraph
kMandarin
kMatthews
kMeyerWempe
kMorohashi
kNelson
kOtherNumeric
kPhonetic
kPrimaryNumeric
kPseudoGB1
kRSAdobe_Japan1_6
kRSJapanese
kRSKanWa
kRSKangXi
kRSKorean
kRSUnicode
kSBGY
kSemanticVariant
kSimplifiedVariant
kSpecializedSemanticVariant
kTGH
kTaiwanTelegraph
kTang
kTotalStrokes
kTraditionalVariant
kVietnamese
kXHC1983
kXerox
specifies the value of a named Unihan property
[^\p{C}\p{Z}]+
(unicode property) provides a Unicode property for a character (or glyph). [5.2.1. Character Properties]
specifies the normalized name of a Unicode property.
Age
AHex
Alpha
Alphabetic
ASCII_Hex_Digit
bc
Bidi_C
Bidi_Class
Bidi_Control
Bidi_M
Bidi_Mirrored
Bidi_Mirroring_Glyph
Bidi_Paired_Bracket
Bidi_Paired_Bracket_Type
blk
Block
bmg
bpb
bpt
Canonical_Combining_Class
Case_Folding
Case_Ignorable
Cased
ccc
CE
cf
Changes_When_Casefolded
Changes_When_Casemapped
Changes_When_Lowercased
Changes_When_NFKC_Casefolded
Changes_When_Titlecased
Changes_When_Uppercased
CI
Comp_Ex
Composition_Exclusion
CWCF
CWCM
CWKCF
CWL
CWT
CWU
Dash
Decomposition_Mapping
Decomposition_Type
Default_Ignorable_Code_Point
Dep
Deprecated
DI
Dia
Diacritic
dm
dt
ea
East_Asian_Width
EqUIdeo
Equivalent_Unified_Ideograph
Expands_On_NFC
Expands_On_NFD
Expands_On_NFKC
Expands_On_NFKD
Ext
Extender
FC_NFKC
FC_NFKC_Closure
Full_Composition_Exclusion
gc
GCB
General_Category
Gr_Base
Gr_Ext
Gr_Link
Grapheme_Base
Grapheme_Cluster_Break
Grapheme_Extend
Grapheme_Link
Hangul_Syllable_Type
Hex
Hex_Digit
hst
Hyphen
ID_Continue
ID_Start
IDC
Ideo
Ideographic
IDS
IDS_Binary_Operator
IDS_Trinary_Operator
IDSB
IDST
Indic_Positional_Category
Indic_Syllabic_Category
InPC
InSC
isc
ISO_Comment
Jamo_Short_Name
jg
Join_C
Join_Control
Joining_Group
Joining_Type
JSN
jt
kAccountingNumeric
kCompatibilityVariant
kIICore
kIRG_GSource
kIRG_HSource
kIRG_JSource
kIRG_KPSource
kIRG_KSource
kIRG_MSource
kIRG_TSource
kIRG_USource
kIRG_VSource
kOtherNumeric
kPrimaryNumeric
kRSUnicode
lb
lc
Line_Break
LOE
Logical_Order_Exception
Lower
Lowercase
Lowercase_Mapping
Math
na
na1
Name
Name_Alias
NChar
NFC_QC
NFC_Quick_Check
NFD_QC
NFD_Quick_Check
NFKC_Casefold
NFKC_CF
NFKC_QC
NFKC_Quick_Check
NFKD_QC
NFKD_Quick_Check
Noncharacter_Code_Point
nt
Numeric_Type
Numeric_Value
nv
OAlpha
ODI
OGr_Ext
OIDC
OIDS
OLower
OMath
Other_Alphabetic
Other_Default_Ignorable_Code_Point
Other_Grapheme_Extend
Other_ID_Continue
Other_ID_Start
Other_Lowercase
Other_Math
Other_Uppercase
OUpper
Pat_Syn
Pat_WS
Pattern_Syntax
Pattern_White_Space
PCM
Prepended_Concatenation_Mark
QMark
Quotation_Mark
Radical
Regional_Indicator
RI
SB
sc
scf
Script
Script_Extensions
scx
SD
Sentence_Break
Sentence_Terminal
Simple_Case_Folding
Simple_Lowercase_Mapping
Simple_Titlecase_Mapping
Simple_Uppercase_Mapping
slc
Soft_Dotted
stc
STerm
suc
tc
Term
Terminal_Punctuation
Titlecase_Mapping
uc
UIdeo
Unicode_1_Name
Unified_Ideograph
Upper
Uppercase
Uppercase_Mapping
Variation_Selector
Vertical_Orientation
vo
VS
WB
White_Space
Word_Break
WSpace
XID_Continue
XID_Start
XIDC
XIDS
XO_NFC
XO_NFD
XO_NFKC
XO_NFKD
specifies the value of a named Unicode property.
(metrical structure, conventional) contains a user-specified encoding for the conventional metrical structure of the element.
(metrical structure, realized) contains a user-specified encoding for the actual realization of the conventional metrical structure applicable to the element.
(rhyme scheme) specifies the rhyme scheme applicable to a group of verse lines.
(enjambement) indicates that the end of a verse line is marked by enjambement.
Sample values include: 1] no; 2] yes; 3] weak; 4] strong
[^\p{C}\p{Z}]+
(metrical notation declaration) documents the notation employed to represent a metrical pattern when this is specified as the value of a @met
, @real
, or @rhyme
attribute on any structural element of a metrical text (e.g. <lg>
, <l>
, or <seg>
). [6.6. Metrical Notation Declaration 6.4. Rhyme and Metrical Analysis]
indicates whether the notation conveys the abstract metrical form, its actual prosodic realization, or the rhyme scheme, or some combination thereof.
met
(met attribute) declaration applies to the abstract metrical form recorded on the @met
attribute
real
(real attribute) declaration applies to the actual realization of the conventional metrical structure recorded on the @real
attribute
rhyme
declaration applies to the rhyme scheme recorded on the @rhyme
attribute
met
(met attribute) declaration applies to the abstract metrical form recorded on the @met
attribute
real
(real attribute) declaration applies to the actual realization of the conventional metrical structure recorded on the @real
attribute
rhyme
declaration applies to the rhyme scheme recorded on the @rhyme
attribute
met
(met attribute) declaration applies to the abstract metrical form recorded on the @met
attribute
real
(real attribute) declaration applies to the actual realization of the conventional metrical structure recorded on the @real
attribute
rhyme
declaration applies to the rhyme scheme recorded on the @rhyme
attribute
(regular expression pattern) specifies a regular expression defining any value that is legal for this notation.
(metrical notation symbol) documents the intended significance of a particular character or character sequence within a metrical notation, either explicitly or in terms of other <metSym>
elements in the same <metDecl>
. [6.6. Metrical Notation Declaration]
specifies the character or character sequence being documented.
[^\p{C}\p{Z}]+
specifies whether the symbol is defined in terms of other symbols (@terminal
is set to false) or in prose (@terminal
is set to true).
marks the point at which a metrical line may be divided. [6.2. Components of the Verse Line]
marks the rhyming part of a metrical line. [6.5. Rhyme]
provides a label (usually a single letter) to identify which part of a rhyme scheme this rhyming string instantiates.
[^\p{C}\p{Z}]+
(setting) contains a description of the setting, time, locale, appearance, etc., of the action of a play, typically found in the front matter of a printed performance text (not a stage direction). [7.1. Front and Back Matter
]
(prologue) contains the prologue to a drama, typically spoken by an actor out of character, possibly in association with a particular performance or venue. [7.1.2. Prologues and Epilogues 7.1. Front and Back Matter
]
(epilogue) contains the epilogue to a drama, typically spoken by an actor out of character, possibly in association with a particular performance or venue. [7.1.2. Prologues and Epilogues 7.1. Front and Back Matter
]
(performance) contains a section of front or back matter describing how a dramatic piece is to be performed in general or how it was performed on some specific occasion. [7.1.3. Records of Performances 7.1. Front and Back Matter
]
(cast list) contains a single cast list or dramatis personae. [7.1.4. Cast Lists 7.1. Front and Back Matter
]
(cast list grouping) groups one or more individual <castItem>
elements within a cast list. [7.1.4. Cast Lists]
(cast list item) contains a single entry within a cast list, describing either a single role or a list of non-speaking roles. [7.1.4. Cast Lists]
characterizes the cast item.
role
the item describes a single role.
list
the item describes a list of non-speaking roles.
(role) contains the name of a dramatic role, as given in a cast list. [7.1.4. Cast Lists]
specifies the gender of the role.
[^\p{C}\p{Z}]+
(role description) describes a character's role in a drama. [7.1.4. Cast Lists]
contains the name of an actor appearing within a cast list. [7.1.4. Cast Lists]
specifies the sex of the actor.
[^\p{C}\p{Z}]+
specifies the gender of the actor.
[^\p{C}\p{Z}]+
(speech group) contains a group of speeches or songs in a performance text presented in a source as constituting a single unit or number. [7.2.3. Grouped Speeches]
(movement) marks the actual movement of one or more characters. [7.2.4. Stage Directions]
characterizes the movement, for example as an entrance or exit.
Suggested values include: 1] entrance; 2] exit; 3] onStage
entrance
character is entering the stage.
exit
character is exiting the stage.
onStage
character moves on stage
[^\p{C}\p{Z}]+
specifies the direction of a stage movement.
Sample values include: 1] L (left); 2] R (right); 3] C (center)
[^\p{C}\p{Z}]+
\S+
(performance) identifies the performance or performances in which this movement occurred as specified by pointing to one or more <performance>
elements.
\S+
(view) describes the visual context of some part of a screen play in terms of what the spectator sees, generally independent of any dialogue. [7.3.1. Technical Information 7.3. Other Types of Performance Text]
(camera) describes a particular camera angle or viewpoint in a screen play. [7.3.1. Technical Information 7.3. Other Types of Performance Text]
(sound) describes a sound effect or musical sequence specified within a screen play or radio script. [7.3.1. Technical Information 7.3. Other Types of Performance Text]
categorizes the sound in some respect, e.g. as music, special effect, etc.
[^\p{C}\p{Z}]+
indicates whether the sound overlaps the surrounding speeches or interrupts them.
unknown
inapplicable
(caption) contains the text of a caption or other text displayed as part of a film script or screenplay. [7.3.1. Technical Information 7.3. Other Types of Performance Text]
(technical stage direction) describes a special-purpose stage direction that is not meant for the actors. [7.3.1. Technical Information]
categorizes the technical stage direction.
Suggested values include: 1] light; 2] sound; 3] prop; 4] block
light
a lighting cue
sound
a sound cue
prop
a prop cue
block
a blocking instruction
[^\p{C}\p{Z}]+
(performance) points to one or more <performance>
elements documenting the performance or performances to which this technical direction applies.
\S+
(script statement) contains a citation giving details of the script used for a spoken text. [8.2. Documenting the Source of Transcribed Speech 2.2.7. The Source Description 15.3.2. Declarable Elements]
(recording statement) describes a set of recordings used as the basis for transcription of a spoken text. [8.2. Documenting the Source of Transcribed Speech 2.2.7. The Source Description]
(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast. [8.2. Documenting the Source of Transcribed Speech 15.3.2. Declarable Elements]
the kind of recording.
audio
audio recording
video
audio and video recording
(equipment) provides technical details of the equipment and media used for an audio or video recording used as the source for a spoken text. [8.2. Documenting the Source of Transcribed Speech 15.3.2. Declarable Elements]
(broadcast) describes a broadcast used as the source of a spoken text. [8.2. Documenting the Source of Transcribed Speech 15.3.2. Declarable Elements]
(transcription description) describes the set of transcription conventions used, particularly for spoken material. [8.2. Documenting the Source of Transcribed Speech]
supplies an identifier for the encoding convention, independent of any version number.
supplies a version number for the encoding conventions used, if any.
[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}
(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker. [8.3.1. Utterances]
(transition) indicates the nature of the transition between this utterance and the previous one.
smooth
this utterance begins without unusual pause or rapidity.
latching
this utterance begins with a markedly shorter pause than normal.
overlap
this utterance begins before the previous one has finished.
pause
this utterance begins after a noticeable pause.
(pause) marks a pause either between or within utterances. [8.3.2. Pausing]
(vocal) marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc. [8.3.3. Vocal, Kinesic, Incident]
indicates whether or not the phenomenon is repeated.
unknown
inapplicable
(kinesic) marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc. [8.3.3. Vocal, Kinesic, Incident]
indicates whether or not the phenomenon is repeated.
unknown
inapplicable
(incident) marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication. [8.3.3. Vocal, Kinesic, Incident]
(writing) contains a passage of written text revealed to participants in the course of a spoken text. [8.3.4. Writing]
indicates whether the writing is revealed all at once or gradually.
unknown
inapplicable
(shift) marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes. [8.3.6. Shifts]
The @new attribute should always be supplied; use the special value
"normal" to indicate that the feature concerned ceases to be
remarkable at this point.
a paralinguistic feature.
Suggested values include: 1] tempo; 2] loud; 3] pitch; 4] tension; 5] rhythm; 6] voice
tempo
speed of utterance.
loud
loudness.
pitch
pitch range.
tension
tension or stress pattern.
rhythm
rhythmic qualities.
voice
voice quality.
[^\p{C}\p{Z}]+
specifies the new state of the paralinguistic feature specified.
[^\p{C}\p{Z}]+
groups together various annotations, e.g. for parallel interpretations of a spoken segment. [8.4.6. Analytic Coding]
(normalized) provides the normalized/standardized form of information present in the source text in a non-normalized form
(original) gives the original string or is the empty string when the element does not appear in the source text.
provides a lemma (base form) for the word, typically uninflected and serving both as an identifier (e.g. in dictionary contexts, as a headword), and as a basis for potential inflections.
provides a pointer to a definition of the lemma for the word, for example in an online lexicon.
\S+
(part of speech) indicates the part of speech assigned to a token (i.e. information on whether it is a noun, adjective, or verb), usually according to some official reference vocabulary (e.g. for German: STTS, for English: CLAWS, for Polish: NKJP, etc.).
(morphosyntactic description) supplies morphosyntactic information for a token, usually according to some official reference vocabulary (e.g. for German: STTS-large tagset; for a feature description system designed as (pragmatically) universal, see Universal Features).
when present, provides information on whether the token in question is adjacent to another, and if so, on which side.
no
the token is not adjacent to another
left
there is no whitespace on the left side of the token
right
there is no whitespace on the right side of the token
both
there is no whitespace on either side of the token
overlap
the token overlaps with another; other devices (specifying the extent and the area of overlap) are needed to more precisely locate this token in the character stream
(analysis) indicates one or more elements containing interpretations of the element on which the @ana
attribute appears.
\S+
(s-unit) contains a sentence-like division of a text. [17.1. Linguistic Segment Categories 8.4.1. Segmentation]
You may not nest one s element within
another: use seg instead
(clause) represents a grammatical clause. [17.1. Linguistic Segment Categories]
(phrase) represents a grammatical phrase. [17.1. Linguistic Segment Categories]
(word) represents a grammatical (not necessarily orthographic) word. [17.1. Linguistic Segment Categories 17.4.2. Lightweight Linguistic Annotation]
(morpheme) represents a grammatical morpheme. [17.1. Linguistic Segment Categories]
supplies the morpheme's base form.
[^\p{C}\p{Z}]+
(character) represents a character. [17.1. Linguistic Segment Categories]
(punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark. [17.1.2. Below the Word Level 17.4.2. Lightweight Linguistic Annotation]
indicates the extent to which this punctuation mark conventionally separates words or phrases
strong
the punctuation mark is a word separator
weak
the punctuation mark is not a word separator
inter
the punctuation mark may or may not be a word separator
provides a name for the kind of unit delimited by this punctuation mark.
[^\p{C}\p{Z}]+
indicates whether this punctuation mark precedes or follows the unit it delimits.
associates an interpretative annotation directly with a span of text. [17.3. Spans and Interpretations]
Only one of the attributes @target and @from may be supplied on
Only one of the attributes @target and @to may be supplied on
If @to is supplied on , @from must be supplied as well
The attributes @to and @from on may each contain only a single value
gives the identifier of the node which is the starting point of the span of text being annotated; if not accompanied by a @to
attribute, gives the identifier of the node of the entire span of text being annotated.
\S+
gives the identifier of the node which is the end-point of the span of text being annotated.
\S+
(span group) collects together span tags. [17.3. Spans and Interpretations]
(interpretation) summarizes a specific interpretative annotation which can be linked to a span of text. [17.3. Spans and Interpretations]
(interpretation group) collects together a set of related interpretations which share responsibility or type. [17.3. Spans and Interpretations]
indicates type of entry, in dictionaries with multiple types.
Suggested values include: 1] main; 2] hom (homograph); 3] xref (cross reference); 4] affix; 5] abbr (abbreviation); 6] supplemental; 7] foreign
main
a main entry (default).
hom
(homograph) groups information relating to one homograph within an entry.
xref
(cross reference) a reduced entry whose only function is to point to another main entry (e.g. for forms of an irregular verb or for variant spellings: was pointing to be, or esthete to aesthete).
affix
an entry for a prefix, infix, or suffix.
abbr
(abbreviation) an entry for an abbreviation.
supplemental
a supplemental entry (for use in dictionaries which issue supplements to their main work in which they include updated information about entries).
foreign
an entry for a foreign word in a monolingual dictionary.
[^\p{C}\p{Z}]+
(expand) gives an expanded form of information presented more concisely in the dictionary
(split) gives the list of split values for a merged form
(value) gives a value which lacks any realization in the printed source text.
(location) indicates an <anchor>
element typically elsewhere in the document, but possibly in another document, which is the original location of this component.
\S+
(merged into) gives a reference to another element, where the original appears as a merged form.
\S+
(optional) indicates whether the element is optional or not
(super entry) groups a sequence of entries within any kind of lexical resource, such as a dictionary or lexicon which function as a single unit, for example a set of homographs. [9.1. Dictionary Body and Overall Structure]
(entry) contains a single structured entry in any kind of lexical resource, such as a dictionary or lexicon. [9.1. Dictionary Body and Overall Structure 9.2. The Structure of Dictionary Entries]
(unstructured entry) contains a single unstructured entry in any kind of lexical resource, such as a dictionary or lexicon. [9.1. Dictionary Body and Overall Structure 9.2. The Structure of Dictionary Entries]
(homograph) groups information relating to one homograph within an entry. [9.2. The Structure of Dictionary Entries]
groups together all information relating to one word sense in a dictionary entry, for example definitions, examples, and translation equivalents. [9.2. The Structure of Dictionary Entries]
gives the nesting depth of this sense.
(dictionary scrap) encloses a part of a dictionary entry in which other phrase-level dictionary elements are freely combined. [9.1. Dictionary Body and Overall Structure 9.2. The Structure of Dictionary Entries]
(form information group) groups all the information on the written and spoken forms of one headword. [9.3.1. Information on Written and Spoken Forms]
classifies form as simple, compound, etc.
Suggested values include: 1] simple; 2] lemma; 3] variant; 4] compound; 5] derivative; 6] inflected; 7] phrase
simple
single free lexical item
lemma
the headword itself
variant
a variant form
compound
word formed from simple lexical items
derivative
word derived from headword
inflected
word in other than usual dictionary form
phrase
multiple-word lexical item
[^\p{C}\p{Z}]+
(orthographic form) gives the orthographic form of a dictionary headword. [9.3.1. Information on Written and Spoken Forms]
gives the type of spelling.
[^\p{C}\p{Z}]+
(pronunciation) contains the pronunciation(s) of the word. [9.3.1. Information on Written and Spoken Forms]
(hyphenation) contains a hyphenated form of a dictionary headword, or hyphenation information in some other form. [9.3.1. Information on Written and Spoken Forms]
(syllabification) contains the syllabification of the headword. [9.3.1. Information on Written and Spoken Forms]
(stress) contains the stress pattern for a dictionary headword, if given separately. [9.3.1. Information on Written and Spoken Forms]
(grammatical information) within an entry in a dictionary or a terminological data file, contains grammatical information relating to a term, word, or form. [9.3.2. Grammatical Information]
classifies grammatical information provided by <gram>
according to some convenient and shared typology, ideally one defined in an external reference taxonomy, such as the CLARIN Concept
Registry.
Sample values include: 1] pos (part of speech); 2] gen (gender); 3] num (number); 4] animate; 5] proper
[^\p{C}\p{Z}]+
(gender) identifies the morphological gender of a lexical item, as given in the dictionary. [9.3.1. Information on Written and Spoken Forms]
(number) indicates grammatical number associated with a form, as given in a dictionary. [9.3.1. Information on Written and Spoken Forms 9.3.2. Grammatical Information]
(case) contains grammatical case information given by a dictionary for a given form. [9.3.1. Information on Written and Spoken Forms]
(person) contains an indication of the grammatical person (1st, 2nd, 3rd, etc.) associated with a given inflected form in a dictionary. [9.3.1. Information on Written and Spoken Forms]
(tense) indicates the grammatical tense associated with a given inflected form in a dictionary. [9.3.1. Information on Written and Spoken Forms]
(mood) contains information about the grammatical mood of verbs (e.g. indicative, subjunctive, imperative). [9.3.1. Information on Written and Spoken Forms]
(inflectional class) indicates the inflectional class associated with a lexical item. [9.3.1. Information on Written and Spoken Forms]
indicates the type of indicator used to specify the inflection class, when it is necessary to distinguish between the usual abbreviated indications (e.g. inv) and other kinds of indicators, such as special codes referring to conjugation patterns, etc.
Sample values include: 1] abbrev; 2] verbTable
[^\p{C}\p{Z}]+
(grammatical information group) groups morpho-syntactic information about a lexical item, e.g. <pos>
, <gen>
, <number>
, <case>
, or <iType>
(inflectional class). [9.3.2. Grammatical Information]
(part of speech) indicates the part of speech assigned to a dictionary headword such as noun, verb, or adjective. [9.3.2. Grammatical Information]
(subcategorization) contains subcategorization information (transitive/intransitive, countable/non-countable, etc.) [9.3.2. Grammatical Information]
(collocate) contains any sequence of words that co-occur with the headword with significant frequency. [9.3.2. Grammatical Information]
(definition) contains definition text in a dictionary entry. [9.3.3.1. Definitions]
(etymology) encloses the etymological information in a dictionary entry. [9.3.4. Etymological Information]
(language name) contains the name of a language mentioned in etymological or other linguistic discussion. [9.3.4. Etymological Information]
(usage) contains usage information in a dictionary entry. [9.3.5.2. Usage Information and Other Labels]
classifies the usage information using any convenient typology.
Sample values include: 1] geo (geographic); 2] time; 3] dom (domain); 4] register (register); 5] style; 6] plev (preference level); 7] lang (language); 8] gram (grammatical); 9] syn (synonym); 10] hyper (hypernym); 11] colloc (collocation); 12] comp (complement); 13] obj (object); 14] subj (subject); 15] verb; 16] hint
[^\p{C}\p{Z}]+
(label) contains a label for a form, example, translation, or other piece of information, e.g. abbreviation for, contraction of, literally, approximately, synonyms:, etc. [9.3.1. Information on Written and Spoken Forms 9.3.3.2. Translation Equivalents 9.3.5.3. Cross-References to Other Entries]
classifies the label using any convenient typology.
[^\p{C}\p{Z}]+
(cross-reference phrase) contains a phrase, sentence, or icon referring the reader to some other location in this or another text. [9.3.5.3. Cross-References to Other Entries]
indicates the type of cross reference, using any convenient typology.
Sample values include: 1] syn (synonym); 2] etym (etymological); 3] cf (compare or consult); 4] illus (illustration)
[^\p{C}\p{Z}]+
(related entry) contains a dictionary entry for a lexical item related to the headword, such as a compound phrase or derived form, embedded inside a larger entry. [9.3.6. Related Entries]
(orthographic-form reference) in a dictionary example, indicates a reference to the orthographic form(s) of the headword. [9.4. Headword and Pronunciation References]
indicates the kind of typographic modification made to the headword in the reference.
Sample values include: 1] cap (capital); 2] noHyph (no hyphen)
[^\p{C}\p{Z}]+
(pronunciation reference) in a dictionary example, indicates a reference to the pronunciation(s) of the headword. [9.4. Headword and Pronunciation References]
indicates whether the passage being quoted is defective, i.e. incomplete through loss or damage.
unknown
inapplicable
identifies the text types or classifications applicable to this item by pointing to other elements or resources defining the classification concerned.
\S+
(manuscript description) contains a description of a single identifiable manuscript or other text-bearing object such as an early printed book. [10.1. Overview]
(catchwords) describes the system used to ensure correct ordering of the quires or similar making up a codex, incunable, or other object typically by means of annotations at the foot of the page. [10.3.7. Catchwords, Signatures, Secundo Folio]
The element should not be used outside of msDesc.
(dimensions) contains a dimensional specification. [10.3.4. Dimensions]
The element may appear once only
The element may appear once only
The element may appear once only
indicates which aspect of the object is being measured.
Sample values include: 1] leaves; 2] ruled; 3] pricked; 4] written; 5] miniatures; 6] binding; 7] box
[^\p{C}\p{Z}]+
contains any single measurement forming part of a dimensional specification of some sort. [10.3.4. Dimensions]
(height) contains a measurement measured along the axis at a right angle to the bottom of the object. [10.3.4. Dimensions]
(depth) contains a measurement from the front to the back of an object, perpendicular to the measurement given by the <width>
element. [10.3.4. Dimensions]
(width) contains a measurement of an object along the axis parallel to its bottom, e.g. perpendicular to the spine of a book or codex. [10.3.4. Dimensions]
(heraldry) contains a heraldic formula or phrase, typically found as part of a blazon, coat of arms, etc. [10.3.8. Heraldry]
(locus) defines a location within a manuscript, manuscript part, or other object typically as a (possibly discontinuous) sequence of folio references. [10.3.5. References to Locations within a Manuscript]
(scheme) identifies the foliation scheme in terms of which the location is being specified by pointing to some <foliation>
element defining it, or to some other equivalent resource.
\S+
(from) specifies the starting point of the location in a normalized form, typically a page number.
[^\p{C}\p{Z}]+
(to) specifies the end-point of the location in a normalized form, typically as a page number.
[^\p{C}\p{Z}]+
(locus group) groups a number of locations which together form a distinct but discontinuous item within a manuscript, manuscript part, or other object. [10.3.5. References to Locations within a Manuscript]
(scheme) identifies the foliation scheme in terms of which all the locations contained by the group are specified by pointing to some <foliation>
element defining it, or to some other equivalent resource.
\S+
(material) contains a word or phrase describing the material of which the object being described is composed. [10.3.2. Material and Object Type]
describes the function or use of the material in relation to the object as a whole.
Sample values include: 1] binding; 2] endband; 3] slipcase; 4] support; 5] tie
[^\p{C}\p{Z}]+
identifies one or more elements to which the metamark applies.
\S+
(object type) contains a word or phrase describing the type of object being referred to. [10.3.2. Material and Object Type]
(origin date) contains any form of date, used to identify the date of origin for a manuscript, manuscript part, or other object. [10.3.1. Origination]
(origin place) contains any form of place name, used to identify the place of origin for a manuscript, manuscript part, or other object. [10.3.1. Origination]
(second folio) marks the word or words taken from a fixed point in a codex (typically the beginning of the second leaf) in order to provide a unique identifier for it. [10.3.7. Catchwords, Signatures, Secundo Folio]
The element should not be used outside of msDesc.
(signatures) contains discussion of the leaf or quire signatures found within a codex or similar object. [10.3.7. Catchwords, Signatures, Secundo Folio]
The element should not be used outside of msDesc.
(stamp) contains a word or phrase describing a stamp or similar device. [10.3.3. Watermarks and Stamps]
(watermark) contains a word or phrase describing a watermark or similar device. [10.3.3. Watermarks and Stamps]
(manuscript identifier) contains the information required to identify the manuscript or similar object being described. [10.4. The Manuscript Identifier]
An msIdentifier must contain either a repository or location.
(institution) contains the name of an organization such as a university or library, with which a manuscript or other object is identified, generally its holding institution. [10.4. The Manuscript Identifier]
(repository) contains the name of a repository within which manuscripts or other objects are stored, possibly forming part of an institution. [10.4. The Manuscript Identifier]
(collection) contains the name of a collection of manuscripts or other objects, not necessarily located within a single repository. [10.4. The Manuscript Identifier]
(alternative identifier) contains an alternative or former structured identifier used for a manuscript or other object, such as a former catalogue number. [10.4. The Manuscript Identifier]
(alternative name) contains any form of unstructured alternative name used for a manuscript or other object, such as an ocellus nominum, or nickname. [10.4. The Manuscript Identifier]
(colophon) contains the colophon of an item: that is, a statement providing information regarding the date, place, agency, or reason for production of the manuscript or other object. [10.6.1. The msItem and msItemStruct Elements]
(explicit) contains the explicit of a item, that is, the closing words of the text proper, exclusive of any rubric or colophon which might follow it. [10.6.1. The msItem and msItemStruct Elements]
(filiation) contains information concerning the manuscript or other object's filiation, i.e. its relationship to other surviving manuscripts or other objects of the same text or contents, its protographs, antigraphs and apographs. [10.6.1. The msItem and msItemStruct Elements]
(final rubric) contains the string of words that denotes the end of a text division, often with an assertion as to its author and title, usually set off from the text itself by red ink, by a different size or type of script, or by some other such visual device. [10.6.1. The msItem and msItemStruct Elements]
contains the incipit of a manuscript or similar object item, that is the opening words of the text proper, exclusive of any rubric which might precede it, of sufficient length to identify the work uniquely; such incipits were, in former times, frequently used a means of reference to a work, in place of a title. [10.6.1. The msItem and msItemStruct Elements]
(manuscript contents) describes the intellectual content of a manuscript, manuscript part, or other object either as a series of paragraphs or as a series of structured manuscript items. [10.6. Intellectual Content]
(manuscript item) describes an individual work or item within the intellectual content of a manuscript, manuscript part, or other object. [10.6.1. The msItem and msItemStruct Elements]
(structured manuscript item) contains a structured description for an individual work or item within the intellectual content of a manuscript, manuscript part, or other object. [10.6.1. The msItem and msItemStruct Elements]
(rubric) contains the text of any rubric or heading attached to a particular manuscript item, that is, a string of words through which a manuscript or other object signals the beginning of a text division, often with an assertion as to its author and title, which is in some way set off from the text itself, typically in red ink, or by use of different size or type of script, or some other such visual device. [10.6.1. The msItem and msItemStruct Elements]
contains an overview of the available information concerning some aspect of an item or object (for example, its intellectual content, history, layout, typography etc.) as a complement or alternative to the more detailed information carried by more specific elements. [10.6. Intellectual Content]
(physical description) contains a full physical description of a manuscript, manuscript part, or other object optionally subdivided using more specialized elements from the model.physDescPart class. [10.7. Physical Description]
(object description) contains a description of the physical components making up the object which is being described. [10.7.1. Object Description]
(form) a short project-specific name identifying the physical form of the carrier, for example as a codex, roll, fragment, partial leaf, cutting etc.
[^\p{C}\p{Z}]+
(support description) groups elements describing the physical support for the written part of a manuscript or other object. [10.7.1. Object Description]
(material) a short project-defined name for the material composing the majority of the support
Suggested values include: 1] paper; 2] parch (parchment); 3] mixed
paper
parch
(parchment)
mixed
[^\p{C}\p{Z}]+
(support) contains a description of the materials etc. which make up the physical support for the written part of a manuscript or other object. [10.7.1. Object Description]
(collation) contains a description of how the leaves, bifolia, or similar objects are physically arranged. [10.7.1. Object Description]
(foliation) describes the numbering system or systems used to count the leaves or pages in a codex or similar object. [10.7.1.4. Foliation]
(condition) contains a description of the physical condition of the manuscript or object. [10.7.1.5. Condition]
(layout description) collects the set of layout descriptions applicable to a manuscript or other object. [10.7.2. Writing, Decoration, and Other Notations]
(layout) describes how text is laid out on the page or surface of the object, including information about any ruling, pricking, or other evidence of page-preparation techniques. [10.7.2. Writing, Decoration, and Other Notations]
(columns) specifies the number of columns per page
(textual streams) indicates the number of streams per page, each of which contains an independent textual stream
(ruled lines) specifies the number of ruled lines per column
(written lines) specifies the number of written lines per column
(description of hands) contains a description of all the different hands used in a manuscript or other object. [10.7.2. Writing, Decoration, and Other Notations]
(hands) specifies the number of distinct hands identified within the manuscript
(typeface description) contains a description of the typefaces or other aspects of the printing of an incunable or other printed source. [10.7.2.1. Writing]
(typographic note) describes a particular font or other significant typographic feature distinguished within the description of a printed resource. [10.7.2. Writing, Decoration, and Other Notations]
(script description) contains a description of the scripts used in a manuscript or other object. [10.7.2.1. Writing]
(music notation) contains description of type of musical notation. [10.7.2. Writing, Decoration, and Other Notations]
(decoration description) contains a description of the decoration of a manuscript or other object, either as in paragraphs, or as one or more <decoNote>
elements. [10.7.3. Bindings, Seals, and Additional Material]
(note on decoration) contains a note describing either a decorative component of a manuscript or other object, or a fairly homogenous class of such components. [10.7.3. Bindings, Seals, and Additional Material]
(additions) contains a description of any significant additions found within a manuscript or other object, such as marginalia or other annotations. [10.7.2. Writing, Decoration, and Other Notations]
(binding description) describes the present and former bindings of a manuscript or other object, either as a series of paragraphs or as a series of distinct <binding>
elements, one for each binding of the manuscript. [10.7.3.1. Binding Descriptions]
(binding) contains a description of one binding, i.e. type of covering, boards, etc. applied to a manuscript or other object. [10.7.3.1. Binding Descriptions]
(contemporary) specifies whether or not the binding is contemporary with the majority of its contents
unknown
inapplicable
(seal description) describes the seals or similar items related to the object described, either as a series of paragraphs or as a series of <seal>
elements. [10.7.3.2. Seals]
(seal) contains a description of one seal or similar applied to the object described [10.7.3.2. Seals]
(contemporary) specifies whether or not the seal is contemporary with the item to which it is affixed
unknown
inapplicable
(accompanying material) contains details of any significant additional material which may be closely associated with the manuscript or object being described, such as non-contemporaneous documents or fragments bound in with it at some earlier historical period. [10.7.3.3. Accompanying Material]
(history) groups elements describing the full history of a manuscript, manuscript part, or other object. [10.8. History]
(origin) contains any descriptive or other information concerning the origin of a manuscript, manuscript part, or other object. [10.8. History]
(provenance) contains any descriptive or other information concerning a single identifiable episode during the history of a manuscript, manuscript part, or other object after its creation but before its acquisition. [10.8. History]
(acquisition) contains any descriptive or other information concerning the process by which a manuscript or manuscript part or other object entered the holding institution. [10.8. History]
(additional) groups additional information, combining bibliographic information about a manuscript or other object, or surrogate copies of it, with curatorial or administrative information. [10.9. Additional Information]
(administrative information) contains information about the present custody and availability of the manuscript or other object, and also about the record description itself. [10.9.1. Administrative Information]
(recorded history) provides information about the source and revision status of the parent manuscript or object description itself. [10.9.1. Administrative Information]
(source) describes the original source for the information contained with a manuscript or object description. [10.9.1.1. Record History]
(custodial history) contains a description of a manuscript or other object's custodial history, either as running prose or as a series of dated custodial events. [10.9.1.2. Availability and Custodial History]
(custodial event) describes a single event during the custodial history of a manuscript or other object. [10.9.1.2. Availability and Custodial History]
(surrogates) contains information about any representations of the manuscript or other object being described which may exist in the holding institution or elsewhere. [10.9. Additional Information]
(manuscript part) contains information about an originally distinct manuscript or part of a manuscript, which is now part of a composite manuscript. [10.10. Manuscript Parts]
(manuscript fragment) contains information about a fragment described in relation to a prior context, typically as a description of a virtual reconstruction of a manuscript or other object whose fragments were catalogued separately [10.11. Manuscript Fragments]
(facsimile) points to one or more images, portions of an image, or surfaces which correspond to the current element.
\S+
points to one or more <change>
elements documenting a state or revision campaign to which the element bearing this attribute and its children have been assigned by the encoder.
\S+
indicates the element within a transcription of the text containing at least the start of the writing represented by this zone or surface.
\S+
gives the x coordinate value for the upper left corner of a rectangular space.
(\-?[\d]+/\-?[\d]+)
gives the y coordinate value for the upper left corner of a rectangular space.
(\-?[\d]+/\-?[\d]+)
gives the x coordinate value for the lower right corner of a rectangular space.
(\-?[\d]+/\-?[\d]+)
gives the y coordinate value for the lower right corner of a rectangular space.
(\-?[\d]+/\-?[\d]+)
identifies a two dimensional area by means of a series of pairs of numbers, each of which gives the x,y coordinates of a point on a line enclosing the area.
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text. [11.1. Digital Facsimiles]
A facsimile element represents a text with images, thus
transcribed text should not be present within it.
contains a transcription or other representation of a single source document potentially forming part of a dossier génétique or collection of sources. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription]
defines a written surface as a two-dimensional coordinate space, optionally grouping one or more graphic representations of that space, zones of interest within that space, and transcriptions of the writing within them. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription]
describes the method by which this surface is or was connected to the main surface
Sample values include: 1] glued; 2] pinned; 3] sewn
[^\p{C}\p{Z}]+
indicates whether the surface is attached and folded in such a way as to provide two writing surfaces
(surface group) defines any kind of useful grouping of written surfaces, for example the recto and verso of a single leaf, which the encoder wishes to treat as a single unit. [11.1. Digital Facsimiles]
defines any two-dimensional area within a <surface>
element. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription]
indicates the amount by which this zone has been rotated clockwise, with respect to the normal orientation of the parent <surface>
element as implied by the dimensions given in the <msDesc>
element or by the coordinates of the <surface>
itself. The orientation is expressed in arc degrees.
(path) defines any line passing through two or more points within a <surface>
element. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription]
The first and
last elements of this path are the same. To specify a closed polygon, use
the zone element rather than the path element.
identifies a line within the container or bounding box specified by the parent element by means of a series of two or more pairs of numbers, each of which gives the x,y coordinates of a point on the line.
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)
(added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also <add>
). [11.3.1.4. Additions and Deletions]
The @spanTo attribute of is required.
L'attribut spanTo est requis.
(damage) contains an area of damage to the text witness. [11.3.3.1. Damage, Illegibility, and Supplied Text]
(damaged span of text) marks the beginning of a longer sequence of text which is damaged in some way but still legible. [11.3.3.1. Damage, Illegibility, and Supplied Text]
The @spanTo attribute of is required.
L'attribut spanTo est requis.
(deleted span of text) marks the beginning of a longer sequence of text deleted, marked as deleted, or otherwise signaled as superfluous or spurious by an author, scribe, annotator, or corrector. [11.3.1.4. Additions and Deletions]
The @spanTo attribute of is required.
L'attribut spanTo est requis.
(editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation. [11.3.1.2. Abbreviation and Expansion]
(forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page. [11.6. Headers, Footers, and Similar Matter]
classifies the material encoded according to some useful typology.
Sample values include: 1] header; 2] footer; 3] pageNum (page number); 4] lineNum (line number); 5] sig (signature); 6] catch (catchword)
[^\p{C}\p{Z}]+
contains one or more <handNote>
elements documenting the different hands identified within the source texts. [11.3.2.1. Document Hands]
(handwriting shift) marks the beginning of a sequence of text written in a new hand, or the beginning of a scribal stint. [11.3.2.1. Document Hands]
indicates a <handNote>
element describing the hand concerned.
\S+
(abbreviation marker) contains a sequence of letters or signs present in an abbreviation which are omitted or replaced in the expanded form of the abbreviation. [11.3.1.2. Abbreviation and Expansion]
(restore) indicates restoration of text to an earlier state by cancellation of an editorial or authorial marking or instruction. [11.3.1.6. Cancellation of Deletions and Other Markings]
(space) indicates the location of a significant space in the text. [11.4.1. Space]
(responsible party) (responsible party) indicates the individual responsible for identifying and measuring the space
\S+
(dimension) indicates whether the space is horizontal or vertical.
horizontal
the space is horizontal.
vertical
the space is vertical.
(substitution) groups one or more deletions (or surplus text) with one or more additions when the combination is to be regarded as a single intervention in the text. [11.3.1.5. Substitutions]
must have at least one child add and at least one child del or surplus
(substitution join) identifies a series of possibly fragmented additions, deletions, or other revisions on a manuscript that combine to make up a single intervention in the text [11.3.1.5. Substitutions]
(supplied) signifies text supplied by the transcriber or editor for any reason; for example because the original cannot be read due to physical damage, or because of an obvious omission by the author or scribe. [11.3.3.1. Damage, Illegibility, and Supplied Text]
one or more words indicating why the text has had to be supplied, e.g. overbinding, faded-ink, lost-folio, omitted-in-original.
[^\p{C}\p{Z}]+
(surplus) marks text present in the source which the editor believes to be superfluous or redundant. [11.3.3.1. Damage, Illegibility, and Supplied Text]
one or more words indicating why this text is believed to be superfluous, e.g. repeated, interpolated etc.
[^\p{C}\p{Z}]+
(secluded text) Secluded. Marks text present in the source which the editor believes to be genuine but out of its original place (which is unknown). [11.3.1.7. Text Omitted from or Supplied in the Transcription]
one or more words indicating why this text has been secluded, e.g. interpolated etc.
[^\p{C}\p{Z}]+
contains the transcription of a topographic line in the source document [11.2.2. Embedded Transcription]
supplies a list of transpositions, each of which is indicated at some point in a document typically by means of metamarks. [11.3.4.5. Transpositions]
contains or describes any kind of graphic or written signal within a document the function of which is to determine how it should be read rather than forming part of the actual content of the document. [11.3.4.2. Metamarks]
describes the function (for example status, insertion, deletion, transposition) of the metamark.
[^\p{C}\p{Z}]+
identifies one or more elements to which the metamark applies.
\S+
represents any kind of modification identified within a single document. [11.3.4.1. Generic Modification]
indicates one or more cancelled interventions in a document which have subsequently been marked as reaffirmed or repeated. [11.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications]
points to one or more elements representing the interventions which are being reasserted.
\S+
contains a sequence of writing which has been retraced, for example by over-inking, to clarify or fix it. [11.3.4.3. Fixation and Clarification]
describes a single textual transposition as an ordered list of at least two pointers specifying the order in which the elements indicated should be re-combined. [11.3.4.5. Transpositions]
indicates one or more marked-up interventions in a document which have subsequently been marked for cancellation. [11.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications]
points to one or more elements representing the interventions which are to be reverted or undone.
\S+
(witness or witnesses) contains a space-delimited list of one or more sigla indicating the witnesses to this reading beginning or ending at this point.
\S+
(witness or witnesses) contains a space-delimited list of one or more pointers indicating the witnesses which attest to a given reading.
\S+
classifies the reading according to some useful typology.
Sample values include: 1] substantive (substantive); 2] orthographic (orthographic)
[^\p{C}\p{Z}]+
classifies the cause for the variant reading, according to any appropriate typology of possible origins.
Sample values include: 1] homeoteleuton; 2] homeoarchy; 3] paleographicConfusion; 4] haplography; 5] dittography; 6] falseEmendation
[^\p{C}\p{Z}]+
(variant sequence) provides a number indicating the position of this reading in a sequence, when there is reason to presume a sequence to the variants.
points to other readings that are required when adopting the current reading or lemma.
\S+
(apparatus entry) contains one entry in a critical apparatus, with an optional lemma and usually one or more readings or notes on the relevant passage. [12.1.1. The Apparatus Entry]
classifies the variation contained in this element according to some convenient typology.
[^\p{C}\p{Z}]+
identifies the beginning of the lemma in the base text.
\S+
identifies the endpoint of the lemma in the base text.
\S+
(location) indicates the location of the variation, when the location-referenced method of apparatus markup is used.
[^\p{C}\p{Z}]+
(list of apparatus entries) contains a list of apparatus entries. [12.2. Linking the Apparatus to the Text]
(lemma) contains the lemma, or base text, of a textual variation. [12.1. The Apparatus Entry, Readings, and Witnesses]
(reading) contains a single reading within a textual variation. [12.1. The Apparatus Entry, Readings, and Witnesses]
(reading group) within a textual variation, groups two or more readings perceived to have a genetic relationship or other affinity. [12.1. The Apparatus Entry, Readings, and Witnesses]
Only one <lem> element may appear within a <rdgGrp>
(witness detail) gives further information about a particular witness, or witnesses, to a particular reading. [12.1. The Apparatus Entry, Readings, and Witnesses]
describes the type of information given about the witness.
[^\p{C}\p{Z}]+
(witnesses) indicates the sigil or sigla identifying the witness or witnesses to which the detail refers.
\S+
(wit) contains a list of one or more sigla of witnesses attesting a given reading, in a textual variation. [12.1.4. Witness Information]
(witness list) lists definitions for all the witnesses referred to by a critical apparatus, optionally grouped hierarchically. [12.1. The Apparatus Entry, Readings, and Witnesses]
(witness) contains either a description of a single witness referred to within the critical apparatus, or a list of witnesses which is to be referred to by a single sigil. [12.1. The Apparatus Entry, Readings, and Witnesses]
(fragmented witness start) indicates the beginning, or resumption, of the text of a fragmentary witness. [12.1.5. Fragmentary Witnesses]
(fragmented witness end) indicates the end, or suspension, of the text of a fragmentary witness. [12.1.5. Fragmentary Witnesses]
(lacuna start) indicates the beginning of a lacuna in the text of a mostly complete textual witness. [12.1.5. Fragmentary Witnesses]
(lacuna end) indicates the end of a lacuna in a mostly complete textual witness. [12.1.5. Fragmentary Witnesses]
(variant encoding) declares the method used to encode text-critical variants. [12.1.1. The Apparatus Entry]
indicates which method is used to encode the apparatus of variants.
location-referenced
apparatus uses line numbers or other canonical reference scheme referenced in a base text.
double-end-point
apparatus indicates the precise locations of the beginning and ending of each lemma relative to a base text.
parallel-segmentation
alternate readings of a passage are given in parallel in the text; no notion of a base text is necessary.
indicates whether the apparatus appears within the running text or external to it.
internal
apparatus appears within the running text.
external
apparatus appears outside the base text.
The @location value "external" is inconsistent with the
parallel-segmentation method of apparatus markup.
supplies the value of a date or time in some custom standard form.
[^\p{C}\p{Z}]+
specifies the earliest possible date for the event in some custom standard form.
[^\p{C}\p{Z}]+
specifies the latest possible date for the event in some custom standard form.
[^\p{C}\p{Z}]+
indicates the starting point of the period in some custom standard form.
[^\p{C}\p{Z}]+
indicates the ending point of the period in some custom standard form.
[^\p{C}\p{Z}]+
supplies a pointer to some location defining a named point in time with reference to which the datable item is understood to have occurred
\S+
supplies a pointer to a <calendar>
element or other means of interpreting the values of the custom dating attributes.
\S+
supplies the value of a date or time in a standard form.
[0-9.,DHMPRSTWYZ/:+\-]+
specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
indicates the starting point of the period in standard form.
[0-9.,DHMPRSTWYZ/:+\-]+
indicates the ending point of the period in standard form.
[0-9.,DHMPRSTWYZ/:+\-]+
(organization name) contains an organizational name. [13.2.2. Organizational Names]
(personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc. [13.2.1. Personal Names]
(surname) contains a family (inherited) name, as opposed to a given, baptismal, or nick name. [13.2.1. Personal Names]
(forename) contains a forename, given or baptismal name. [13.2.1. Personal Names]
(generational name component) contains a name component used to distinguish otherwise similar names on the basis of the relative ages or generations of the persons named. [13.2.1. Personal Names]
(name link) contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or of. [13.2.1. Personal Names]
(additional name) contains an additional name component, such as a nickname, epithet, or alias, or any other descriptive phrase used within a personal name. [13.2.1. Personal Names]
(role name) contains a name component which indicates that the referent has a particular role or position in society, such as an official title or rank. [13.2.1. Personal Names]
(place name) contains an absolute or relative place name. [13.2.3. Place Names]
(bloc) contains the name of a geo-political unit consisting of two or more nation states or countries. [13.2.3. Place Names]
(country) contains the name of a geo-political unit, such as a nation, country, colony, or commonwealth, larger than or administratively superior to a region and smaller than a bloc. [13.2.3. Place Names]
(region) contains the name of an administrative unit such as a state, province, or county, larger than a settlement, but smaller than a country. [13.2.3. Place Names]
(settlement) contains the name of a settlement such as a city, town, or village identified as a single geo-political or administrative unit. [13.2.3. Place Names]
(district) contains the name of any kind of subdivision of a settlement, such as a parish, ward, or other administrative or geographic unit. [13.2.3. Place Names]
(offset) marks that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or times involved in the expression. [13.2.3. Place Names]
(geographical name) identifies a name associated with some geographical feature such as Windrush Valley or Mount Sinai. [13.2.3. Place Names]
(geographical feature name) contains a common noun identifying some geographical feature contained within a geographic name, such as valley, mount, etc. [13.2.3. Place Names]
(affiliation) contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] sponsor; 2] recommend; 3] discredit; 4] pledged
[^\p{C}\p{Z}]+
(age) specifies the age of a person. [13.3.2.1. Personal Characteristics]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] western; 2] sui; 3] subjective; 4] objective; 5] inWorld (in world); 6] chronological; 7] biological; 8] psychological; 9] functional
[^\p{C}\p{Z}]+
supplies a numeric code representing the age or age group
(birth) contains information about a person's birth, such as its date and place. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] caesarean (caesarean section); 2] vaginal (vaginal delivery); 3] exNihilo (ex nihilo); 4] incorporated; 5] founded; 6] established
[^\p{C}\p{Z}]+
(climate) contains information about the physical climate of a place. [13.3.4.3. States, Traits, and Events]
(death) contains information about a person's death, such as its date and place. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] proclaimed; 2] assumed; 3] verified; 4] clinical; 5] brain; 6] natural; 7] unnatural; 8] fragmentation; 9] dissolution
[^\p{C}\p{Z}]+
(education) contains a description of the educational experience of a person. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] primary; 2] secondary; 3] undergraduate; 4] graduate; 5] residency; 6] apprenticeship
[^\p{C}\p{Z}]+
(event) contains data relating to any kind of significant event associated with a person, place, or organization. [13.3.1. Basic Principles]
(faith) specifies the faith, religion, or belief set of a person. [13.3.2.1. Personal Characteristics]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] practicing; 2] clandestine; 3] patrilineal; 4] matrilineal; 5] convert
[^\p{C}\p{Z}]+
(floruit) contains information about a person's period of activity. [13.3.2.1. Personal Characteristics]
(gender) specifies the gender identity of a person, persona, or character. [13.3.2.1. Personal Characteristics]
supplies a coded value for gender identity
[^\p{C}\p{Z}]+
(geographical coordinates) contains any expression of a set of geographic coordinates, representing a point, line, or area on the surface of the earth in some notation. [13.3.4.1. Varieties of Location]
(language knowledge) summarizes the state of a person's linguistic knowledge, either as prose or by a list of <langKnown>
elements. [13.3.2.1. Personal Characteristics]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] listening; 2] speaking; 3] reading; 4] writing
[^\p{C}\p{Z}]+
supplies one or more valid language tags for the languages specified
(language known) summarizes the state of a person's linguistic competence, i.e., knowledge of a single language. [15.2.2. The Participant Description]
supplies a valid language tag for the language concerned.
a code indicating the person's level of knowledge for this language
[^\p{C}\p{Z}]+
(list of organizations) contains a list of elements, each of which provides information about an identifiable organization. [13.2.2. Organizational Names]
(list of events) contains a list of descriptions, each of which provides information about an identifiable event. [13.3.1. Basic Principles]
(list of persons) contains a list of descriptions, each of which provides information about an identifiable person or a group of people, for example the participants in a language interaction, or the people referred to in a historical source. [13.3.2. The Person Element 15.2. Contextual Information 2.4. The Profile Description 15.3.2. Declarable Elements]
(list of places) contains a list of places, optionally followed by a list of relationships (other than containment) defined amongst them. [2.2.7. The Source Description 13.3.4. Places]
provides information about relationships identified amongst people, places, and organizations, either informally as prose or as formally expressed relation links. [13.3.2.3. Personal Relationships]
(location) defines the location of a place as a set of geographical coordinates, in terms of other named geo-political entities, or as an address. [13.3.4. Places]
(nationality) contains an informal description of a person's present or past nationality or citizenship. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] birth; 2] naturalised; 3] self-assigned
[^\p{C}\p{Z}]+
(occupation) contains an informal description of a person's trade, profession or occupation. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] primary; 2] other; 3] paid; 4] unpaid
[^\p{C}\p{Z}]+
indicates the classification system or taxonomy in use, for example by supplying the identifier of a <taxonomy>
element, or pointing to some other resource.
\S+
identifies an occupation code defined within the classification system or taxonomy defined by the @scheme
attribute.
\S+
(organization) provides information about an identifiable organization such as a business, a tribe, or any other grouping of people. [13.3.3. Organizational Data]
specifies a primary role or classification for the organization.
[^\p{C}\p{Z}]+
(person) provides information about an identifiable individual, for example a participant in a language interaction, or a person referred to in a historical source. [13.3.2. The Person Element 15.2.2. The Participant Description]
specifies a primary role or classification for the person.
[^\p{C}\p{Z}]+
specifies the sex of the person.
[^\p{C}\p{Z}]+
specifies the gender of the person.
[^\p{C}\p{Z}]+
specifies an age group for the person.
[^\p{C}\p{Z}]+
provides information about one of the personalities identified for a given individual, where an individual has multiple personalities. [13.3.2. The Person Element]
specifies a primary role or classification for the persona.
[^\p{C}\p{Z}]+
specifies the sex of the persona.
[^\p{C}\p{Z}]+
specifies the gender of the persona.
[^\p{C}\p{Z}]+
specifies an age group for the persona.
[^\p{C}\p{Z}]+
(personal group) describes a group of individuals treated as a single person for analytic purposes. [15.2.2. The Participant Description]
specifies the role of this group of participants in the interaction.
[^\p{C}\p{Z}]+
specifies the sex of the participant group.
[^\p{C}\p{Z}]+
specifies the gender of the participant group.
[^\p{C}\p{Z}]+
specifies the age group of the participants.
[^\p{C}\p{Z}]+
describes informally the size or approximate size of the group for example by means of a number and an indication of accuracy e.g. approx 200.
[^\p{C}\p{Z}]+
(personal pronouns) indicates the personal pronouns used, or assumed to be used, by the individual being described. [13.3.2.1. Personal Characteristics]
(evidence) indicates support for the listed personal pronouns.
Suggested values include: 1] conjecture (conjecture); 2] selfIdentification (self identification); 3] trustedThirdParty (trusted third party)
conjecture
(conjecture) The given value was selected based on assumptions by someone besides the person to whom this pronoun applies. As a result, the value may be erroneous.
selfIdentification
(self identification) The given value has been explicitly stated or confirmed by the person to whom this pronoun applies.
trustedThirdParty
(trusted third party) The given value has been supplied by another individual trusted by the encoder to know the preferences of the person to whom this pronoun applies.
[^\p{C}\p{Z}]+
(value) supplies a regularized value for personal pronouns.
Sample values include: 1] e (e); 2] he (he); 3] she (she); 4] they (they)
[^\p{C}\p{Z}]+
(place) contains data about a geographic location [13.3.4. Places]
(population) contains information about the population of a place. [13.3.4.3. States, Traits, and Events]
(relationship) describes any kind of relationship or linkage amongst a specified group of places, events, persons, objects or other items. [13.3.2.3. Personal Relationships]
One of the attributes 'name', 'ref' or 'key' must be supplied
Only one of the attributes @active and @mutual may be supplied
the attribute 'passive' may be supplied only if the attribute 'active' is supplied
supplies a name for the kind of relationship of which this is an instance.
[^\p{C}\p{Z}]+
identifies the active participants in a non-mutual relationship, or all the participants in a mutual one.
\S+
supplies a list of participants amongst all of whom the relationship holds equally.
\S+
identifies the passive participants in a non-mutual relationship.
\S+
(residence) describes a person's present or past places of residence. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] primary; 2] secondary; 3] temporary; 4] permanent
[^\p{C}\p{Z}]+
(sex) specifies the sex of an organism. [13.3.2.1. Personal Characteristics]
supplies a coded value for sex
[^\p{C}\p{Z}]+
(socio-economic status) contains an informal description of a person's perceived social or economic status. [15.2.2. The Participant Description]
characterizes the element in some sense, using any convenient classification scheme or typology.
Sample values include: 1] atBirth; 2] atDeath; 3] dependent; 4] inherited; 5] independent
[^\p{C}\p{Z}]+
identifies the classification system or taxonomy in use, for example by pointing to a locally-defined <taxonomy>
element or by supplying a URI for an externally-defined system.
\S+
identifies a status code defined within the classification system or taxonomy defined by the @scheme
attribute.
\S+
(state) contains a description of some status or quality attributed to a person, place, or organization often at some specific time or for a specific date range. [13.3.1. Basic Principles 13.3.2.1. Personal Characteristics]
(terrain) contains information about the physical terrain of a place. [13.3.4.3. States, Traits, and Events]
(trait) contains a description of some status or quality attributed to a person, place, or organization typically, but not necessarily, independent of the volition or action of the holder and usually not at some specific time or for a specific date range. [13.3.1. Basic Principles 13.3.2.1. Personal Characteristics]
(name of an object) contains a proper noun or noun phrase used to refer to an object. [13.2.4. Object Names]
contains a description of a single identifiable physical object. [13.3.5. Objects]
(list of objects) contains a list of descriptions, each of which provides information about an identifiable physical object. [13.3.5. Objects]
(object identifier) groups one or more identifiers or pieces of locating information concerning a single object. [13.3.5. Objects]
(canonical name) contains the definition for a canonical name or name component of any kind. [13.3.6. Names and Nyms]
points to constituent nyms
\S+
(list of canonical names) contains a list of nyms, that is, standardized names for any thing. [13.3.6. Names and Nyms]
(role) indicates the kind of information held in this cell or in each cell of this row.
Suggested values include: 1] label; 2] data
label
labelling or descriptive information only.
data
data values.
[^\p{C}\p{Z}]+
(rows) indicates the number of rows occupied by this cell or row.
(columns) indicates the number of columns occupied by this cell or row.
(table) contains text displayed in tabular form, in rows and columns. [14.1.1. TEI Tables]
(rows) indicates the number of rows in the table.
(columns) indicates the number of columns in each row of the table.
(row) contains one row of a table. [14.1.1. TEI Tables]
(cell) contains one cell of a table. [14.1.1. TEI Tables]
(formula) contains a mathematical or other formula. [14.2. Formulæ and Mathematical Expressions]
encodes the presence of music notation in a text [14.3. Notated Music in Written Text]
(figure) groups elements representing or containing graphic information such as an illustration, formula, or figure. [14.4. Specific Elements for Graphic Images]
(description of figure) contains a brief prose description of the appearance or content of a graphic figure, for use when documenting an image without displaying it. [14.4. Specific Elements for Graphic Images]
(text description) provides a description of a text in terms of its situational parameters. [15.2.1. The Text Description]
(participation description) describes the identifiable speakers, voices, or other participants in any kind of text or other persons named or otherwise referred to in a text, edition, or metadata. [15.2. Contextual Information]
(setting description) describes the setting or settings within which a language interaction takes place, or other places otherwise referred to in a text, edition, or metadata. [15.2. Contextual Information 2.4. The Profile Description]
(primary channel) describes the medium or channel by which a text is delivered or experienced. For a written text, this might be print, manuscript, email, etc.; for a spoken one, radio, telephone, face-to-face, etc. [15.2.1. The Text Description]
specifies the mode of this channel with respect to speech and writing.
s
(spoken)
w
(written)
sw
(spoken to be written) e.g. dictation
ws
(written to be spoken) e.g. a script
m
(mixed)
x
(unknown or inapplicable)
(constitution) describes the internal composition of a text or text sample, for example as fragmentary, complete, etc. [15.2.1. The Text Description]
specifies how the text was constituted.
single
a single complete text
composite
a text made by combining several smaller items, each individually complete
frags
(fragments) a text made by combining several smaller, not necessarily complete, items
unknown
composition unknown or unspecified
(derivation) describes the nature and extent of originality of this text. [15.2.1. The Text Description]
categorizes the derivation of the text.
Sample values include: 1] original; 2] revision; 3] translation; 4] abridgment; 5] plagiarism; 6] traditional
[^\p{C}\p{Z}]+
(domain of use) describes the most important social context in which the text was realized or for which it is intended, for example private vs. public, education, religion, etc. [15.2.1. The Text Description]
categorizes the domain of use.
Sample values include: 1] art; 2] domestic; 3] religious; 4] business; 5] education (education); 6] govt (government); 7] public
[^\p{C}\p{Z}]+
(factuality) describes the extent to which the text may be regarded as imaginative or non-imaginative, that is, as describing a fictional or a non-fictional world. [15.2.1. The Text Description]
categorizes the factuality of the text.
fiction
the text is to be regarded as entirely imaginative
fact
the text is to be regarded as entirely informative or factual
mixed
the text contains a mixture of fact and fiction
inapplicable
the fiction/fact distinction is not regarded as helpful or appropriate to this text
(interaction) describes the extent, cardinality and nature of any interaction among those producing and experiencing the text, for example in the form of response or interjection, commentary, etc. [15.2.1. The Text Description]
specifies the degree of interaction between active and passive participants in the text.
none
no interaction of any kind, e.g. a monologue
partial
some degree of interaction, e.g. a monologue with set responses
complete
complete interaction, e.g. a face to face conversation
inapplicable
this parameter is inappropriate or inapplicable in this case
specifies the number of active participants (or addressors) producing parts of the text.
Suggested values include: 1] singular; 2] plural; 3] corporate; 4] unknown
singular
a single addressor
plural
many addressors
corporate
a corporate addressor
unknown
number of addressors unknown or unspecifiable
[^\p{C}\p{Z}]+
specifies the number of passive participants (or addressees) to whom a text is directed or in whose presence it is created or performed.
Suggested values include: 1] self; 2] single; 3] many; 4] group; 5] world
self
text is addressed to the originator e.g. a diary
single
text is addressed to one other person e.g. a personal letter
many
text is addressed to a countable number of others e.g. a conversation in which all participants are identified
group
text is addressed to an undefined but fixed number of participants e.g. a lecture
world
text is addressed to an undefined and indeterminately large number e.g. a published book
[^\p{C}\p{Z}]+
(preparedness) describes the extent to which a text may be regarded as prepared or spontaneous. [15.2.1. The Text Description]
a keyword characterizing the type of preparedness.
Sample values include: 1] none; 2] scripted; 3] formulaic; 4] revised
[^\p{C}\p{Z}]+
characterizes a single purpose or communicative function of the text. [15.2.1. The Text Description]
specifies a particular kind of purpose.
Suggested values include: 1] persuade; 2] express; 3] inform; 4] entertain
persuade
didactic, advertising, propaganda, etc.
express
self expression, confessional, etc.
inform
convey information, educate, etc.
entertain
amuse, entertain, etc.
[^\p{C}\p{Z}]+
specifies the extent to which this purpose predominates.
high
medium
low
unknown
describes one particular setting in which a language interaction takes place. [15.2.3. The Setting Description]
contains a brief informal description of the kind of place concerned, for example: a room, a restaurant, a park bench, etc. [15.2.3. The Setting Description]
(activity) contains a brief informal description of what a participant in a language interaction is doing other than speaking, if anything. [15.2.3. The Setting Description]
(corresponds) points to elements that correspond to the current element in some way.
\S+
(synchronous) points to elements that are synchronous with the current element.
\S+
points to an element that is the same as the current element.
\S+
points to an element of which the current element is a copy.
\S+
points to the next element of a virtual aggregate of which the current element is part.
\S+
(previous) points to the previous element of a virtual aggregate of which the current element is part.
\S+
points to elements that are in exclusive alternation with the current element.
\S+
selects one or more alternants; if one alternant is selected, the ambiguity or uncertainty is marked as resolved. If more than one alternant is selected, the degree of ambiguity or uncertainty is marked as reduced by the number of alternants not selected.
\S+
(link) defines an association or hypertextual link among elements or passages, of some type not more precisely specifiable by other elements. [16.1. Links]
You must supply at least two values for @target or on
(link group) defines a collection of associations or hypertextual links. [16.1. Links]
(anonymous block) contains any component-level unit of text, acting as a container for phrase or inter level elements analogous to, but without the same constraints as, a paragraph. [16.3. Blocks, Segments, and Anchors]
Abstract model violation: Lines may not contain higher-level divisions such as p or ab, unless ab is a child of figure or note, or is a descendant of floatingText.
(anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element. [8.4.2. Synchronization and Overlap 16.5. Correspondence and Alignment]
(arbitrary segment) represents any segmentation of text below the chunk level. [16.3. Blocks, Segments, and Anchors 6.2. Components of the Verse Line 7.2.5. Speech Contents]
indicates a point in time either relative to other elements in the same timeline tag, or absolutely. [16.4.2. Placing Synchronous Events in Time]
supplies an absolute value for the time.
specifies the unit of time in which the @interval
value is expressed, if this is not inherited from the parent <timeline>
.
Suggested values include: 1] d (days); 2] h (hours); 3] min (minutes); 4] s (seconds); 5] ms (milliseconds)
d
(days)
h
(hours)
min
(minutes)
s
(seconds)
ms
(milliseconds)
[^\p{C}\p{Z}]+
specifies a time interval either as a number or as one of the keywords defined by the datatype teidata.interval
regular
irregular
unknown
identifies the reference point for determining the time of the current <when>
element, which is obtained by adding the interval to the time of the reference point.
\S+
(timeline) provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text. [16.4.2. Placing Synchronous Events in Time]
designates the origin of the timeline, i.e. the time at which it begins.
\S+
specifies the unit of time corresponding to the @interval
value of the timeline or of its constituent points in time.
Suggested values include: 1] d (days); 2] h (hours); 3] min (minutes); 4] s (seconds); 5] ms (milliseconds)
d
(days)
h
(hours)
min
(minutes)
s
(seconds)
ms
(milliseconds)
[^\p{C}\p{Z}]+
specifies a time interval either as a positive integral value or using one of a set of predefined codes.
regular
irregular
unknown
(join) identifies a possibly fragmented segment of text, by pointing at the possibly discontiguous elements which compose it. [16.7. Aggregation]
You must supply at least two values for @target on
specifies the name of an element which this aggregation may be understood to represent.
indicates whether the targets to be joined include the entire element indicated (the entire subtree including its root), or just the children of the target (the branches of the subtree).
root
the rooted subtrees indicated by the @target
attribute are joined, each subtree become a child of the virtual element created by the join
branches
the children of the subtrees indicated by the @target
attribute become the children of the virtual element (i.e. the roots of the subtrees are discarded)
(join group) groups a collection of <join>
elements and possibly pointers. [16.7. Aggregation]
supplies the default value for the @result
on each <join>
included within the group.
(alternation) identifies an alternation or a set of choices among elements or passages. [16.8. Alternation]
specifies the destination of the reference by supplying one or more URI References
\S+
\S+
\S+
states whether the alternations gathered in this collection are exclusive or inclusive.
excl
(exclusive) indicates that the alternation is exclusive, i.e. that at most one of the alternatives occurs.
incl
(inclusive) indicates that the alternation is not exclusive, i.e. that one or more of the alternatives occur.
If @mode
is excl, each weight states the probability that the corresponding alternative occurs. If @mode
is incl each weight states the probability that the corresponding alternative occurs given that at least one of the other alternatives occurs.
(alternation group) groups a collection of <alt>
elements and possibly pointers. [16.8. Alternation]
states whether the alternations gathered in this collection are exclusive or inclusive.
excl
(exclusive) indicates that the alternation is exclusive, i.e. that at most one of the alternatives occurs.
incl
(inclusive) indicates that the alternation is not exclusive, i.e. that one or more of the alternatives occur.
Functions as a container element for linked data, contextual information, and stand-off annotations embedded in a TEI document. [16.10. The standOff Container]
This
element must have a @type attribute, since it is
nested inside a
contains a list of annotations, typically encoded as <annotation>
, <annotationBlock>
, or <note>
, possibly organized with nested <listAnnotation>
elements. [16.10. The standOff Container]
represents an annotation following the Web
Annotation Data Model. [16.10. The standOff Container]
(identifier) provides a unique identifier for the element bearing the attribute.
specifies the destination of the reference by supplying one or more URI References
\S+
assessing
intent is to assess the target resource in some way, rather than simply make a comment about it
bookmarking
intent is to create a bookmark to the target or part thereof
classifying
intent is to classify the target in some way
commenting
intent is to comment about the target
describing
intent is to describe the target, rather than (for example) comment on it
editing
intent is to request an edit or a change to the target resource
highlighting
intent is to highlight the target resource or a segment thereof
identifying
intent is to assign an identity to the target
linking
intent is to link to a resource related to the target
moderating
intent is to assign some value or quality to the target
questioning
intent is to ask a question about the target
replying
intent is to reply to a previous statement, either an annotation or another resource
tagging
intent is to associate a tag with the target
(feature system declaration) provides a feature system declaration comprising one or more feature structure declarations or feature structure declaration links. [18.11. Feature System Declaration]
(feature structure declaration) declares one type of feature structure. [18.11. Feature System Declaration]
gives a name for the type of feature structure being declared.
[^\p{C}\p{Z}]+
gives the name of one or more typed feature structures from which this type inherits feature specifications and constraints; if this type includes a feature specification with the same name as that of any of those specified by this attribute, or if more than one specification of the same name is inherited, then the set of possible values is defined by unification. Similarly, the set of constraints applicable is derived by combining those specified explicitly within this element with those implied by the @baseTypes
attribute. When no @baseTypes
attribute is specified, no feature specification or constraint is inherited.
(feature system description (in FSD)) describes in prose what is represented by the type of feature structure declared in the enclosing fsDecl. [18.11. Feature System Declaration]
(feature structure declaration link) associates the name of a typed feature structure with a feature structure declaration for it. [18.11. Feature System Declaration]
identifies the type of feature structure to be documented; this will be the value of the @type
attribute on at least one feature structure.
[^\p{C}\p{Z}]+
supplies a pointer to a feature structure declaration (<fsDecl>
) element within the current document or elsewhere.
\S+
(feature declaration) declares a single feature, specifying its name, organization, range of allowed values, and optionally its default value. [18.11. Feature System Declaration]
a single word which follows the rules defining a legal XML name (see ), indicating the name of the feature being declared; matches the @name
attribute of <f>
elements in the text.
indicates whether or not the value of this feature may be present.
(feature description (in FSD)) describes in prose what is represented by the feature being declared and its values. [18.11. Feature System Declaration]
(value range) defines the range of allowed values for a feature, in the form of an <fs>
, <vAlt>
, or primitive value; for the value of an <f>
to be valid, it must be subsumed by the specified range; if the <f>
contains multiple values (as sanctioned by the @org
attribute), then each value must be subsumed by the <vRange>
. [18.11. Feature System Declaration]
(value default) declares the default value to be supplied when a feature structure does not contain an instance of <f>
for this name; if unconditional, it is specified as one (or, depending on the value of the @org
attribute of the enclosing <fDecl>
) more <fs>
elements or primitive values; if conditional, it is specified as one or more <if>
elements; if no default is specified, or no condition matches, the value none is assumed. [18.11. Feature System Declaration]
defines a conditional default value for a feature; the condition is specified as a feature structure, and is met if it subsumes the feature structure in the text for which a default value is sought. [18.11. Feature System Declaration]
separates the condition from the default in an <if>
, or the antecedent and the consequent in a <cond>
element. [18.11. Feature System Declaration]
(feature-structure constraints) specifies constraints on the content of valid feature structures. [18.11. Feature System Declaration]
(conditional feature-structure constraint) defines a conditional feature-structure constraint; the consequent and the antecedent are specified as feature structures or feature-structure collections; the constraint is satisfied if both the antecedent and the consequent subsume a given feature structure, or if the antecedent does not. [18.11. Feature System Declaration]
(bi-conditional feature-structure constraint) defines a biconditional feature-structure constraint; both consequent and antecedent are specified as feature structures or groups of feature structures; the constraint is satisfied if both subsume a given feature structure, or if both do not. [18.11. Feature System Declaration]
(if and only if) separates the condition from the consequence in a <bicond>
element. [18.11. Feature System Declaration]
(feature structure) represents a feature structure, that is, a collection of feature-value pairs organized as a structural unit. [18.2. Elementary Feature Structures and the Binary
Feature Value]
specifies the type of the feature structure.
[^\p{C}\p{Z}]+
(features) references the feature-value specifications making up this feature structure.
\S+
(feature) represents a feature value specification, that is, the association of a name with a value of any of several different types. [18.2. Elementary Feature Structures and the Binary
Feature Value]
a single word which follows the rules defining a legal XML name (see ), providing a name for the feature.
(feature value) references any element which can be used to represent the value of a feature.
\S+
(binary value) represents the value part of a feature-value specification which can contain either of exactly two possible values. [18.2. Elementary Feature Structures and the Binary
Feature Value]
supplies a binary value.
(symbolic value) represents the value part of a feature-value specification which contains one of a finite list of symbols. [18.3. Other Atomic Feature Values]
supplies a symbolic value for the feature, one of a finite list that may be specified in a feature declaration.
[^\p{C}\p{Z}]+
(numeric value) represents the value part of a feature-value specification which contains a numeric value or range. [18.3. Other Atomic Feature Values]
supplies a lower bound for the numeric value represented, and also (if @max
is not supplied) its upper bound.
(\-?[\d]+/\-?[\d]+)
supplies an upper bound for the numeric value represented.
(\-?[\d]+/\-?[\d]+)
specifies whether the value represented should be truncated to give an integer value.
(string value) represents the value part of a feature-value specification which contains a string. [18.3. Other Atomic Feature Values]
(value label) represents the value part of a feature-value specification which appears at more than one point in a feature structure. [18.6. Re-entrant Feature Structures]
supplies a name identifying the sharing point.
[^\p{C}\p{Z}]+
(collection of values) represents the value part of a feature-value specification which contains multiple values organized as a set, bag, or list. [18.7. Collections as Complex Feature Values]
(organization) indicates organization of given value or values as set, bag or list.
set
indicates that the given values are organized as a set.
bag
indicates that the given values are organized as a bag (multiset).
list
indicates that the given values are organized as a list.
(default feature value) represents the value part of a feature-value specification which contains a defaulted value. [18.9. Default Values]
(value alternation) represents the value part of a feature-value specification which contains a set of values, only one of which can be valid. [18.8.1. Alternation]
(value negation) represents a feature value which is the negation of its content. [18.8.2. Negation]
(merged collection of values) represents a feature value which is the result of merging together the feature values contained by its children, using the organization specified by the @org
attribute. [18.8.3. Collection of Values]
indicates the organization of the resulting merged values as set, bag or list.
set
indicates that the resulting values are organized as a set.
bag
indicates that the resulting values are organized as a bag (multiset).
list
indicates that the resulting values are organized as a list.
(feature library) assembles a library of <f>
(feature) elements. [18.4. Feature Libraries and Feature-Value Libraries]
(feature-value library) assembles a library of reusable feature value elements (including complete feature structures). [18.4. Feature Libraries and Feature-Value Libraries]
(graph) encodes a graph, which is a collection of nodes, and arcs which connect the nodes. [19.1. Graphs and Digraphs]
describes the type of graph.
Suggested values include: 1] undirected; 2] directed; 3] transitionNetwork; 4] transducer
undirected
undirected graph
directed
directed graph
transitionNetwork
a directed graph with distinguished initial and final nodes
transducer
a transition network with up to two labels on each arc
[^\p{C}\p{Z}]+
states the order of the graph, i.e., the number of its nodes.
states the size of the graph, i.e., the number of its arcs.
(node) encodes a node, a possibly labeled point in a graph. [19.1. Graphs and Digraphs]
provides a type for a node.
Suggested values include: 1] initial; 2] final
initial
initial node in a transition network
final
final node in a transition network
[^\p{C}\p{Z}]+
provides the value of a node, which is a feature structure or other analytic element.
\S+
(adjacent to) gives the identifiers of the nodes which are adjacent to the current node.
\S+
(adjacent from) gives the identifiers of the nodes which are adjacent from the current node.
\S+
(adjacent) gives the identifiers of the nodes which are both adjacent to and adjacent from the current node.
\S+
gives the in degree of the node, the number of nodes which are adjacent from the given node.
gives the out degree of the node, the number of nodes which are adjacent to the given node.
gives the degree of the node, the number of arcs with which the node is incident.
(arc) encodes an arc, the connection from one node to another in a graph. [19.1. Graphs and Digraphs]
gives the identifier of the node which is adjacent from this arc.
\S+
gives the identifier of the node which is adjacent to this arc.
\S+
(tree) encodes a tree, which is made up of a root, internal nodes, leaves, and arcs from root to leaves. [19.2. Trees]
gives the maximum number of children of the root and internal nodes of the tree.
(ordered) indicates whether or not the tree is ordered, or if it is partially ordered.
true
indicates that all of the branching nodes of the tree are ordered.
partial
indicates that some of the branching nodes of the tree are ordered and some are unordered.
false
indicates that all of the branching nodes of the tree are unordered.
gives the order of the tree, i.e., the number of its nodes.
(root node) represents the root node of a tree. [19.2. Trees]
identifies the root node of the network by pointing to a feature structure or other analytic element.
\S+
identifies the elements which are the children of the root node.
\S+
(ordered) indicates whether or not the root is ordered.
unknown
inapplicable
gives the out degree of the root, the number of its children.
(intermediate (or internal) node) represents an intermediate (or internal) node of a tree. [19.2. Trees]
indicates an intermediate node, which is a feature structure or other analytic element.
\S+
provides a list of identifiers of the elements which are the children of the intermediate node.
\S+
provides the identifier of the element which is the parent of this node.
\S+
(ordered) indicates whether or not the internal node is ordered.
unknown
inapplicable
provides the identifier of an element which this node follows.
\S+
gives the out degree of an intermediate node, the number of its children.
(leaf) encodes the leaves (terminal nodes) of a tree. [19.2. Trees]
provides a pointer to a feature structure or other analytic element.
\S+
provides the identifier of parent of a leaf.
\S+
provides an identifier of an element which this leaf follows.
\S+
(embedding tree) provides an alternative to the <tree>
element for representing ordered rooted tree structures. [19.3. Another Tree Notation]
provides the value of an embedding tree, which is a feature structure or other analytic element.
\S+
(underspecified embedding tree, so called because of its
characteristic shape when drawn) provides for an underspecified <eTree>
, that is, an <eTree>
with information left out. [19.3. Another Tree Notation]
supplies a value for the triangle, in the form of the identifier of a feature structure or other analytic element.
\S+
(leaf or terminal node of an embedding tree) provides explicitly for a leaf of an embedding tree, which may also be encoded with the <eTree>
element. [19.3. Another Tree Notation]
indicates the value of an embedding leaf, which is a feature structure or other analytic element.
\S+
(forest) provides for groups of rooted trees. [19.3. Another Tree Notation]
provides for lists of forests. [19.3. Another Tree Notation]
identifies the type of the forest group.
[^\p{C}\p{Z}]+
indicates the numerical accuracy or precision associated with some aspect of the text markup. [21.2. Indications of Precision]
characterizes the precision of the element or attribute pointed to by the <precision>
element.
high
medium
low
unknown
supplies a standard deviation associated with the value in question
(\-?[\d]+/\-?[\d]+)
indicates the degree of certainty associated with some aspect of the text markup. [21.1.2. Structured Indications of Uncertainty]
(certainty) signifies the degree of certainty associated with the object pointed to by the <certainty>
element.
high
medium
low
unknown
characterizes the element in some sense, using any convenient classification scheme or typology; sample categorization of annotations of uncertainty might use following values:
Sample values include: 1] ignorance; 2] incompleteness; 3] credibility; 4] imprecision
[^\p{C}\p{Z}]+
indicates more exactly the aspect concerning which certainty is being expressed: specifically, whether the markup is correctly located, whether the correct element or attribute name has been used, or whether the content of the element or attribute is correct, etc.
name
uncertainty concerns whether the name of the element or attribute used is correctly applied.
start
uncertainty concerns whether the start of the element is correctly identified.
end
uncertainty concerns whether the end of the element is correctly identified.
location
uncertainty concerns both the start and the end of the element.
value
uncertainty concerns the content (for an element) or the value (for an attribute)
provides an alternative value for the aspect of the markup in question—an alternative generic identifier, transcription, or attribute value, or the identifier of an <anchor>
element (to indicate an alternative starting or ending location). If an @assertedValue
is given, the confidence level specified by @degree
applies to the alternative markup specified by @assertedValue
; if none is given, it applies to the markup in the text.
[^\p{C}\p{Z}]+
indicates one or more element(s) characterizing the conditions which are assumed in the assignment of a degree of confidence.
\S+
indicates the degree of confidence assigned to the aspect of the markup named by the @locus
attribute.
(responsibility) identifies the individual(s) responsible for some aspect of the content or markup of particular element(s). [21.3. Attribution of Responsibility]
indicates the specific aspect of the encoding (markup or content) for which responsibility is being assigned.
name
responsibility is being assigned concerning the name of the element or attribute used.
start
responsibility is being assigned concerning the start of the element concerned.
end
responsibility is being assigned concerning the end of the element concerned.
location
responsibility is being assigned concerning the location of the element concerned.
value
responsibility is being assigned concerning the content (for an element) or the value (for an attribute)
specifies the date on which the source text was extracted and sent to the translator
(19[789][0-9]|[2-9][0-9]{3}).*
(19[789][0-9]|[2-9][0-9]{3}).*
the condition under which the element bearing this attribute applies, given as an XPath predicate expression.
(minimum number of occurences) indicates the smallest number of times this component may occur.
(maximum number of occurences) indicates the largest number of times this component may occur.
unbounded
@maxOccurs should be greater than or equal to @minOccurs
When @maxOccurs is not specified, @minOccurs must be 0 or 1
specifies the effect of this declaration on its parent object.
add
this declaration is part of the current definitions.
delete
the object with the same identifier (and namespace in the case of an element or attribute) as the current one is no longer part of the current definitions
change
this declaration changes the object of the same identifier (and namespace in the case of an element or attribute) in the current definition
replace
this declaration replaces the object with the same identifier (and namespace in the case of an element or attribute) in the current definition.
supplies the identifier by which this element may be referenced.
says whether this object should be predeclared in the tei infrastructure module.
supplies a name for the module in which this object is to be declared.
Specification : the value of the module attribute ("")
should correspond to an existing module, via a moduleSpec or
moduleRef
provides a date before which the construct being defined will not be removed.
A deprecated construct should include, whenever possible, an explanation, but this does not have a child <desc type="deprecationInfo">
(namespace) specifies the namespace to which the element(s) being specified belongs.
\S*
(attribute) contains the name of an attribute appearing within running text. [22. Documentation Elements]
(scheme) supplies an identifier for the scheme in which this name is defined.
Sample values include: 1] TEI (Text Encoding Initiative); 2] DBK (Docbook); 3] XX (unknown); 4] imaginary (imaginary); 5] XHTML (XHTML); 6] XML (XML); 7] XI (XI)
[^\p{C}\p{Z}]+
contains literal code from some formal language such as a programming language. [22.1.1. Phrase Level Terms]
(formal language) a name identifying the formal language in which the code is expressed
[^\p{C}\p{Z}]+
(example) contains any kind of illustrative example. [22.5. Element Specifications 22.5.3. Attribute List Specification]
(example of XML) a single XML fragment demonstrating the use of some XML, such as elements, attributes, or processing instructions, etc., in which the <egXML>
element functions as the root element. [22.1.1. Phrase Level Terms]
indicates the intended validity of the example with respect to a schema.
true
the example is intended to be fully valid, assuming that its root element, or a provided root element, could have been used as a possible root element in the schema concerned.
feasible
the example could be transformed into a valid document by inserting any number of valid attributes and child elements anywhere within it; or it is valid against a version of the schema concerned in which the provision of character data, list, element, or attribute values has been made optional.
false
the example is not intended to be valid, and contains deliberate errors.
(element name) contains the name (generic identifier) of an element. [22. Documentation Elements 22.5. Element Specifications]
supplies the name of the scheme in which this name is defined.
Sample values include: 1] TEI; 2] DBK (docbook); 3] XX (unknown); 4] Schematron; 5] HTML
[^\p{C}\p{Z}]+
(identifier) contains an identifier or name for an object of some kind in a formal language. <ident>
is used for tokens such as variable names, class names, type names, function names etc. in formal programming languages. [22.1.1. Phrase Level Terms]
(tag) contains text of a complete start- or end-tag, possibly including attribute specifications, but excluding the opening and closing markup delimiter characters. [22. Documentation Elements]
indicates the type of XML tag intended
start
a start-tag, with delimiters < and > is intended
end
an end-tag, with delimiters </ and > is intended
empty
an empty tag, with delimiters < and /> is intended
pi
a pi (processing instruction), with delimiters <? and ?> is intended
comment
a comment, with delimiters <!-- and --> is intended
ms
a marked-section, with delimiters <[CDATA[ and ]]> is intended
supplies the name of the schema in which this tag is defined.
Sample values include: 1] TEI (text encoding initiative); 2] DBK (docbook); 3] XX (unknown); 4] Schematron; 5] HTML
[^\p{C}\p{Z}]+
(value) contains a single attribute value. [22. Documentation Elements 22.5.3. Attribute List Specification]
(specification list) marks where a list of descriptions is to be inserted into the prose documentation. [22.1.2. Element and Attribute Descriptions]
(specification description) indicates that a description of the specified element, class, or macro should be included at this point within a document. [22.1.2. Element and Attribute Descriptions]
(identifier) supplies the identifier of the documentary element or class for which a description is to be obtained.
(attributes) supplies attribute names for which descriptions should additionally be obtained.
points to the specification for an attribute or model class which is to be included in a schema [22.6. Class Specifications]
the identifier used for the required class within the source indicated.
indicates how references to this class within a content model should be interpreted.
alternation
any one member of the class may appear
sequence
a single occurrence of all members of the class may appear in sequence
sequenceOptional
a single occurrence of one or more members of the class may appear in sequence
sequenceOptionalRepeatable
one or more occurrences of one or more members of the class may appear in sequence.
sequenceRepeatable
one or more occurrences of all members of the class may appear in sequence
supplies a list of class members which are to be included in the schema being defined.
supplies a list of class members which are to be excluded from the schema being defined.
points to the specification for some element which is to be included in a schema [22.2. Modules and Schemas]
the identifier used for the required element within the source indicated.
points to the specification for some pattern which is to be included in a schema [22.7. Macro Specifications]
the identifier used for the required pattern within the source indicated.
(module reference) references a module which is to be incorporated into a schema. [22.2. Modules and Schemas]
Child elements of are only allowed when an external module is being loaded
specifies a default prefix which will be prepended to all patterns from the imported module
The prefix attribute
of should not match that of any other
element (it would defeat the purpose)
It is an error to supply both the @include and @except attributes
supplies a list of the elements which are to be copied from the specified module into the schema being defined.
supplies a list of the elements which are not to be copied from the specified module into the schema being defined.
the name of a TEI module
(uniform resource locator) refers to a non-TEI module of RELAX NG code by external location
\S+
(module specification) documents the structure, content, and purpose of a single module, i.e. a named and externally visible group of declarations. [22.2. Modules and Schemas]
(schema specification) generates a TEI-conformant schema and documentation for it. [2.3. The Encoding Description 22.2. Modules and Schemas 23.5.1. Making a Unified ODD]
specifies entry points to the schema, i.e. which patterns may be used as the root of documents conforming to it.
specifies a default prefix which will be prepended to all patterns relating to TEI elements, unless otherwise stated.
(target language) specifies which language to use when creating the objects in a schema if names for elements or attributes are available in more than one language
(documentation language) specifies which languages to use when creating documentation if the description for an element, attribute, class or macro is available in more than one language
(default namespace
exclusions) provides a list of namespaces and/or prefixed element names to be excluded by default from anyName in RELAX NG schemas
[^/\s]+:\S*
.+:.+
(specification group) contains any convenient grouping of specifications for use within the current module. [22.2. Modules and Schemas]
(reference to a specification group) indicates that the declarations contained by the <specGrp>
referenced should be inserted at this point. [22.2. Modules and Schemas]
points at the specification group which logically belongs here.
\S+
(element specification) documents the structure, content, and purpose of a single element type. [22.5. Element Specifications 22. Documentation Elements]
This elementSpec element has a mode= of "delete" even though it has child elements. Change the mode= to "add", "change", or "replace", or remove the child elements.
This elementSpec element has a mode= of "", but does not have any child elements or schema-changing attributes. Specify child elements, use validUntil=, predeclare=, ns=, or prefix=, or change the mode= to "delete".
specifies a default prefix which will be prepended to all patterns relating to the element, unless otherwise stated.
(class specification) contains reference information for a TEI element class; that is a group of elements which appear together in content models, or which share some common attribute, or both. [22.3. Specification Elements 22.6. Class Specifications]
indicates whether this is a model class or an attribute class
model
(content model) members of this class appear in the same content models
atts
(attributes) members of this class share common attributes
indicates which alternation and sequence instantiations of a model class may be referenced. By default, all variations are permitted.
alternation
members of the class are alternatives
sequence
members of the class are to be provided in sequence
sequenceOptional
members of the class may be provided, in sequence, but are optional
sequenceOptionalRepeatable
members of the class may be provided one or more times, in sequence, but are optional.
sequenceRepeatable
members of the class may be provided one or more times, in sequence
alternation
members of the class are alternatives
sequence
members of the class are to be provided in sequence
sequenceOptional
members of the class may be provided, in sequence, but are optional
sequenceOptionalRepeatable
members of the class may be provided one or more times, in sequence, but are optional.
sequenceRepeatable
members of the class may be provided one or more times, in sequence
alternation
members of the class are alternatives
sequence
members of the class are to be provided in sequence
sequenceOptional
members of the class may be provided, in sequence, but are optional
sequenceOptionalRepeatable
members of the class may be provided one or more times, in sequence, but are optional.
sequenceRepeatable
members of the class may be provided one or more times, in sequence
alternation
members of the class are alternatives
sequence
members of the class are to be provided in sequence
sequenceOptional
members of the class may be provided, in sequence, but are optional
sequenceOptionalRepeatable
members of the class may be provided one or more times, in sequence, but are optional.
sequenceRepeatable
members of the class may be provided one or more times, in sequence
alternation
members of the class are alternatives
sequence
members of the class are to be provided in sequence
sequenceOptional
members of the class may be provided, in sequence, but are optional
sequenceOptionalRepeatable
members of the class may be provided one or more times, in sequence, but are optional.
sequenceRepeatable
members of the class may be provided one or more times, in sequence
(datatype specification) documents a datatype. [22.3. Specification Elements 22.7. Macro Specifications]
(macro specification) documents the function and implementation of a pattern. [22.3. Specification Elements 22.7. Macro Specifications]
(remarks) contains any commentary or discussion about the usage of an element, attribute, class, or entity not otherwise documented within the containing element. [22.5. Element Specifications 22.5.3. Attribute List Specification 22.6. Class Specifications 22.7. Macro Specifications]
specifies the remark concerned.
(list of references) supplies a list of significant references in the current document or elsewhere.
In the context of tagset documentation, the listRef element must not self-nest.
In the context of tagset documentation, each ptr or ref element inside a listRef must have a target attribute with only 1 pointer as its value.
(exemplum) groups an example demonstrating the use of an element along with optional paragraphs of commentary. [22.5. Element Specifications]
(classes) specifies all the classes of which the documented element or class is a member or subclass. [22.5. Element Specifications 22.6. Class Specifications]
specifies the effect of this declaration on its parent module.
change
this declaration changes the declaration of the same name in the current definition
replace
this declaration replaces the declaration of the same name in the current definition
specifies class membership of the documented element or class. [22.4.3. Classification of Components]
specifies the identifier for a class of which the documented element or class is a member or subclass
specifies the effect of this declaration on its parent module.
add
this declaration is added to the current definitions
delete
this declaration and all of its children are removed from the current setup
supplies the maximum number of times the element can occur in elements which use this model class in their content model
(\-?[\d]+/\-?[\d]+)
supplies the minumum number of times the element must occur in elements which use this model class in their content model
(\-?[\d]+/\-?[\d]+)
(equivalent) specifies a component which is considered equivalent to the parent element, either by co-reference, or by external link. [3.4.1. Terms and Glosses 22.4.1. Description of Components]
a single word which follows the rules defining a legal XML name (see ), naming the underlying concept of which the parent is a representation.
(uniform resource identifier) references the underlying concept of which the parent is a representation by means of some external identifier
\S+
references an external script which contains a method to transform instances of this element to canonical TEI
\S+
(alternate identifier) supplies the recommended XML name for an element, class, attribute, etc. in some language. [3.4.1. Terms and Glosses 22.4.1. Description of Components]
describes the processing intended for a specified element. [22.5.4.1. The TEI processing model]
There are 2 (or more) 'model' elements in this ''
that have no predicate, but are targeted to the same output
("")
There are 2 (or more) 'model' elements in this
'' that have
the same predicate, and are targeted to the same output
("")
names the process or function which this processing model uses in order to produce output.
Suggested values include: 1] alternate; 2] anchor; 3] block; 4] body; 5] break; 6] cell; 7] cit; 8] document; 9] figure; 10] glyph; 11] graphic; 12] heading; 13] index; 14] inline; 15] link; 16] list; 17] listItem; 18] metadata; 19] note; 20] omit; 21] paragraph; 22] row; 23] section; 24] table; 25] text; 26] title
alternate
support display of alternative visualisations, for example by displaying the preferred content, by displaying both in parallel, or by toggling between the two.
anchor
create an identifiable anchor point in the output.
block
create a block structure
body
create the body of a document.
break
create a line, column, or page break according to the value of type
cell
create a table cell
cit
show the content, with an indication of the source
document
start a new output document
figure
make a figure with the title as caption
glyph
show a character by looking up reference to a chardesc at the given URI
graphic
if url is present, uses it to display graphic, else display a placeholder image.
heading
creates a heading.
index
generate list according to type.
inline
creates inline element out of content
link
create hyperlink
list
create a list.
listItem
create a list item.
metadata
create metadata section
note
create a note, often out of line, depending on the value of place; could be margin, footnote, endnote, inline
omit
do nothing, do not process children
paragraph
create a paragraph out of content.
row
create a table row
section
create a new section of the output document
table
create a table
text
create literal text
title
create document title
[^\p{C}\p{Z}]+
whether to obey any rendition attribute that is present.
the intended output.
Sample values include: 1] web; 2] print; 3] plain
[^\p{C}\p{Z}]+
the name of a CSS class which should be associated with this element
any sequence of model or <modelSequence>
elements which is to be processed as a single set of actions [22.5.4.7. Model sequence]
The 'model' children
of a 'modelSequence' element inherit the @output attribute of the
parent 'modelSequence', and thus should not have their own
the condition under which this model applies given as an XPath Predicate Expression
whether to obey any rendition attribute that is present
the intended output method
Suggested values include: 1] web; 2] print; 3] plaintext
web
the output is intended for presentation in a web format
print
the output is intended for presentation in a print format
plaintext
the output is intended for presentation in a plain text format
[^\p{C}\p{Z}]+
(model group) any grouping of <model>
or <modelSequence>
elements with a common output method [22.5.4.4. Model Contexts and Outputs]
whether to obey any rendition attribute which is present
the intended output method
Suggested values include: 1] web; 2] print; 3] plaintext
web
the output is intended for presentation in a web format
print
the output is intended for presentation in a print format
plaintext
the output is intended for presentation in a plain text format
[^\p{C}\p{Z}]+
describes the rendering or appearance intended for all occurrences of an element in a specified context for a specified type of output.
provides a way of defining pseudo-elements, that is, styling rules applicable to specific sub-portions of an element.
Sample values include: 1] first-line; 2] first-letter; 3] before; 4] after
[^\p{C}\p{Z}]+
list of parameter specifications
supplies specification for one parameter of a model behaviour [22.5.4.8. Defining a processing model]
provides a parameter for a model behaviour by supplying its name and an XPath expression identifying the location of its content. [22.5.4.5. Behaviours and their parameters]
a name for the parameter being supplied
Suggested values include: 1] alternate; 2] default; 3] height; 4] id; 5] label; 6] level; 7] link; 8] place; 9] type; 10] url; 11] width
alternate
when used with behaviour alternate, a parameter of this name supplies one of the pair of possible values; for example the regularized form rather than the original form within a <choice>
element.
default
when used with behaviour alternate, a parameter of this name supplies one of the pair of possible values; for example the original form rather than the regularized form within a <choice>
element.
height
when used with behaviour graphic, a parameter of this name supplies a value for the height of the graphic e.g. "300px", "50%".
id
a parameter of this name should supply a unique identifier for the element being processed; as for example with the anchor behaviour
label
a parameter of this name should supply an expression to be used to label something, for example `concat('Page ', @n)` for a page break or `@n` for a footnote reference; typically used with the note or break behaviours
level
when used with the heading behaviour, a parameter of this name supplies a positive integer indicating the hierarchic level of a heading.
link
when used with the link behaviour, a parameter of this name should supply a URL to be used as the target of a link.
place
when used with the note behaviour, a parameter of this name should provide a string which describes the intended placement of some text; typical values include "margin", "footnote", "endnote", "inline", "bottom"
type
a parameter of this name can be used to categorize the specified behaviour in any way; for example the kind of break (when used with the break behaviour) or the kind of index to be generated (if used with the index behaviour) etc.
url
when used with behaviour graphic, a parameter of this name supplies a a URL indicating the graphic intended.
width
when used with behaviour graphic, a parameter of this name supplies a value for the width of the graphic e.g. "400px", "70%".
[^\p{C}\p{Z}]+
supplies an XPath expression which when evaluated provides the value for the parameter
(content model) contains a declaration of the intended content model for the element (or other construct) being specified. [22.5. Element Specifications]
which could be wrapped in a <sequence> element.
which could be wrapped in an <rng:div> element.
but those children are neither all TEI elements nor
all RELAX NG elements, and thus this <content> is invalid and can not be easily rectified.
controls whether or not pattern names generated in the corresponding RELAX NG schema source are automatically prefixed to avoid potential nameclashes.
true
Each name referenced in e.g. an <rng:ref>
element within a content model is automatically prefixed by the value of the @prefix
attribute on the current <schemaSpec>
false
No prefixes are added: any prefix required by the value of the @prefix
attribute on the current <schemaSpec>
must therefore be supplied explicitly, as appropriate.
indicates that the constructs referenced by its children form a sequence [22.5.1. Defining Content Models]
The sequence element must have at least two child elements
if false, indicates that component elements of a sequence may occur in any order.
indicates that the constructs referenced by its children form an alternation [22.5.1. Defining Content Models]
The alternate element must have at least two child elements
(constraint rules) the formal rules of a constraint [22.5. Element Specifications]
(constraint on schema) contains a formal constraint, typically expressed in a rule-based schema language, to which a construct must conform in order to be considered valid [22.5. Element Specifications]
This constraintSpec element has a mode= of "delete" even though it has child elements. Change the mode= to "add", "change", or "replace", or remove the child elements.
This constraintSpec element has a mode= of "change", but does not have any child elements. Specify child elements, or change the mode= to "delete".
This constraintSpec element has a mode= of "", but does not have a child 'constraint' element. Use a child 'constraint' element or change the mode= to "delete" or "change".
Rules
in the Schematron 1.* language must be inside a constraintSpec
with a value other than 'schematron' or 'isoschematron' on the
scheme attribute
Rules
in the ISO Schematron language must be inside a constraintSpec
with the value 'schematron' on the scheme attribute
An ISO Schematron constraint specification for a macro should not
have an 'assert' or 'report' element without a parent 'rule' element
The @ident of 'constraintSpec' should be unique; this one () is the same as that of a previous 'constraintSpec'.
characterizes the <constraintSpec>
element in some sense; used to indicate when a <constraintSpec>
warns about a deprecated construct.
Suggested values include: 1] deprecationWarning
deprecationWarning
Indicates that this constraint specification warns that some other construct in the schema is deprecated.
[^\p{C}\p{Z}]+
supplies the name of the language in which the constraints are defined
Suggested values include: 1] schematron (ISO Schematron)
schematron
(ISO Schematron)
[^\p{C}\p{Z}]+
(attribute list) contains documentation for all the attributes associated with this element, as a series of <attDef>
elements. [22.5. Element Specifications 22.6. Class Specifications]
(organization) specifies whether only one (choice) or all (group) of the attributes in the list are available
group
grouped
choice
alternated
(attribute definition) contains the definition of a single attribute. [22.5.3. Attribute List Specification]
Attribute: the definition of the @ attribute in the
should have a closed valList or a datatype
Since the @ attribute is required, it will always be specified. Thus the default value (of "") will never be used. Either change the definition of the attribute so it is not required ("rec" or "opt"), or remove the defaultVal element.
In the defining
the default value of the
@ attribute is not among the closed list of possible
values
In the defining
the default value of the
@ attribute is not among the closed list of possible
values
specifies the optionality of the attribute.
req
(required)
rec
(recommended )
opt
(optional )
(namespace) specifies the namespace to which this attribute belongs
\S*
(attribute pointer) points to the definition of an attribute or group of attributes. [22.2. Modules and Schemas]
the name of the attribute class
[^\p{C}\p{Z}]+
the name of the attribute
(datatype) specifies the declared value for an attribute, by referring to any datatype defined by the chosen schema language. [1.4.2. Datatype Specifications 22.5.3. Attribute List Specification]
(minimum number of occurences) indicates the minimum number of times this datatype may occur in an instance of the attribute being defined
(maximum number of occurences) indicates the maximum number of times this datatype may occur in an instance of the attribute being defined
unbounded
identifies the datatype of an attribute value, either by referencing an item in an externally defined datatype library, or by pointing to a TEI-defined data specification [22.5.3.1. Datatypes]
Data facets can only be specified for references to datatypes specified by
XML Schemas: Part 2: Datatypes — that is, for there to be a 'dataFacet' child there must be a @name attribute.
Data facets and restrictions cannot both be expressed on the same data reference — that is, the @restriction attribute cannot be used when a 'dataFacet' element is present.
Restrictions can only be specified for references to datatypes specified by
XML Schemas: Part 2: Datatypes — that is, for there to be a @restriction attribute there must be a @name attribute, too.
the identifier used for this datatype specification
the name of a datatype in the list provided by XML Schemas: Part 2: Datatypes
a pointer to a datatype defined in some datatype library
\S+
supplies a string representing a regular expression providing additional constraints on the strings used to represent values of this datatype
Restricts the value of the strings used to represent values of a datatype, according to XML Schemas: Part 2: Datatypes. [22.5.3.1. Datatypes]
the name of the facet.
length
minLength
maxLength
pattern
enumeration
whiteSpace
maxInclusive
minInclusive
maxExclusive
minExclusive
totalDigits
fractionDigits
the facet value.
(default value) specifies the default declared value for an attribute. [22.5.3. Attribute List Specification]
(value description) specifies any semantic or syntactic constraint on the value that an attribute may take, additional to the information carried by the <datatype>
element. [22.5.3. Attribute List Specification]
documents a single value in a predefined list of values. [22.5.3. Attribute List Specification]
specifies the value concerned.
(value list) contains one or more <valItem>
elements defining possible values. [22.5.3. Attribute List Specification]
specifies the extensibility of the list of values specified.
closed
only the values specified are permitted.
semi
(semi-open) all the values specified should be supported, but other values are legal and software should have appropriate fallback processing for them.
open
the values specified are sample values only.
indicates the presence of a text node in a content model [22. Documentation Elements]
indicates the presence of any elements in a content model [22. Documentation Elements]
supplies a list of namespaces to one of which the permitted elements must belong.
\S*
supplies a list of namespaces or prefixed element names which are not permitted.
[^/\s]+:\S*
.+:.+
indicates the presence of an empty node within a content model [22. Documentation Elements]