Material that has been split or separated in the course of annotation (punctuation, contractions, cliticized articles, and so on) remains separated in the text files in order to simplify searches and tallies. See Word tokenization for details.
<P_2> <heading> I . (CMMALORY,2.3) Merlin (CMMALORY,2.4) <$$heading> HIT befel in the dayes of Uther Pendragon , when he was kynge of all Englond and so regned , that there was a myghty duke in Cornewaill that helde warre ageynst hym long tyme . (CMMALORY,2.6) and the duke was called the duke of Tyntagil . (CMMALORY,2.7) And so by meanes kynge Uther send for this duk chargyng hym to brynge his wyf with hym . (CMMALORY,2.8) for she was called a fair lady and a passynge wyse . (CMMALORY,2.9) and her name was called Igrayne . (CMMALORY,2.10)
The text is divided into tokens in the same way as in the text files.
<P_2>/CODE <heading>/CODE I/NUM ./PUNC CMMALORY,2.3/ID Merlin/NPR CMMALORY,2.4/ID <$$heading>/CODE HIT/PRO befel/VBD in/P the/D dayes/NS of/P Uther/NPR Pendragon/NPR ,/PUNC when/P he/PRO was/BED kynge/N of/P all/Q Englond/NPR and/CONJ so/ADV regned/VBD ,/PUNC that/C there/EX was/BED a/D myghty/ADJ duke/N in/P Cornewaill/NPR that/C helde/VBD warre/N ageynst/P hym/PRO long/ADJ tyme/N ,/PUNC CMMALORY,2.6/ID and/CONJ the/D duke/N was/BED called/VAN the/D duke/N of/P Tyntagil/NPR ./PUNC CMMALORY,2.7/ID And/CONJ so/ADV by/P meanes/NS kynge/NPR Uther/NPR send/VBD for/P this/D duk/N chargyng/VAG hym/PRO to/TO brynge/VB his/PRO$ wyf/N with/P hym/PRO ,/PUNC CMMALORY,2.8/ID for/CONJ she/PRO was/BED called/VAN a/D fair/ADJ lady/N and/CONJ a/D passynge/ADV wyse/ADJ ,/PUNC CMMALORY,2.9/ID and/CONJ her/PRO$ name/N was/BED called/VAN Igrayne/NPR ./PUNC CMMALORY,2.10/ID
( (CODE <BEGIN_cmmalory-m4>)) ( (CODE <P_2>)) ( (CODE <heading>)) ( (LS (NUM I) (PUNC .)) (ID CMMALORY-M4,2.4)) ( (NP (NPR Merlin)) (ID CMMALORY-M4,2.5)) ( (CODE <$$heading>)) ( (IP-MAT (NP-SBJ=1 (PRO HIT)) (VBD befel) (PP (P in) (NP (D the) (NS dayes) (PP (P of) (NP (NPR Uther) (NPR Pendragon))))) (PUNC ,) (PP (P when) (CP-ADV (C 0) (IP-SUB (IP-SUB (NP-SBJ (PRO he)) (BED was) (NP-PRD (N kynge) (PP (P of) (NP (Q all) (NPR Englond))))) (CONJP (CONJ and) (IP-SUB (NP-SBJ *con*) (ADVP (ADV so)) (VBD regned)))))) (PUNC ,) (CP-THT-1 (C that) (IP-SUB (NP-SBJ=2 (EX there)) (BED was) (NP-2 (D a) (ADJ myghty) (N duke) (CP-REL *ICH*-3)) (PP (P in) (NP (NPR Cornewaill))) (CP-REL-3 (WNP-4 0) (C that) (IP-SUB (NP-SBJ *T*-4) (VBD helde) (NP-ACC (N warre)) (PP (P ageynst) (NP (PRO hym))) (NP-MSR (ADJ long) (N tyme)))))) (PUNC ,)) (ID CMMALORY-M4,2.7)) ( (IP-MAT (CONJ and) (NP-SBJ-1 (D the) (N duke)) (BED was) (VAN called) (IP-SMC (NP-SBJ *-1) (NP-PRD (D the) (N duke) (PP (P of) (NP (NPR Tyntagil))))) (PUNC .)) (ID CMMALORY-M4,2.8)) ( (IP-MAT (CONJ And) (ADVP (ADV so)) (PP (P by) (NP (NS meanes))) (NP-SBJ (NPR kynge) (NPR Uther)) (VBD send) (PP (P for) (NP (D this) (N duk))) (IP-PPL (VAG chargyng) (NP-ACC (PRO hym)) (IP-INF (TO to) (VB brynge) (NP-ACC (PRO$ his) (N wyf)) (PP (P with) (NP (PRO hym))))) (PUNC ,)) (ID CMMALORY-M4,2.9)) ( (IP-MAT (CONJ for) (NP-SBJ-1 (PRO she)) (BED was) (VAN called) (IP-SMC (NP-SBJ *-1) (NP-PRD (NP (D a) (ADJ fair) (N lady)) (CONJP (CONJ and) (NP (D a) (ADJP (ADV passynge) (ADJ wyse)))))) (PUNC ,)) (ID CMMALORY-M4,2.10)) ( (IP-MAT (CONJ and) (NP-SBJ-1 (PRO$ her) (N name)) (BED was) (VAN called) (IP-SMC (NP-SBJ *-1) (NP-PRD (NPR Igrayne))) (PUNC .)) (ID CMMALORY-M4,2.11))
Editor comments are either omitted or enclosed in
{ED:...}. Comments added by Helsinki or Penn are enclosed in
{COM:...}.
Emendations are preceded by a dollar sign (for
instance, $the); multi-word emendations are also sometimes
surrounded by ... . Emendations include those in the
original edition as well as those introduced by Helsinki or Penn.
Font codes are retained.
Headings that are part of the original text are enclosed in
<heading> ... <$$heading>. Headings in the Helsinki
samples are in all caps. This convention is generally not followed in the
samples added at Penn.
Headings added by the editor are treated as editor comments.
That is, they are either omitted or enclosed in {ED:...}.
Language codes are omitted.
Parentheses indicating emendations in the original text are
omitted, and the material in them is treated like other emendations.
Otherwise, parentheses are represented as
( (CODE <P_73>)) ← page number in edition
( (CODE <heading>)) ← beginning of heading
( (FRAG (NUM VII)
(CODE {COM:Trinity_Homily_IV}) ← comment added at Penn
(, .)
(LATIN (FW CREDO))
(. .))
(ID CMLAMB1-M1,73.4))
( (CODE <$$heading>)) ← end of heading
( (IP-MAT-SPE (NP-SBJ (PRO $we)) ← emendation
(CODE {TEXT:+te}) ← text in edition
(MD wulle+d)
(VB fole+ge)
(NP-OB1 (PRO +te)))
(ID CMANCRIW-1-M1,II.130.1708))
( (IP-MAT-SPE (' ')
(NP-SBJ (D +De) (N mann))
(NEG ne)
(VBP leue+d)
(NEG naht)
(PP (P $be) ← emendation
(CODE {TEXT:he}) ← text in edition
(NP (N bread) (FP ane)))
(. ,))
(ID CMVICES1-M1,89.1018))
( (IP-SUB (NP-SBJ (N mihte))
(NP-OB1 (PRO $+te)) ← emendation over two words
(NEG $ne)
(CODE {TEXT:+te_+te}) ← text in edition
(VBP atiere+d))
(ID CMTRINIT-MX1,29.394))