List of tags and empty categories


POS tags

Basic label Category Paradigmatic variants
ADJ adjective, including ordinal number ADJR (comparative)
ADJS (superlative)
ADJ$ (possessive; very rare)
ADV adverb ADVR (comparative)
ADVS (superlative)
ALSO the words ALSO (except with the meaning AS) and EKE (the latter in Middle English only)
BE auxiliary and main verb BE BE (infinitive)
BAG (present participle)
BED (past, including past subjunctive)
BEI (imperative)
BEN (perfect participle)
BEP (present, including present subjunctive)
C complementizer
CONJ coordinating conjunction
D determiner
DO auxiliary and main verb DO DO (infinitive)
DAG (present participle)
DAN (passive participle)
DOD (past)
DOI (imperative)
DON (perfect participle)
DOP (present, including present subjunctive)
ELSE the word ELSE in the collocation OR ELSE
EX existential THERE
FOR infinitival FOR
FP focus particle
FW foreign word
HV auxiliary and main verb HAVE HV (infinitive)
HAG (present participle)
HAN (passive participle)
HVD (past)
HVI (imperative)
HVN (perfect participle)
HVP (present, including present subjunctive)
ID identification number for sentence tokens
INTJ interjection
LB line break (Middle English only)
LS list item
MAN indefinite subject pronoun (Middle English only)
MD modal verb MD0 (untensed, rare)
N common noun N (singular)
N$ (singular, possessive)
NS (plural)
NS$ (plural possessive)
NPR proper noun NPR (singular)
NPR$ (singular, possessive)
NPRS (plural)
NPRS$ (plural possessive)
NUM cardinal number NUM$ (possessive, very rare)
ONE default label for the word ONE ONE (singular)
ONE$ (singular possessive)
ONES (plural)
ONES$ (plural possessive)
OTHER the word OTHER OTHER (singular)
OTHER$ (singular possessive)
OTHERS (plural)
OTHERS$ (plural possessive)
P preposition or subordinating conjunction
PRO pronoun PRO$ (possessive)
PUNC punctuation (both sentence-medial and sentence-final)
Q quantifier QR (comparative)
QS (superlative)
Q$ (possessive, very rare)
RP adverbial particle
SUCH the word SUCH
TO infinitival TO, TIL, and AT
VB verb other than BE, DO, or HAVE VB (infinitive)
VAG (present participle)
VAN (passive participle)
VBD (past)
VBI (imperative)
VBN (perfect participle)
VBP (present, including present subjunctive)
WADV wh-adverb  
WARD the morpheme WARD  
WD wh-determiner  
WPRO wh-pronoun WPRO$ (possessive)
WQ WHETHER or IF as head of indirect question  
X unknown part of speech  
$ possessive (when split from host morpheme)  

Syntactic tags

Syntactic tags encode various pieces of information about a phrasal constituent, notably its syntactic category and grammatical or thematic function and also various other syntactic, discourse, and metalinguistic features.

Certain basic syntactic categories have variants ending in X (for instance, NP and NX). These variants are used in conjunction contexts and are intended to be agnostic about whether the category in question is a lexical or a phrasal projection. For details, see the section on conjunction.

Category tags

Label Category Combines with function tags Variant for shared premodifier contexts Wh- variant
ADJP adjective phrase CAR, LOC, PRD, SPR ADJX WADJP
ADVP adverb phrase DIR, LOC, TMP ADVX WADVP
CONJP conjunction phrase
CP complement clause or question See CP types WCP (rare, but possible in pied-piping contexts)
FRAG sentence fragment
INTJP interjection phrase
IP clauses other than CP See IP types WIP (rare, but possible in pied-piping contexts)
NP noun phrase ADT, ADV, CAR, COM, DIR, DPS, LOC, MSR,
OB1, OB2, PRD, RFL, SBJ, SPR, TMP, VOC
NX WNP
NUMP number phrase NUMX (rare)
PP prepositional phrase WPP
QP quantifier phrase QX WQP
QTP quotation phrase
VP verb phrase (rare)
XX unknown phrasal category

Function tags

Label Function
-ADT adjunct
-ADV adverbial (e.g., manner)
-CAR clause-adjoined
-COM bare NP complement (corresponding to Modern English OF phrase)
-DIR directional
-DPS dative of possession
-LOC locative
-MSR measure
-OB1 direct object
-OB2 indirect object
-PRD predicate
-RFL inherent reflexive
-SBJ subject
-SPR secondary predicate
-TMP temporal
-VOC vocative

Extended tags

Label Definition
-LFD left-dislocated
-PRN parenthetical or appositive
-RSP resumptive
-SPE direct speech (only on CP, IP)
-XXX unknown function

"Meta" tags

CODE material that is not part of the text proper (text markup, page numbers, editorial comments, and the like)
FOREIGN passage in any foreign language other than (or, in PCEEC2, including) Latin
LATIN passage in Latin
META special text material (stage directions, speaker changes in plays, quotations in Modern English from Middle English, and the like)
REF references in the original text to sources of quotations

IP types

Label Type Subtype
IP-ABS absolute clause
IP-IMP imperative
IP-INF infinitive -ABS (absolute infinitive)
-ADT (infinitival adjunct)
-DEG (degree infinitive)
-PRP (purpose infinitive)
-SBJ (as subject)
IP-MAT matrix clause
IP-PPL participial clause -ABS (absolute)
-OB1 (as complement of verb or preposition)
-SBJ (as subject)
IP-SMC small clause
IP-SUB subordinate clause
RRC reduced relative clause

CP types

Label Type Subtype
CP-ADV adverbial clause
CP-CAR clause-adjoined relative clause
CP-CLF IT cleft
CP-CMP comparative clause
CP-DEG degree complement clause
CP-EOP empty operator clause (nonfinite)
CP-EXL exclamation (direct or indirect)
CP-FRL free relative clause
CP-QUE (always annotated for subtype) -MAT (direct question)
-SUB (indirect question)
-SUB-ADV (adverbial WHETHER clause)
-SUB-SBJ (ndirect question as subject)
-TAG (tag question)
CP-REL relative clause
CP-THT THAT complement clause -SBJ (as subject)
CP-TMC TOUGH movement complement

Empty categories

*arb* arbitrary subject in ECM infinitive (WE HEAR TELL)
*con* subject elided under conjunction
*exp* empty expletive subject
*pro* empty thematic subject (rare in English)
*ICH* mnemonic for "insert constituent here"; trace of extraposition, scrambling,
or other movement type that does not fit neatly into the A/A' dichotomy
*T* trace of A'-movement
* trace of A-movement where required by guidelines;
also default empty category
0 empty operator (when immediately dominated by wh- phrase);
empty terminal node (when immediately dominated by POS label)

Indices

Numeric indices are used to coindex: