A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Susanne corpus part-of-speech tagset is a special tagset used in Susanne corpus comprises 353 distinct word tags. Detailed information about the tagset find at http://www.grsampson.net/SueDoc.html
This is modified tagset of default TreeTagger tagset.
An Example of a tag in the CQL concordance search box: [tag="NN.*"]
finds all common nouns, e.g. people, years (note: please make sure that you use straight double quotation marks)
Tagset
Rootrank Formtags | |
O | paragraph |
Oh | heading |
Ot | title (e.g. of book) |
Q | quotation |
I | interpolation |
Iq | tag question |
Iu | technical reference |
Clausetags | |
S | main clause |
Ss | embedded quoting clause |
Fa | adverbial clause |
Fn | nominal clause |
Fr | relative clause |
Ff | fused relative |
Fc | comparative clause |
Tg | present participle clause |
Tn | past participle clause |
Ti | infinitival clause |
Tf | for-to clause |
Tb | bare nonfinite clause |
Tq | infinitival relative clause |
W | with clause |
A | special as clause |
Z | reduced (“whiz-deleted”) relative |
L | miscellaneous verbless clause |
Phrasetags | |
V | verb group |
N | noun phrase |
J | adjective phrase |
R | adverb phrase |
P | prepositional phrase |
D | determiner phrase |
M | numeral phrase |
G | genitive phrase |
Subcategories | |
Vo | operator section of verb group, when separated from remainder of V e.g. by subject-auxiliary inversion |
Vr | remainder of V from which Vo has been separated |
Vm | V beginning with am |
Va | V beginning with are |
Vs | V beginning with was |
Vz | V beginning with other 3rd-singular verb |
Vw | V beginning with were |
Vj | V beginning with be |
Vd | V beginning with past tense |
Vi | infinitival V |
Vg | V beginning with present participle |
Vn | V beginning with past participle |
Vc | V beginning with modal |
Vk | V containing emphatic DO |
Ve | negative V |
Vf | perfective V |
Vu | progressive V |
Vp | passive V |
Vb | V ending with BE |
Vx | V lacking main verb |
Vt | catenative V |
Nq | wh- N |
Nv | wh…ever N |
Ne | I/me as whole or head |
Ny | you as whole or head |
Ni | it as whole or head |
Nj | adjectival head |
Nn | proper name |
Nu | unit of measurement as head |
Na | marked as subject |
No | marked as nonsubject |
Ns | marked as singular |
Np | marked as plural |
Jq | wh- J |
Jv | wh…ever J |
Jx | measured absolute J |
Jr | measured comparative J |
Jh | heavy (postmodified) J |
Rq | wh- R |
Rv | wh…ever R |
Rx | measured absolute R |
Rr | measured comparative R |
Rs | adverb conducive to asyndeton |
Rw | quasi-nominal adverb |
Po | of phrase |
Pb | by phrase |
Pq | wh- P |
Pv | wh…ever P |
Dq | wh- D |
Dv | wh…ever D |
Ds | marked as singular |
Dp | marked as plural |
Ms | M headed by one |
Gq | wh- G |
Gv | wh…ever G |
Functiontags | |
Functiontags are appended to formtags, similarly to the previously mentioned subcategories. They are however seperated from them with a ‚:‘. | |
Complement Functiontags | |
s | logical subject |
o | logical direct object |
i | indirect object |
u | prepositional object |
e | predicate complement of subject |
j | predicate complement of object |
a | agent of passive |
S | surface (and not logical) subject |
O | surface (and not logical) direct object |
G | guest having no grammatical role within its tagma |
Adjunct Functiontags | |
p | place |
q | direction |
t | time |
h | manner or degree |
m | modality |
c | contingency |
r | respect |
w | comitative |
k | benefactive |
b | absolute |
Other Functiontags | |
n | participle of phrasal verb |
x | relative clause having higher clause as antecedent |
z | complement of catenative |
Source: http://www.grsampson.net/SueDoc.html