A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Norwegian part-of-speech tagset is available in Norwegian corpora annotated by the Oslo-Bergen Tagger.
An Example of a tag in the CQL concordance search box: [tag="S[AP].*
"]
finds all nouns including proper nouns, e.g. Noreg, dag (note: please make sure that you use straight double quotation marks)
Tagset (basic part-of-speech)
Tag | Description |
S | substantives |
A | adjectives |
D | determiners |
P | participles |
ADV | adverb |
INTERJ | interjection |
INF-M | infinitive marker |
KONJ | conjuction |
PRON-(plus features) | pronoun |
PREP | preposition |
V-(plus features) | verb |
SBU (subordination) | subjunction |
TALL (roman numerals) | roman numerals |
FORK | abbreviation |
Full tagset summary (detailed part-of-speech information)
NOUNS | |
Position | explanation |
1 | S (wordclass) |
2 | A/P for appelative or proper noun |
3 | M/F/N for the gender |
4 | E/F for singular or plural |
5 | case (usually 0 but can be G for the genitive) |
6 | U/B for the indefinite or definite |
ADJECTIVES | |
Position | explanation |
1 | A |
2 | Q (qualitative*) |
3 | P/K/S degree (positive, comparative, superlative) |
4 | M/F/N/2 masculine, feminine, neutrum, masculine/feminine (most) |
5 | E/F (number) |
6 | G case (rare) |
7 | U/B (definite, indefinite) |
PARTICIPLES | |
Position | explanation |
1 | P (class) |
2 | F/P (perfect, present) |
3 | 0 (always 0, not used) |
4 | 2/N (gender : masc/fem, neutrum, 0 [most common]) |
5 | E/F (number: sing/pl) |
6 | never used |
7 | U/B (definite, indefinite)* |
PREPOSITIONS | |
PREP | |
DETERMINTERS | |
Position | explanation |
1 | D (class) |
2 | D/K/P (demonstrative, numerical, possessive) |
3 | M/F/N (gender) |
4 | E/F (number) |
5 | G (case: genitive, usually 0) |
6 | B/0 (def/indef) ? |
PUNCTUATION, etc | |
Position | explanation |
FE | External punctuation (. ! ? ) |
FI | Internal punctuation (, ;) |
FP | parallel matching punctuation, quotes, parentheses etc |
GRAFIKK | markers in the text for divisions etc (***) |
SYMBOL | publishing symbols |
ABBREVIATIONS | |
FORK | |
Interjections | |
INTERJ | |
Conjunctions, subjunctions | |
KONJ | |
SBU | |
PRONOUNS | |
PRON | + |
Position | explanation |
6 | P/R/H (personal, reflexive, interrogative “who”) |
7 | 1/2/3 (first, second, third person) |
8 | M/F/N (gender) |
9 | E/F (number) |
10 | N/A (normal, object form) |
11 | H/I (animate, inanimate) |
NUMBERS | |
TO | ordinal |
TALL | number |
VERBS | |
Tag | explanation |
INF-M | infinitive marker |
V-IMP | imperative |
V-INF | infinitive |
V-INF-GEN | XXX |
V-INF-PRES-ST-FORM | XXX |
V-INF-ST-FORM | XXX |
V-PAA-ST-PRET | XXX |
V-PRES | XXX |
V-PRES-SIDEFORM | XXX |
V-PRES-ST-FORM | XXX |
V-PRET | XXX |
Foreign words | |
X |
Source: http://folk.uio.no/danielr/nyno-brill-doc.html