A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
IFD Tagset – Icelandic part-of-speech tagset
IFD tagset is an Icelandic tagset of part-of-speech tags used with Icelandic corpora annotated by IceNLP which is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text.
The following table shows part-of-speech tags from IFD Tagset.
An Example of a tag in the CQL concordance search box: [tag="fah.*"]
finds all demonstrative pronouns in the accusative case, e.g. people, years (note: please make sure that you use straight double quotation marks)
Basic overview of Icelandic tagset
Part of speech | Tag |
adjective | l.* |
adverb | a.* |
article | g.* |
conjunction | c.* |
noun | n.* |
numeral | t.* |
pronoun | f.* |
verb | s.* |
Icelandic part-of-speech tags in detail
Noun
For example, to find all masculine plural nouns in accusative that are other proper names, use the following CQL tag: [tag="nkfo-s"]
(empty/unused category is replaced with dash “-“)
Order | Category/Feature | Value | Tag |
1 | Word class | n | noun |
2 | Gender | k | masculine |
v | feminine | ||
h | neuter | ||
x | unspecified | ||
3 | Number | e | singular |
f | plural | ||
4 | Case | n | nominative |
o | accusative | ||
þ | dative | ||
e | genitive | ||
5 | Article | g | with suffixed definite article |
6 | Proper noun | m | person name |
ö | place name | ||
s | other proper name |
Adjective
For example, to find all feminine singular adjectives in dative with strong declension and positive degree, use the following CQL tag: [tag="lveþsf"]
(the latin small letter thorn “þ” can be copied from here; Unicode code: U+00FE)
Order | Category/Feature | Tag | Value |
1 | Word class | l | adjective |
2 | Gender | k | masculine |
v | feminine | ||
h | neuter | ||
3 | Number | e | singular |
f | plural | ||
4 | Case | n | nominative |
o | accusative | ||
þ | dative | ||
e | genitive | ||
5 | Declension | s | strong declension |
v | weak declension | ||
o | indeclineable | ||
6 | Degree | f | positive |
m | comparative | ||
e | superlative |
Pronoun
For example, to find all second-person personal pronouns in singular and the genitive case, use the following CQL tag: [tag="fp2ee"]
Order | Category/Feature | Tag | Value |
1 | Word class | f | pronoun |
2 | Subcategory | a | demonstrative |
b | reflexive | ||
e | possessive | ||
o | indefinite | ||
p | personal | ||
s | interrogative | ||
t | relative | ||
3 | Gender / Person | k | masculine |
v | feminine | ||
h | neuter | ||
1 | 1st person | ||
2 | 2nd person | ||
4 | Number | e | singular |
f | plural | ||
5 | Case | n | nominative |
o | accusative | ||
þ | dative | ||
e | genitive |
Article
For example, to find all masculine articles in the genitive (both singular and plural), use the following CQL tag: [tag="gk.e"]
Order | Category/Feature | Tag | Value |
1 | Word class | g | article |
2 | Gender | k | masculine |
v | feminine | ||
h | neuter | ||
3 | Number | e | singular |
f | plural | ||
4 | Case | n | nominative |
o | accusative | ||
þ | dative | ||
e | genitive |
Numeral
For example, to find numerals in the plural, use the following CQL tag: [tag="t..f."]
Order | Category/Feature | Tag | Value |
1 | Word class | t | numeral |
2 | Category | f | alpha |
o | numeric | ||
p | percentage | ||
3 | Gender | k | masculine |
v | feminine | ||
4 | Case | h | neuter |
4 | Number | e | singular |
f | plural | ||
5 | Case | n | nominative |
o | accusative | ||
þ | dative | ||
e | genitive |
Verb (except for past participle)
For example, to find all verbs in the 2nd person singular present active indicative forms, use the following CQL tag: [tag="sfg2en"]
Order | Category/Feature | Tag | Value |
1 | Word class | s | verb (except for past participle) |
2 | Mood | n | infinitive |
b | imperative | ||
f | indicative | ||
3 | Gender | v | subjunctive |
s | supine | ||
4 | Case | l | present participle |
3 | Voice | g | active |
m | middle | ||
4 | Person | 1 | 1st person |
2 | 2nd person | ||
3 | 3rd person, | ||
5 | Number | e | singular |
f | plural | ||
6 | Tense | n | present |
þ | past |
Verb (past participle)
For example, to find all past participle verb forms in the nominative, use the following CQL tag: [tag="sþ.*n"]
Order | Category/Feature | Tag | Value |
1 | Word class | s | verb (past participle) |
2 | Mood | þ | past participle |
3 | Voice | g | active |
m | middle | ||
4 | Gender | k | masculine |
v | feminine | ||
h | neuter | ||
5 | Number | e | singular |
f | plural | ||
6 | Case | n | nominative |
o | accusative | ||
þ | dative | ||
e | genitive |
Adverb and preposition
For example, to find all superlative forms of adverbs/prepositions, use the following CQL tag: [tag="a.e"]
Order | Category/Feature | Tag | Value |
1 | Word class | a | adverb and preposition |
2 | Category | a | does not govern case |
u | exclamation | ||
o | governs accusative | ||
þ | governs dative | ||
e | governs genitive | ||
3 | Degree | m | comparative |
e | superlative |
Conjunction
For example, to find all relative conjunctions, use the following CQL tag: [tag="ct"]
Order | Category/Feature | Tag | Value |
1 | Word class | c | conjunction |
2 | Category | n | sign of infinitive |
t | relative conjunction, |
Foreign word
Order | Category/Feature | Tag | Value |
1 | Word class | e | foreign word |
Unanalyzed word
Order | Category/Feature | Tag | Value |
1 | Word class | x | unanalyzed word |
Reference
Github repository of IceNLP toolkit: https://github.com/hrafnl/icenlp
or