A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Modified Finish TreeTagger part-of-speech tagset is available in Finish corpora annotated by the tool TreeTagger that was developed by Helmut Schmid in the TC project at the Institute for Computational Linguistics of the University of Stuttgart and containing modifications developed by Sketch Engine (currently pipeline version 2).
Finnish part-of-speech tagset overview
An Example of a tag in the CQL concordance search box: [tag="N_.*Pl"]
finds all nouns in plural, e.g. joissa, ihmiset (note: please make sure that you use straight double quotation marks)
Finnish Tagset overview
N|N_.* | noun |
A|A_.* | adjective |
Pron | pronoun |
Num.* | numeral |
V|V_.* | verb (except for present participle and perfect participle) |
Adv|Ag.* | adverb |
Adp_.* | adposition |
CC | coordinating conjunction |
CS | preposition and subordinating conjuction |
Interj | interjection |
NON-TWOL | non-word or foreign word |
PrfPrc.* | perfect participle |
Abbr | abbreviation |
PrsPrc.* | present participle |
Punct | punctuation except for sentence-ending punctuation |
SENT | sentence-ending punctuation (. or ! or ? or their combination) |
Nouns subclassification
All nouns may be searched by a CQL query: [tag = “N|N_.*]
Category |
Subcategory |
Tag | Example of a tag |
Example of words | |
Noun type | Proper | Prop | N_Prop_Nom_Sg | ||
Case | Grammatical cases | Nominative | Nom | N_Nom_Sg | aika, osa |
Genitive | Gen | N_Gen_Sg | verran, ihmisen | ||
Accusative | Acc | ||||
Locative cases | Inessive | Ine | N_Ine_Pl | kuvissa, töissä | |
Elative | Ela | N_Ela_Pl | lapsista, töissä | ||
Illative | Ill | N_Ill_Sg | kohtaan | ||
Adessive | Ade | N_Ade_Sg | puolella | ||
Ablative | Abl | N_Abl_Sg | puolelta, nimeltään | ||
Allative | All | N_All_Sg | tasolle, paikalle | ||
Essive | Ess | N_Ess_Sg | perjantaina, vuonna | ||
Translative | Tra | N_Tra_Sg | anteeksi, onneksi | ||
Underused cases | Abessive | Abe | N_Abe_Sg | kiistatta, vuotta | |
Comitative | Com | N_Com | korkoineen, päivineen | ||
Instructive | Ins | N_Ins_Pl | kaksin | ||
Number | Singular | Sg | N_Ill_Sg | aikaan | |
Plural | Pl | N_Ill_Pl | aikoihin |
Adjective subclassification
All adjectives may be searched by a CQL query: [tag = “A|A_.*”]
Category |
Subcategory |
Tag | Example of a Tag | |
Adjective |
Type | Proper | Prop | A_Prop_Nom_Sg |
Adjective | Grade | Positive | Pos | |
Comparative | Comp | |||
Superlative | Superl | |||
Case | Grammatical cases | Nominative | Nom | A_Nom_Sg |
Genitive | Gen | A_Gen_Sg | ||
Partitive | Par | A_Par_Sg | ||
Accusative | Acc | |||
Locative cases | Inessive | Ine | A_Ine_Sg | |
Elative | Ela | A_Ela_Sg | ||
Illative | Ill | A_Ill_Sg | ||
Adessive | Ade | A_Ade_Sg | ||
Ablative | Abl | A_Abl_Sg | ||
Allative | All | A_All_Sg | ||
Essive | Ess | A_Ess_Sg | ||
Translative | Tra | A_Tra_Sg | ||
Underused cases | Abessive | Abe | A_Abe_Pl | |
Comitative | Com | A_Com | ||
Instructive | Ins | A_Ins_Pl | ||
Number | Singular | Sg | A_Nom_Sg | |
Plural | Pl | A_Nom_Pl |
Pronoun subclassification
All pronouns may be searched by a CQL query: [tag = “Pron|Pron_.*”]
Category |
Subcategory |
Tag | Example of a tag | |
Pronouns | Pronoun types | Personal pronoun | Pers | Pron_Pers_Nom_Sg |
Relative pronoun | Rel | Pron_Rel_Nom_Sg | ||
Reciprocal pronoun | Recip | |||
Reflexive pronoun | Refl | Pron_Refl_Nom_Sg | ||
Demonstrative pronoun | Dem | Pron_Dem_Nom_Sg | ||
Indefinite pronoun | Indef | Pron_Indef | ||
Quantifying pronoun | Qnt | Pron_Qnt_Nom_Sg | ||
Interrogative pronoun | Interr | Pron_Interr_Nom_Sg | ||
Case | Grammatical cases | Nominative | Nom | Pron_Dem_Nom_Sg |
Genitive | Gen | Pron_Dem_Gen_Sg | ||
Accusative | Acc | Pron_Pers_Acc_Sg | ||
Locative cases | Inessive | Ine | Pron_Dem_Ine_Sg | |
Elative | Ela | Pron_Qnt_Ela_Sg | ||
Illative | Ill | Pron_Rel_Ill_Sg | ||
Adessive | Ade | Pron_Dem_Ade_Sg | ||
Ablative | Abl | Pron_Dem_Abl_Sg | ||
Allative | All | Pron_Pers_All_Sg | ||
Essive | Ess | Pron_Dem_Ess_Sg | ||
Translative | Tra | Pron_Dem_Tra_Sg | ||
Underused cases | Abessive | Abe | Pron_Qnt_Abe_Pl | |
Comitative | Com | Pron_Qnt_Com | ||
Instructive | Ins | Pron_Qnt_Ins_Pl | ||
Number | Singular | Sg | Pron_Pers_Nom_Sg | |
Plural | Pl | Pron_Pers_Nom_Pl |
Numeral subclassification
All numerals may be searched by a CQL query: [tag = “Num|Num_.*”]
Category |
Subcategory |
Tag | Example of a Tag | |
Numeral | Real number | Real | ||
Ordinal | Ord | Num_Ord_Nom_Sg | ||
Cardinal | Card | |||
Case | Grammatical cases | Nominative | Nom | Num_Nom_Sg |
Genitive | Gen | Num_Gen_Sg | ||
Partitive | Par | Num_Par_Sg | ||
Accusative | Acc | |||
Locative cases | Inessive | Ine | Num_Ine_Sg | |
Pronouns | Elative | Ela | Num_Ela_Sg | |
Illative | Ill | Num_Ill_Sg | ||
Adessive | Ade | Num_Ade_Sg | ||
Ablative | Abl | Num_Abl_Sg | ||
Allative | All | Num_All_Sg | ||
Essive | Ess | Num_Ess_Sg | ||
Translative | Tra | Num_Tra_Sg | ||
Underused cases | Abessive | Abe | Num_Abe_Sg | |
Comitative | Com | Num_Com | ||
Instructive | Ins | Num_Ins_Pl | ||
Number | Singular | Sg | Num_Nom_Sg | |
Plural | Pl | Num_Nom_Pl |
Verbs
All verbs may be searched by a CQL query: [tag = “V|V_.*”]
Category | Subcategory | Tag | Example of a tag |
Example of a word |
Negative verb | Neg | V_Neg_Sg3 | ei, eihän | |
Infinitives | First Infinitive | Inf1 | V_Inf1_Lat | nähdä, lopettaa, käyttää |
Second Infinitive | Inf2 | V_Inf2_Act_Ine | ollessa, mennessä | |
Third Infinitive | Inf3 | V_Inf3_Ill | saamaan, poistamaan | |
Participles | Present participle | PrsPrc | ||
Past participle | PrfPrc | V_PrfPrc_Act_Sg1 | ajamaan, pelaamaan | |
Agent participle | AgPcp | |||
Negative participle | Neg><pcp< td=””> </pcp<> | |||
Mood | Indicative | Ind | ||
Conditional | Cond | V_Cond_Act_Pl3 | kytkivät, olisivat | |
Imperative | Imprt | V_Imprt_Act_Sg3 | olkoon, tulkoon | |
Potential | Pot | V_Pot_Act_Pl3 | lienevät, kelvannevat | |
Optative | Opt | |||
Tense | Present | Prs | V_Prs_Act_Sg1 | olen, pidän |
Imperfect | Prt | |||
Active/Passive | Active | Act | V_Prs_Act_Sg3 | on, tulee |
Passive | Pas | V_PrfPrc_Pass_Pe4 | tarvittiin, koettiin | |
Negated form | ConNeg | V_Prs_Act_ConNeg | ole, riitä | |
Person | First person singular | Sg1 | V_Prs_Act_Sg1 | olen |
Second person singular | Sg2 | V_Prs_Act_Sg2 | olet | |
Third person singular | Sg3 | V_Pot_Act_Sg3 | lienee | |
First person plural | Pl1 | V_Prs_Act_Pl1 | olemme | |
Second person plural | Pl2 | V_Prs_Act_Pl2 | olette | |
Third person plural | Pl3 | V_Prs_Act_Pl3 | ovat |
Other PoS
Part of speech | Category | Tag |
Adverbs | Adverb | Adv |
Manner (adverb) | Man | |
Particles | Particle | Pcle |
Interjection | Interjection | Interj |
Conjunctions | Coordinating conjunction | CC |
Subordinating conjunction | CS | |
Adpositions | Adposition | Adp |
Preposition | Pr | |
Postposition | Po |
Other
Possessive suffixes | Possessive suffix: first person singular | PxSg1 |
Possessive suffix: second person singular | PxSg2 | |
Possessive suffix: third person singular | PxSg3 | |
Possessive suffix: first person plural | PxPl1 | |
Possessive suffix: second person plural | PxPl2 | |
Possessive suffix: third person plural | PxPl3 | |
Other | Abbreviation | Abbr |
Acronym | Acro | |
Upper case | Cap | |
Sentence ending | Sent | |
Dash | Dash | |
Truncated compound | TrunCo | |
Foreign | Forgn | |
Punctuation | Punct | |
Quotation | Quote | |
Lative | Lat |
For more information see the list of all POS tags in the Finnish TreeTagger tagset and FinnTreeBank2Manual.
Reference
VOUTILAINEN Atro, PURTONEN Tanja, MUHONEN Kristiina. FinnTreeBank2: Manual. Helsinki: University of Helsinki, Department of Modern Languages, 2012.