A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Turkish part-of-speech tagset is available in Turkish corpora annotated by the MaltParser with the pre-trained Turkish model created by Asst.Prof.Dr. Gülşen Cebiroğlu Eryiğit.
An Example of a tag in the CQL concordance search box: [tag="Noun"]
finds all nouns, e.g. zaman, Türk (note: please make sure that you use straight double quotation marks)
Tagset
tag | description | example |
Noun | noun | zaman |
Punc | punctuation | , . |
Adj | adjective | olan, büyük |
Verb | verb | ise, vardır |
Conj | conjunction | ve, da |
Adv | adverb | olarak, en |
Num | number | 1 |
Postp | postposition | için, ile |
Det | determiner | bu, her |
Pron | pronoun | kendi, onun |
Ques | question | mi, mu |
Interj | interjection | ey, be |
Dup | dup | pırıl, tefek |
Source: http://web.itu.edu.tr/gulsenc/TurkishDepModel.html