A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Persian part-of-speech tagset is available in Persian corpora annotated by the POS tagger with the tagset based on the Persian Syntactic Dependency Treebank.
An Example of a tag in the CQL concordance search box: [tag=".*ANM"] finds all nouns, e.g. کسی, خبر (note: please make sure that you use straight double quotation marks)
Tagset
| Description | Tag |
| adjective | |
| comparitive | AJCM |
| positive | AJP |
| superlative | AJSUP |
| address term | |
| pre-noun | PRADR |
| post-noun | POSADR |
| adverb | SADV |
| conjunction | CONJ |
| title | IDEN |
| noun | |
| animate | ANM |
| inanimate | IANM |
| particle | PART |
| post-noun modifier | POSNUM |
| postposition | POSTP |
| pronoun | |
| separate personal | SEPER |
| enclitic personal | JOPER |
| demonstrative | DEMON |
| interogative | INTG |
| common reflexive | CREFX |
| noncommon reflexive | UCREFX |
| reciprocal | RECPR |
| pre-modifier | PREM |
| exclamatory | EXAJ |
| interrogative | QUAJ |
| demonstrative | DEMAJ |
| ambiguous | AMBAJ |
| pre-noun numeral | PRENUM |
| preposition | PREP |
| pseudo-sentence | PSUS |
| punctuation | PUNC |
| verb | |
| active | ACT |
| passive | PASS |
| modal | MODL |
| subordinating clause | SUBR |
Source: http://www.cs.columbia.edu/~rasooli/papers/Rasooli-et-al.,NAACL-HLT2013.pdf




