A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Hebrew YAP part-of-speech tagset
Hebrew corpora annotated by the Yet Another (natural language) Parser (abbreviated as YAP) tool are tagged with this Hebrew part-of-speech scheme. YAP is part of the ONLP lab tool kit.
The following table shows the Hebrew part-of-speech tagset used in these corpora.
An example of a tag query in the CQL concordance search box: [tag="NNP"]
finds all proper nouns (note: make sure that you use straight double quotation marks, not curly ones).
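Curly quotation marks pasted from a word processor are a common reason such a query fails. The following is a minimal Python sketch of building the query string with straight quotes; it is not part of Sketch Engine or YAP, and the helper name `cql_tag_query` is hypothetical.

```python
# Hypothetical helper (not a Sketch Engine or YAP API): builds a CQL
# token query for a POS tag using straight double quotation marks.
CURLY_QUOTES = "\u201c\u201d\u201e\u201f"  # curly and low double quotes


def cql_tag_query(tag: str) -> str:
    """Return a CQL query such as [tag="NNP"]."""
    # Drop surrounding whitespace and any quotes pasted in with the tag.
    cleaned = tag.strip().strip('"' + CURLY_QUOTES)
    return f'[tag="{cleaned}"]'


print(cql_tag_query("NNP"))              # [tag="NNP"]
print(cql_tag_query("\u201cNNP\u201d"))  # [tag="NNP"]
```

The resulting string can be pasted into the CQL search box as it is.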
POS Tag | Description | Example |
ADVERB | The word כ before numerals | |
AT | Accusative marker את which appears as a separate word in written Hebrew | |
BN | Participle (בינוני) | |
BNN | Participle (בינוני) in construct state form | |
CC | Conjunction | |
CD | Numeral | |
CDT | Numeral in the construct state | |
CONJ | Coordinating conjunction ו | |
COP | Copula | |
COP_TOINFINITIVE | The infinitive form of the verb היה used as a copula | |
DEF | A special tag assigned to the definiteness marker ה, which appears with nouns, adjectives and numerals | |
DT | Used in the treebank only for the determiner כל with a pronominal suffix | |
DTT | Determiner | |
DUMMY_AT | Accusative marker את when used with a pronominal suffix | |
EX | Existential marker יש or אין | |
IN | Preposition | |
INTJ | Interjection | |
JJ | Adjective | |
JJT | Adjective in the construct state | |
MD | Modal predicates | |
NN | Noun | |
NN_S_PP | Noun with a pronominal suffix | |
NNP | Proper noun | |
NNT | Construct state noun | |
P | Prefix written as a separate word (אי, בלתי, אנטי etc.) | |
POS | Possessive preposition של and accusative marker את with a pronominal suffix | |
PREPOSITION | Inseparable preposition | |
PRP | Personal pronoun | |
QW | Question word | |
S_PRN | Personal pronoun attached to a preposition as a pronominal suffix | |
TEMP | Subordinating conjunction introducing time clauses | |
VB | Verb | |
VB_TOINFINITIVE | A verb in its infinitive form | |
yy.* | Various symbols | |
yyCLN | Colon | : |
yyCM | Comma | , |
yyDASH | Hyphen or dash | - or – |
yyDOT | Period | . |
yyELPS | Ellipsis (…) | … |
yyEXCL | Exclamation point | ! |
yyLRB | Left parenthesis | ( |
yyQM | Question mark | ? |
yyQUOT | Quotation mark | „ |
yyRRB | Right parenthesis | ) |
yySCLN | Semicolon | ; |
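The `yy.*` row is a regular expression rather than a single tag: in CQL, attribute values are treated as regular expressions matched against the whole value, so [tag="yy.*"] covers every punctuation tag listed above. The Python sketch below imitates that whole-value matching with `re.fullmatch` on a hand-copied sample of tags; it only illustrates the pattern and does not query a corpus. The function name `matching_tags` is an assumption made for this example.

```python
import re

# A small sample of tags copied by hand from the table above.
TAGS = ["NN", "NNP", "NNT", "VB", "yyCLN", "yyCM", "yyDOT", "yyQM"]


def matching_tags(pattern: str, tags: list[str]) -> list[str]:
    """Return the tags that the pattern matches in full."""
    regex = re.compile(pattern)
    # fullmatch mirrors whole-value matching: "NN" must not match "NNP".
    return [t for t in tags if regex.fullmatch(t)]


print(matching_tags("yy.*", TAGS))  # ['yyCLN', 'yyCM', 'yyDOT', 'yyQM']
print(matching_tags("NN.*", TAGS))  # ['NN', 'NNP', 'NNT']
print(matching_tags("NN", TAGS))    # ['NN']
```

The same idea applies to other tag families: a pattern such as `NN.*` selects nouns in all their tagged forms, while the plain value `NN` selects only tokens tagged exactly `NN`.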
Reference
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.22.8581&rep=rep1&type=pdf (Hebrew tagset can be found on page 7 and following)
or