A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels that indicate the part of speech, and sometimes other grammatical categories (case, tense, etc.), of each token in a text corpus.
The Stanford Arabic parser tagset is used in Arabic corpora processed by the Stanford Arabic Parser, a tool developed by the Stanford Natural Language Processing Group at Stanford University.
An example of a tag in the CQL concordance search box: [tag="VBD"]
finds all past-tense verbs, e.g. كان
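The effect of such a query can be imitated outside Sketch Engine with a minimal Python sketch. It assumes that CQL compares the pattern against the whole tag value, which `re.fullmatch` reproduces. The helper `match_tag` and the three hand-tagged tokens are hypothetical illustrations, not output of the Stanford Arabic Parser.

```python
import re

# Hand-tagged illustrative tokens (word, tag); an assumption for this sketch,
# not actual parser output.
tokens = [("كان", "VBD"), ("الكتاب", "DTNN"), ("في", "IN")]

def match_tag(tokens, pattern):
    """Return the words whose tag matches the pattern as a whole,
    roughly what [tag="pattern"] selects in a concordance."""
    rx = re.compile(pattern)
    return [word for word, tag in tokens if rx.fullmatch(tag)]

# Equivalent in spirit to the CQL query [tag="VBD"]
past_verbs = match_tag(tokens, "VBD")
print(past_verbs)
```

Because the whole tag must match, the pattern `NN` alone would not select the token tagged `DTNN` in this sketch.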
Tagset summary
Basic notation
part of speech | tag pattern |
noun | (DT)?NN.* |
verb | VB.* |
adjective | (DT)?JJ.* |
adverb | W?RB |
conjunction | CC |
preposition | IN |
pronoun | PRP.? |
cardinal number | CD |
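The patterns in the table above are regular expressions over tag names. As a rough sketch, the Python snippet below copies them into a dictionary and uses them to map a tag to its coarse part of speech or to build a CQL query string. The names `BASIC_PATTERNS`, `coarse_pos` and `cql_for` are hypothetical, and whole-value matching is assumed to mirror how CQL treats attribute values.

```python
import re

# Patterns copied from the basic notation table above.
BASIC_PATTERNS = {
    "noun": r"(DT)?NN.*",
    "verb": r"VB.*",
    "adjective": r"(DT)?JJ.*",
    "adverb": r"W?RB",
    "conjunction": r"CC",
    "preposition": r"IN",
    "pronoun": r"PRP.?",
    "cardinal number": r"CD",
}

def coarse_pos(tag):
    """Return the first basic category whose pattern matches the whole tag,
    or None if no pattern in the table matches."""
    for name, pattern in BASIC_PATTERNS.items():
        if re.fullmatch(pattern, tag):
            return name
    return None

def cql_for(name):
    """Build a CQL token query for a basic category (hypothetical helper)."""
    return '[tag="{}"]'.format(BASIC_PATTERNS[name])

print(coarse_pos("DTNN"))
print(cql_for("verb"))
```

The optional `(DT)?` prefix is what lets a single pattern cover both plain and determiner-prefixed noun and adjective tags.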
Complete notation
Source: http://nlp.stanford.edu/software/parser-arabic-faq.shtml#d
Arabic text corpora in Sketch Engine
Sketch Engine offers dozens of Arabic language corpora.