A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
Korean HanNanum part-of-speech tagset simplified is available in Korean corpora annotated by the Hannanum tool which is a morphological analyzer and a POS tagger
HanNanum – a Korean Morphological Analyzer and POS Tagger – is used for tagging Korean texts. The 22 tags present configuration of HanNanum demo java class is used to assign tags to words. Some own pre/post processing scripts were used to correct/improve lemmatization, add sentence markup, add English tag names.
An Example of a tag in the CQL concordance search box: [tag="V"]
searches all verbs, e.g. 하는 (note: please make sure that you use straight double quotation marks)
Tagset
ORIGINAL | SIMPLIFIED (in Sketch Engine) | DESCRIPTION | Example |
---|---|---|---|
NOMINALS | |||
NC | N | noun | |
NQ | NP | proper noun | |
NB | N | noun | |
NP | Pron | pronoun | |
NN | Num | numeral | |
PREDICATES | |||
PV | V | verb | |
PA | Adj | adjective | |
PX | VAux | auxiliary verb | |
MODIFIERS | |||
MM | Det | determiner | |
MA | Adv | adverb | |
INTERJECTION | |||
II | Interj | interjection | |
RELATIONAL SUFFIX | |||
JC | Suff | ||
JX | Suff | ||
JP | Suff | ||
ENDING | |||
EP | Suff | ||
EC | Suff | ||
ET | Suff | ||
EF | Suff | ||
AFFIXES | |||
XP | Pref | 접두사 | |
XS | Suff | ||
SYMBOLS | |||
SP | Sym | 쉼표 | |
SL | Sym | 여는 따옴표 및 묶음표 | |
SD | Sym | 이음표 | |
SU | Sym | 단위 기호 | |
SF | Sym | 마침표 | |
SR | Sym | 닫는 따옴표 및 묶음표 | |
SE | Sym | 줄임표 | |
SY | Sym | 기타 기호 | |
FOREIGN WORDS | |||
F | X |
Source: https://www.sketchengine.eu/wp-content/uploads/Original-HanNanum-manual.pdf
or
Use Sketch Engine in minutes
Generating collocations, frequency lists, examples in contexts, n-grams or extracting terms is easy with Sketch Engine. Use our Quick Start Guide to learn it in minutes.