A subcorpus is a smaller part of a corpus that lets you focus on a specific set of data, such as a topic or a year. We have recently made it easier to build multiple subcorpora in user corpora. Find the details at https://www.sketchengine.eu/documentation/create-subcorpora-to-share-with-other-users/
The new Indonesian Corpus 2024 is now available in Sketch Engine. The corpus is enriched with part-of-speech tagging and lemmatization. Perfect for #corpuslinguistics, #digitalhumanities, #linguistics, #lexicography, and #NLP.
🌐 https://t.co/esZSsTzHwj pic.twitter.com/pkP2EX2ri3
— Sketch Engine (@SketchEngine) July 31, 2024
Have you tried the word sense induction feature in Sketch Engine? It is available in the Word Sketch tool and automatically groups collocates according to the senses they belong to. Read more at https://t.co/gpiujQBXnG #corpuslinguistics #textanalysis pic.twitter.com/xzR6KYTs4R
— Sketch Engine (@SketchEngine) July 25, 2024
Exciting news for Hungarian language researchers! The new Hungarian Corpus 2023 is here, featuring 3.4 billion words, genre and topic annotations, and terminology extraction. Enjoy a free trial today! https://t.co/t4uarXccVY #corpuslinguistics #digitalhumanities pic.twitter.com/n0FRYidn8L
— Sketch Engine (@SketchEngine) July 3, 2024