Entries by Jana Bušková

A new Russian corpus, Sketch Engine use case and DMLex.

The new Russian Corpus 2020 is now available in Sketch Engine. The corpus is enriched with part-of-speech tagging and lemmatization. Perfect for #corpuslinguistics, #digitalhumanities, #linguistics, #lexicography, and #nlp. 18 June, 2025

Chinese localization and upcoming Lexicom

Sketch Engine now speaks Chinese! We’re excited to announce that our interface is now available in Chinese. A huge thanks to Liang-Ting Juan from Palacky University in Olomouc for her excellent translation work. https://app.sketchengine.eu #corpuslinguistics #corpusling 7 May, 2025

New Lexonomy with a new guide!

We’ve released a new version of Lexonomy, a free online tool for editing and publishing dictionaries! Have questions or need help? Contact us at support@sketchengine. #dictionary #glossary #lexicography 3 March, 2025

Expand your linguistic research with new corpora!

Let us introduce our newest addition to the corpora list: German Web 2023. This corpus includes 16.6 billion words collected from the web, providing a comprehensive representation of the contemporary German language. https://sketchengine.eu/detenten-german-corpus/… #corpuslinguistics #textanalysis 5 February, 2025

New Year, new data: Maldivian corpus, NLP opportunities, Lexicom

Develop your #lexicography skills at Lexicom 2025 in Bari! A 5-day hands-on course on #dictionary production that combines text corpora with #computationallinguistics. Apply now! https://lexicom.courses/lexicom-2025-bari-italy-lexicography-workshop/ 17 January, 2025

Happy New Year with a bunch of new corpora!

We’re also happy that the MDPI corpus of peer reviews from the @MDPIOpenAccess database is available to everyone through Sketch Engine https://app.sketchengine.eu/#dashboard?corpname=preloaded%2Fmdpi_review #corpuslinguistics #textcorpus terms.sketchengine.eu 9 December, 2024

New corpora, tips and improvement.

Corpus info now includes an overview panel for parallel corpora, showing aligned languages for the selected multilingual corpus, e.g. OpenSubtitles 2018. This feature allows you to quickly see and access other languages aligned with the one you’re viewing. https://app.sketchengine.eu/#dashboard?corpname=preloaded%2FOS_en&corp_info=1 #corpuslinguistics #textanalysis 29 November, 2024

Lexicom 2025, new corpora and features!

Save the dates and register for Lexicom, a 5-day intensive workshop in #lexicography, #corpuslinguistics, and #dictionary building. In Southern Italy in late summer 2025. https://lexicom.courses/lexicom-2025-bari-italy-lexicography-workshop/ 21 October, 2024

Open applications for AK Prize, new corpora and better term extraction

We invite applications for the Adam Kilgarriff Prize 🏆 for outstanding works in #corpuslinguistics, #computationallinguistics, and #lexicography. Apply by 30th September 2024. The Prize will be awarded at the eLex Conference 2025. https://kilgarriff.co.uk/prize/ 16 August, 2024

Enhance your text analysis skills with new Corpora and Tools!

A subcorpus is a smaller part of a corpus that lets you focus on a specific set of data, e.g. a topic or a year. We recently made it easier to build multiple subcorpora in user corpora. Find the details at https://www.sketchengine.eu/documentation/create-subcorpora-to-share-with-other-users/ #CorpusAnalysis #corpusling 26 July, 2024

Discover the new Timeline and other features.

Track how #wordusage and frequency change over time with Sketch Engine’s Timeline Function! Discover trends, uncover new words, and delve into detailed changes in any word or phrase using our Concordance and Wordlist. https://sketchengine.eu/timeline-language-use-in-time/… #corpuslinguistics #textanalysis 10 June, 2024

Term extraction from non-aligned docs, Lexicom 2024, and the largest corpus!

You know we run a powerful #terminology extraction tool – OneClick Terms. However, you may not know this service supports extracting bilingual terms from non-aligned docs. Try uploading documents where one is the translation of the other! #termextraction https://terms.sketchengine.eu 24 April, 2024