Concordance – a tool to search a corpus
…types (metadata) or corpus structures. The CQL search on the advanced tab is used for complex searches with unspecific criteria…
If you are not happy with the results below please do another search
…types (metadata) or corpus structures. The CQL search on the advanced tab is used for complex searches with unspecific criteria…
…own data to build parallel corpora. change search criteria download results view options – display tags, lemmas, structures random sample…
…to complete appear here. You can continue using Sketch Engine and come back here to see the result when you…
…using a new POS tagger (77.63% accuracy), lemmatizer and morph analyser downloaded from http://sivareddy.in/downloads Sketch Engine general reference Adam Kilgarriff,…
…Chinese journalism. The corpus contains data from archives of News Agencies and was prepared by Linguistic Data Consortium (LDC) with…
Patakis is a 100 million word collection of POS-tagged texts mostly downloaded from the Internet, prepared by Milos Husak of…
…concordance – examples in the context of medicine environment text type analysis – statistics of metadata in the corpus Domain…
Environment Corpus – domain-specific corpus The English Environment Corpus is a domain-specific corpus made up of texts in English related…
ukWaC – British Web corpus from the .uk domain The British Web (ukWaC) is an English corpus collected from the…
…{ ARG1 “-” ARG2 “1” DYNAMIC “getnbysep” DYNLIB “internal” DYNTYPE “index” FROMATTR “lempos2” FUNTYPE “ci” } ATTRIBUTE lc { DYNAMIC…
…Thanks to these parameters, users can narrow their search: Domain: the EEC encompasses all the domains and subdomains of environmental…
…and talk pages from Czech Wikipedia (downloaded in April 2017) and texts from the domain .cz of Czech Timestamped web…
…than 40 languages. Data for the Ukrainian Web 2020 corpus consists of texts from May 2014, July–August 2020, and October–December…
…million words. The texts were downloaded between March and April 2021. The sample texts of the biggest web domains which…
…text type analysis – statistics of metadata in the corpus Gujarati Web 2021 (guTenTen21) version gutenten21_gnt (July 2023) 88 million…