Download a corpus
How to download a corpus User corpora, i.e. the corpora which the user builds, can be downloaded. Preloaded corpora cannot…
If you are not happy with the results below please do another search
How to download a corpus User corpora, i.e. the corpora which the user builds, can be downloaded. Preloaded corpora cannot…
frWaC: French corpus from the .fr domain The frWaC corpus is a French text corpus collected from the .fr domain…
…domain. Domain specific corpora built using WebBootCat and Dante lexical database. List of corpora: CAJA (academic journal articles) COMPAS (newspaper…
…Agent-Patient Relation from Corpus With Word Sketches. Proceedings of the 4th Conference on Language, Data and Knowledge: 666-675, 2023. [Download…
…Noun common singular genitive Nc-sg 39871 jala S.com.sg.part Substantiiv apellatiiv singular partitiiv Noun common singular partitive Nc-s1 17027 jalga S.com.sg.ill…
…% ( len(d[‘Gramrels’]), data[‘lemma’], data[‘lpos’], data[‘corpname’])) Sketch Engine uses HTTP REST API. All API methods (unless stated otherwise) expect GET…
…items will be downloaded or displayed (technical limitation of 10 million items will be applied) ● view and download 1,000…
…(source data and vertical files) /corpora/manatee/mycorpus/ (compiled data) each of these directories should contain only the corresponding files, nothing else…
…below the text. Either option will be processed correctly. Download example data on the right. Metadata format There is no…
…S:singular; P:plural; N:invariable noun Position Atribute Values category N:noun 1 type C:common; P:proper 2 gen F:feminine; M:masculine; C:common 3 num…
…can add your one quite easily add new ones (see DYNLIB). Dynamic attributes are created when compiling the corpus using…
…a concrete corpus and the pricing options. Result download There are limits to the download of data generated by the…
…SUBcom @COM comparative subordinator subordinador comparativo komparator Esta fofoqueira fala como uma cachoeira. SUBprd @PRD predicative subordinator (role complementizer) subordinador…
…and from November 2023 to January 2024. The sample texts of the largest web domains which account for 51% of…
…(December 2023) MaCoCu Serbian Web v1 (2021-2022) (December 2023) MaCoCu Bosnian Web v1 (2021-2022) (November 2023) – this corpus has…