idTenTen – Indonesian corpus from the web
idTenTen: Corpus of the Indonesian Web The Indonesian Web Corpus (idTenTen) is an Indonesian corpus made up of texts collected…
If you are not happy with the results below please do another search
idTenTen: Corpus of the Indonesian Web The Indonesian Web Corpus (idTenTen) is an Indonesian corpus made up of texts collected…
…Tagset Indonesian tagset is available in Indonesian corpora annotated by the tool TreeTagger (with the Indonesian parameter file) developed by…
idWaC: Indonesian web corpus The Indonesian web corpus (idWaC) is an Indonesian corpus made up of texts collected from the…
…the CQL concordance search box: [tag=”” & morph=””] searches for cardinal numerals Indonesian and Malaysian_Previous morphology – Apertium Source http://wiki.apertium.org/wiki/Indonesian_and_Malaysian/Previous_morphology…
…in words Indonesian Web (IndonesianWaC) trial 90,120,046 Indonesian Web 2020 (idTenTen20) trial 3,687,192,045 OpenSubtitles 2018 parallel – Indonesian main 77,273,767…
…vietnamese, turkish, chinese-traditional, hindi, telugu, czech, finnish, croatian, italian, swedish, danish, indonesian, chinese-simplified, malayalam, bengali, spanish, estonian, german, arabic, hebrew,…
…Icelandic Web 2020 (isTenTen20) 518,620,759 Igbo Igbo Web 2015 (IgboWaC15) 331,042 Indonesian Indonesian Web (IndonesianWaC) 90,120,046 Irish Irish Web 2022…
…parallel – Icelandic Icelandic main 9,194,074 OpenSubtitles 2018 parallel – Indonesian Indonesian main 77,273,767 OpenSubtitles 2018 parallel – Italian Italian…
…Galician, Georgian, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Malayalam, Norwegian, Persian…
…German tagsets Greek tagsets Hebrew tagset Hindi tagset Hungarian tagsets Indonesian tagset Irish tagset Italian tagset Japanese tagsets Korean tagset…
…deWaC (sdeWaC)), Greek (gkWaC), Gujarati (guWaC) H Hausa (haWaC ), Hebrew (hebWaC), Hindi (hindiWaC) I Igbo (igWaC), Indonesian (idWaC), Italian…
…I Indonesian, Igbo, Sichuan Yi, Iloko, Ingush, Icelandic, Italian J Japanese, Machame, Javanese K Kartuli (Georgian), Kabyle, Kaje (Jju), Kamba,…
…huTenTen (Hungarian web corpus) idTenTen (Indonesian web corpus) isTenTen (Icelandic web corpus) itTenTen (Italian web corpus) jaTenTen (Japanese web corpus)…
…English tagsets Estonian tagsets Finnish tagsets French tagsets German tagsets Greek tagsets Hebrew tagsets Hindi tagset Hungarian tagsets Indonesian tagset…
…ⓧ Indonesian ✓ ✓ full ⓧ Irish ✓ ✓ full ⓧ Italian ✓ ✓ full ✓ Japanese ✓ ✓ full…