CHILDES corpora are comparable corpora made up from transcripts of child language. Most of these transcripts record spontaneous conversational interactions. Often the speakers involved are young monolingual children conversing with their parents or siblings. Corpora also comprise transcripts of bilingual children, older school-aged children, adult second-language learners, children with various types of language disabilities and aphasics who are trying to recover from language loss.
These corpora belong to the child language project within the TalkBank system that was created for sharing and studying conversational interactions. Current CHILDES corpora in Sketch Engine include 24 languages.
CHILDES TalkBank web page is available at http://childes.talkbank.org/
Detailed information about each corpus within CHILDES collection can be found at http://childes.talkbank.org/access/
Availability
CHILDES corpora are accessible to users with a paid subscription, see our price list.
The overview of CHILDES corpora in Sketch Engine
The following list of CHILDES corpora contains link(s) to the particular corpus pages with detailed information. Each corpus page has the link Download transcripts to a zip archive containing a file called 0metadata.cdc where are stored all metadata of the particular corpus.
CHILDES Afrikaans
More information can be found at http://childes.talkbank.org/access/Dutch/ (section Afrikaans)
CHILDES Catalan
More information can be found at http://childes.talkbank.org/access/Biling/Serra.html
CHILDES Croatian
More information can be found at http://childes.talkbank.org/access/Slavic/Croatian/Kovacevic.html
CHILDES Danish
- http://childes.talkbank.org/access/German/
- Klammler: http://childes.talkbank.org/access/Biling/Klammler.html
- Koroschetz: http://childes.talkbank.org/access/Biling/Koroschetz.html
CHILDES English
- http://childes.talkbank.org/access/German/
- Klammler: http://childes.talkbank.org/access/Biling/Klammler.html
- Koroschetz: http://childes.talkbank.org/access/Biling/Koroschetz.html
CHILDES Estonian
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/ (section Estonian)
CHILDES Farsi (Persian)
More information can be found at http://childes.talkbank.org/access/Other/Farsi/Family.html
CHILDES French
More information about particular collections can be found at:
- Rondall http://childes.talkbank.org/access/French/Rondal.html
- Vioncolas http://childes.talkbank.org/access/French/VionColas.html
CHILDES Gaelic (Irish)
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Celtic/Irish/Guilfoyle.html
CHILDES German
More information about particular collections of the corpus can be found at
and
- Klammler collection: http://childes.talkbank.org/access/Biling/Klammler.html
- Koroschetz collection: https://web.archive.org/web/20200222070605/https://childes.talkbank.org/access/Biling/Koroschetz.html (source: wayback machine)
CHILDES Hebrew
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/ (in the section Hebrew)
CHILDES Hungarian
More information about particular collections of the corpus can be found at
- http://childes.talkbank.org/access/Other/ (in the section Hungarian)
- http://childes.talkbank.org/access/narrative.html (MacBates collections)
CHILDES Italian
More information about particular collections can be found at:
- http://childes.talkbank.org/access/Romance/
- http://childes.talkbank.org/access/narrative.html
CHILDES Japanese
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Japanese/
CHILDES Korean
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/EastAsian/Korean/Jiwon.html
CHILDES Norwegian
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Scandinavian/Norwegian/Simonsen.html
CHILDES Polish
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Slavic/Polish/Szuman.html
CHILDES Portuguese
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Romance/ (section Portuguese)
CHILDES Russian
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Slavic/Russian/Protassova.html
CHILDES Spanish
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Spanish/
CHILDES Swedish
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/
CHILDES Tamil
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/Tamil/Narasimhan.html
CHILDES Thai
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/
CHILDES Turkish
More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Frogs/ (search Turkish)
Use Sketch Engine in minutes
Generate collocations, frequency lists, examples in contexts, n-grams or extract terms. Use our Quick Start Guide to learn it in minutes.