The ACLs logo The ACL NLP/CL Universe

ABOUT | SEARCH | SUBSCRIBE | SUBMIT | FEEDBACK
Directory listing for: RESOURCES/CORPORA:
o 1963 Time Magazine corpus
o 2000 NIST Speaker Recognition Evaluation Corpus
o A Syntactically Annotated Corpus of German Newspaper Texts
o A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)
o AOT
o Alpino Treebank
o An Empirical Grammar of the English Verb System
o Annotated list of resources on statistical NLP and corpus-based CL
o Arabic Newswire Part 1
o Arabic first names (female)
o Arabic first names (male)
o BNC Online Service
o BRITISH NATIONAL CORPUS - WORLD EDITION
o Base Textuelle de Moyen Francais
o Bokr Russian Reference Corpus
o Browse the Reuters-21578 collection
o CETEMPUBLICO
o CIRCLE Tutorial Archive
o COMPUTER RESEARCH LABORATORY ANONYMOUS FTP
o CORPUS DEL ESPANOL
o CREA
o CREA
o Collections of texts and corpora
o Corpora at ELSNET
o Corpus Resources (Chulalongkorn University, Thailand)
o Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
o Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
o Corpus del Espanol
o Corpus del Espanol
o Corpus of spoken Bulgarian
o Cranfield collection
o Czech National Corpus
o Danish news corpus
o ELRA Corpus Catalogue
o Edinburgh Associative Thesaurus (EAT)
O English [DIR: 46 entries] ...
o EuroWordNet
o Experimental Corpus Query System (University of Stuttgart, Germany)
o Finnish text bank
O French [DIR: 0 entries] ...
O French [DIR: 0 entries] ...
o GENIA corpus version 3.0p
O German [DIR: 6 entries] ...
o German Corpora, Online Search
o HAITIAN CREOLE ELECTRONIC TEXTS
o HCRC Map Task Corpus XML annotations
o HPSG-based Syntactic Treebank of Bulgarian
o Hansards Corpus - Searchable
o Hebrew Corpora
o Helsinki Corpus of Swahili (HCS)
o ICOPOST
o IMS Corpus Toolbox, Univ. of Stuttgart
o IMS Corpus Workbench (CWB)
o IPI PAN Polish Corpus
o Information Retrieval Laboratory (University of Harbin) Chinese Corpus Resources
o International Corpus of Learner English
o Kiel University's Institute on Phonetics and Speech Procesing
o LANGUAGE LEARNING CENTER - ACADEMIC CORPUS
o Laboratorio de Engenharia da Linguagem - Poruguese corpora
o Laboratorio de Engenharia da Linguagem - Poruguese corpora
o Lacio Web Corpora
o Le corpus BAF (French and English)
o Linguistic Data Consortium (LDC) FTP site
o Links to French corpora
o List of Language Lists (Version 1.C)
o List of stop words
o Lists of Corpora
o MICASE Michigan Corpus of Academic Spoken English
o Manuel Barbera: General Corpora and Corpus Linguistics Resources
o Medlars collection
o Michigan Corpus of Academic Spoken English
o Miscellaneous Word Lists from Oxford University
O Miscellaneous corpora-related URL [DIR: 0 entries] ...
o Morphologically Analyzed and Disambiguated Turkish News Text
O Multilingual [DIR: 39 entries] ...
o Multilingual Text Tools and Corpora
o Name lists from US census
o Nexing Corpus
o OPUS -- An Open Source Parallel Corpus
o On-line books at CMU
o Oxford Text Archive
o Oxford Text Archive Corpus of Italian Newspapers
o Parallel Corpora
o Parallel Corpora (United Nations) from the LDC
o Parallel Corpora from the World Health Organization
o Parallel Texts of Hong Kong Laws
o Penn-Helsinki Parsed Corpus of Middle English
o Polish subcorpus of the International Corpus of Learner English
o Project Gutenberg
o Prototype Corpus of Contemporary Arabic (CCA)
o Ramon Piero Center for Research
o Reuters Corpus
o Romanian NLP
O Russian [DIR: 0 entries] ...
o Russian Corpora
o Russian Corpora
o Russian Corpus Page
o Russian Corpus Site
o Russian Corpus Site
o Russian Newspaper Corpus
o Russian Newspaper Corpus
o Russicon Resources
o Sanskrit Library
o ShATR - a multi-simultaneous-speaker corpus
o Slovene-English Parallel Corpus
O Spanish [DIR: 0 entries] ...
o Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio
o Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio
o Stop List
o Survey of Electronic Corpora (by Jane A. Edwards, file at CMU)
o Survey of English Usage, University College, London
O Swedish [DIR: 3 entries] ...
o Switchboard Transcription Project
o TELRI Research Archive of Computational Tools and Resources
o TRAINS93 Dialog transcripts
o Terminology for more than 15 languages
o The British National Corpus
o The British National Corpus Survey: An Edited Letter from Lou Burnard
o The CORPORA DataCenter (Norway)
o The Childes Corpus - Children's language
o The International Corpus of English
o The Moby Corpus
o The Moby corpus
o The Oslo Corpus of Bosnian Texts
o The Probert Encyclopedia
o The Reading Academic Text Corpus
o The Sketch Engine
o The Sofie Treebank - A Parallel Treebank of North European Languages
o The bank of English
o Top 10 words used on Usenet
o Towards a Corpus of Corrected Student Translations
o Treebank tokenization scheme
o Voice of America (VOA) Czech Broadcast News Audio
o Voice of America (VOA) Czech Broadcast News Transcript Corpus
o Word frequency lists
o a corpus of student-advisor advising sessions (by Michael Elhadad)
o list of Japanese transitive - intransitive verb pairs

UP


Total number of entries in system: 4260 , Last updated: Mon Aug 14 14:03:55 EDT 2006