-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2100 BCE-1700 BCE
Author(s):
Unknown author
Description:
This edition of the ETCSL is an expansion, revision and enhancement of the first-time deposit of the corpus (currently not available). The Electronic Text Corpus of Sumerian Literature
...
This item contains 11 files (4.91 MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2020
Description:
The CorCenCC corpus contains over 11 million words (circa 14.4m tokens) from written, spoken and electronic (online, digital texts) Welsh language sources, taken from a range of genres, language varieties (regional and ...
This item contains 1 file (49.41 KB).
Publicly Available
-
-
Collection, Sound
Oxford Text Archive Core Collection
Date of publication:
2015
Description:
The resource is a speech corpus, with digital audio files, text transcripts, and files containing time stamps of the phoneme boundaries.
1813 .wav files containing spoken utterances.
...
This item contains 4 files (1.98 MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2004
Description:
Mode of access: Online. OTA website. The rudimentary form of the Sheffield Corpus of Chinese contains a limited body of representative texts from the Medieval (MedC) and Modern Chinese (ModC) periods. They are of two text types: ...
This item contains 2 files (145.39 KB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2004
Description:
The BAWE corpus contains 2761 pieces of proficient assessed student writing, ranging in length from about 500 words to about 5000 words. Holdings are fairly evenly distributed across four broad disciplinary ...
This item contains 2 files (107.9 MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2004
Author(s):
Unknown author
Description:
The Lancaster Corpus of Mandarin Chinese (LCMC) is designed as a Chinese match for the FLOB and FROWN corpora for modern British and American English. The corpus is suitable for use in both monolingual research into modern ...
This item contains 2 files (6.34 MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2003
Author(s):
Unknown author
Description:
The collection consists of: Thirty million words of monolingual written data (Gujarati, Tamil, Hindi, Punjabi - news website articles); 600,000 words of monolingual spoken data (Hindi, Urdu, Punjabi, Bengali, Gujarati - radio ...
This item contains 10 files (108.26 MB).
Publicly Available
-
-
Collection, Sound, Text
Oxford Text Archive Core Collection
Date of publication:
2001
Description:
The four major objectives of the project were: i) to establish an electronic corpus of (a) conversations, from the British National Corpus (BNC) and (b) oral narratives, from Lancaster's Centre for North Western Regional ...
This item contains 2 files (2.03 MB).
-
-
Text
Oxford Text Archive Core Collection
Date of publication:
2000 BCE-1600 BCE
Author(s):
Unknown author
Description:
The source for this file is a relational database created by the ETCSL team, containing detailed records of all compositions in the Sumerian literary corpus. Bibliographic and museum catalogue information recorded in SGML ...
This item contains 2 files (5.08 MB).
Academic Use
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1994-11-01
Author(s):
Unknown author
Description:
Subset of the Brown corpus of American English: [1961]
This item contains 3 files (1.46 MB).
Academic Use
-