Show simple item record

Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English

 
dc.contributor Coltheart, Max School of Behavioural Science Macquarie University Sydney
dc.contributor.editor Coltheart, Max
dc.contributor.editor Kucera, Henry
dc.coverage.placeName s.l.
dc.date.accessioned 2018-07-27
dc.date.accessioned 2019-07-04T11:04:42Z
dc.date.available 2019-07-04T11:04:42Z
dc.date.created 1961
dc.identifier ota:0668
dc.identifier.citation http://purl.ox.ac.uk/ota/0668
dc.identifier.uri http://hdl.handle.net/20.500.12024/0668
dc.description.abstract In English “The corpus consists of approximately 1,014,000 graphic words of running text, all of which was first printed in the United States in the year 1961” Frequency analysis of English usage Publication based on OTA text: Computational analysis of present-day American English / by Henry Kuçera and W. Nelson Francis. -- Providence [RI] : Brown University, 1967. -- pp. xvii-xxv Publication based on OTA text: Frequency analysis of English usage : lexicon and grammar / by Henry Kuçera and W. Nelson Francis. -- Boston : Houghton Mifflin, 1982. -- pp. 3-15. -- “Available with prior consent of depositor for research purposes only”. -- United States Office of Education. Cooperative Research Project No. E-007. -- OTA 0402
dc.format.extent Text data (3 files : ca. 1095, 5, 8 KB)
dc.format.medium Digital bitstream
dc.language English
dc.language.iso eng
dc.publisher University of Oxford
dc.relation.ispartof Legacy Collection Digital Museum
dc.rights Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.rights.label PUB
dc.subject.lcsh Computational linguistics -- Australia
dc.subject.lcsh Anthologies -- United States
dc.subject.other Anthologies
dc.title Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English
dc.type Text
has.files yes
branding Oxford Text Archive
files.size 1135278
files.count 3
otaterms.date.range 1900-1999

This item is
Publicly Available
and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)

 Files for this item

 Download all local files for this item (1.08 MB)

Icon
Name
header0668.xml
Size
6.93 KB
Format
XML
Description
METADATA
 Download file
Icon
Name
kuceradat-0668.txt
Size
1.07 MB
Format
Text file
Description
Version of the work in plain text format
 Download file  Preview
 File Preview  
1 01 001 .0044**K 1 01 001 .01 1 01 001 .020 2 01 001 .027 1 01 001 .028 1 01 001 .05 1 01 001 .05**K 3 01 001 .07 1 01 001 .076 1 01 001 .09 1 01 001 .1 1 01 001 .130 1 01 001 .143 1 01 001 .179 12 02 002 .22 3 03 003 .22-CALIBER 1 01 001 .222'S 1 01 001 .243 1 01 001 .255 2 01 001 .264 ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ -^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^^ ^ ^ ^ ^ ^ ^ ^ ^ ^^ ^^ ^ ^ ^ ^^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ` ^ ^ ^ '- ^ ^ ^ ^ ^ 4 01 002 .45 1 01 001 .45-CALIBER 1 01 001 .455 2 01 001 .458 2 02 002 .5 1 01 001 .50 1 01 001 .500 1 01 001 .7 1 01 001 .75 1 01 001 .7854 1 01 001 (*=A,B*$) 139 12 039 + 1 01 001 +.04 1 01 001 +.50 1 01 001 +.7 1 01 001 +C 1 01 001 $0.9 1 . . .
Icon
Name
kuceradoc-0668.txt
Size
7.49 KB
Format
Text file
Description
Version of the work in plain text format
 Download file  Preview
 File Preview  
KUCERA (Kucera-Francis Word-frequency Count) Notes provided by Roger Mitton, Dept of Computer Science, Birkbeck College, Malet Street, London WC1E 7HX November 1984 KUCERA contains over 50,000 entries from the Kucera-Francis Frequency Count of items in the corpus of text collected at Brown University (commonly referred to as the Brown Corpus). Details of the corpus are given in 'Computational Analysis of Present-day American English' by Henry Kucera and W. Nelson Francis, Brown University Press, 1967, and also in 'Frequency Analysis of English Usage: Lexicon and Grammar' by the same authors, published by Houghton Mifflin, 1982. The following is from the latter book: 'The corpus consists of approximately 1,014,000 graphic words of running text, all of which was first printed in the United States in the year 1961. The text is divided into five hundred samples of about two thousand words each, which are assigned . . .

Show simple item record