HelexKids: a word frequency database for Greek and Cypriot primary school children

Aris R. Terzopoulos (Lead / Corresponding author), Lynne G. Duncan, Mark A. J. Wilson, Georgia Z. Niolaki, Jackie Masterton

Research output: Contribution to journal (article)

Abstract

In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children.
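
Several of the measures listed in the abstract have widely used textbook definitions, and a small sketch may help readers unfamiliar with them. The Python below is illustrative only and is not the authors' implementation: it assumes the common conventions that the Zipf value is log10 of frequency per million plus 3, that contextual diversity is the number of documents containing a word, and that orthographic Levenshtein distance is built on the pairwise edit distance between two spellings (how HelexKids aggregates that distance is not stated in the abstract). The function names and the toy corpus are hypothetical.

```python
import math


def frequency_per_million(count, total_tokens):
    """Scale a raw count to occurrences per million tokens."""
    return count * 1_000_000 / total_tokens


def zipf(count, total_tokens):
    """Zipf value under the assumed convention: log10(frequency per million) + 3."""
    return math.log10(frequency_per_million(count, total_tokens)) + 3


def contextual_diversity(word, documents):
    """Count the documents (token lists) in which the word occurs at least once."""
    return sum(1 for doc in documents if word in doc)


def levenshtein(a, b):
    """Pairwise edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]


# Hypothetical toy corpus: each inner list stands in for one textbook.
toy_documents = [
    ["γάτα", "σπίτι", "γάτα"],
    ["σκύλος", "σπίτι"],
    ["γάτα", "νερό"],
]
```

With this toy corpus, `contextual_diversity("γάτα", toy_documents)` counts two of the three documents, even though the word occurs three times in total, which is the distinction between contextual diversity and raw frequency.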
Original language: English
Pages (from-to): 83-96
Number of pages: 14
Journal: Behavior Research Methods
Volume: 49
Issue number: 1
Early online date: 28 Jan 2016
DOI: 10.3758/s13428-015-0698-5
Publication status: Published - Feb 2017

Fingerprint

Cyprus
Databases
Education
Language Development
Textbooks
Greece
Teaching
Research Personnel
Data Base
Word Frequency
Primary Education
Primary School
School children
Writer
Educators
Contextual
Orthographic
Lemma
Levenshtein Distance

Keywords

  • Word database
  • Greek language
  • Children
  • Frequency
  • Contextual diversity

Cite this

Terzopoulos, Aris R.; Duncan, Lynne G.; Wilson, Mark A. J.; Niolaki, Georgia Z.; Masterton, Jackie. / HelexKids: a word frequency database for Greek and Cypriot primary school children. In: Behavior Research Methods. 2017; Vol. 49, No. 1. pp. 83-96.
@article{5ac1d3e890e94c4585826c1f1e5e5ba5,
title = "HelexKids: a word frequency database for Greek and Cypriot primary school children",
abstract = "In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children.",
keywords = "Word database, Greek language, Children, Frequency, Contextual diversity",
author = "Terzopoulos, {Aris R.} and Duncan, {Lynne G.} and Wilson, {Mark A. J.} and Niolaki, {Georgia Z.} and Masterton, Jackie",
year = "2017",
month = "2",
doi = "10.3758/s13428-015-0698-5",
language = "English",
volume = "49",
pages = "83--96",
journal = "Behavior Research Methods",
issn = "1554-351X",
publisher = "Springer Verlag",
number = "1",

}

HelexKids: a word frequency database for Greek and Cypriot primary school children. / Terzopoulos, Aris R. (Lead / Corresponding author); Duncan, Lynne G.; Wilson, Mark A. J.; Niolaki, Georgia Z.; Masterton, Jackie.

In: Behavior Research Methods, Vol. 49, No. 1, 02.2017, p. 83-96.

TY - JOUR

T1 - HelexKids

T2 - a word frequency database for Greek and Cypriot primary school children

AU - Terzopoulos, Aris R.

AU - Duncan, Lynne G.

AU - Wilson, Mark A. J.

AU - Niolaki, Georgia Z.

AU - Masterton, Jackie

PY - 2017/2

Y1 - 2017/2

N2 - In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children.

AB - In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children.

KW - Word database

KW - Greek language

KW - Children

KW - Frequency

KW - Contextual diversity

U2 - 10.3758/s13428-015-0698-5

DO - 10.3758/s13428-015-0698-5

M3 - Article

C2 - 26822666

VL - 49

SP - 83

EP - 96

JO - Behavior Research Methods

JF - Behavior Research Methods

SN - 1554-351X

IS - 1

ER -