UROP Project

Keystroke Logging in Second Language Writing Research

Contact

Name

Daniel Holder

Program Director UROP

Telephone

+49 241 80-90695

E-Mail

Key Info

Basic Information

Project Offer-Number:
1002
Category:
UROP International, RWTH UROP
Field:
Computer Science
Faculty:
7
Organisation unit:
Department of English Linguistics
Language Skills:
fluency in English or German
Computer Skills:
Java or Python

Recent years have seen growing interest in the use of keystroke logging for identifying writing strategies and their underlying cognitive processes (see, e.g., Leijten and Van Waes, 2013, for an overview). Keystroke logging programs timestamp keystroke activity, making it possible to reconstruct text production processes. The basic assumption is that analyses of pauses (length, number, distribution, location, etc.) and revisions (number, type, operation, embeddedness, location, etc.) are indicative of the cognitive effort involved. One of the main challenges is to link the logging data to the underlying cognitive processes. The goal of the project is to analyze a combination of cross-sectional and longitudinal keystroke logging data collected at our university to gain insights into the writing processes and strategies of second language learners of English.
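To make the idea of pause analysis concrete, here is a minimal Python sketch that summarises pauses from a list of event timestamps. It assumes the timestamps are in milliseconds and sorted in ascending order, and it uses a 2000 ms threshold purely for illustration; the function name `pause_stats` and the threshold value are assumptions and not part of the project's actual pipeline.

```python
from statistics import mean


def pause_stats(timestamps_ms, threshold_ms=2000):
    """Summarise pauses between consecutive logged events.

    timestamps_ms: ascending event times in milliseconds (assumed unit).
    threshold_ms: minimum gap counted as a pause (illustrative value).
    """
    # Gap between each event and the one that follows it.
    intervals = [b - a for a, b in zip(timestamps_ms, timestamps_ms[1:])]
    # Keep only gaps long enough to count as pauses.
    pauses = [gap for gap in intervals if gap >= threshold_ms]
    return {
        "n_events": len(timestamps_ms),
        "n_pauses": len(pauses),
        "mean_pause_ms": mean(pauses) if pauses else 0.0,
        "longest_pause_ms": max(pauses, default=0),
    }


# Made-up timestamps for demonstration only.
example = [0, 180, 350, 3100, 3260, 8400]
stats = pause_stats(example)
print(stats)
```

In practice the threshold would be chosen according to the research question, and pause location (e.g. within or between words) would need the text itself in addition to the timestamps.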

Task

Your task will involve preprocessing keystroke logging data and subsequent data analysis. For each individual text, you will receive a list of Etherpad changesets and their corresponding timestamps (see https://github.com/ether/etherpad-lite/wiki/Changeset-Library).
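As a rough idea of what preprocessing a changeset could look like, the following Python sketch unpacks a single changeset string into its operations. It reflects one reading of the format described in the linked Changeset Library page: a header `Z:<old length>` followed by `>` or `<` and the length change, then a sequence of keep (`=`), insert (`+`), and delete (`-`) operations, then `$` and the inserted characters, with all numbers in base 36. The function name `parse_changeset` and the returned dictionary layout are hypothetical, and the sketch ignores attribute details.

```python
import re

# Header: Z:<old length><'>' or '<'><length change>, numbers in base 36.
HEADER = re.compile(r"Z:([0-9a-z]+)([><])([0-9a-z]+)")
# Operation: optional attributes (*n), optional newline count (|n),
# an opcode (= keep, + insert, - delete), and a character count.
OP = re.compile(r"((?:\*[0-9a-z]+)*)(?:\|([0-9a-z]+))?([-+=])([0-9a-z]+)")


def parse_changeset(cs):
    """Split a changeset string into lengths and a list of operations."""
    header = HEADER.match(cs)
    if header is None:
        raise ValueError("not a changeset: %r" % cs)
    old_len = int(header.group(1), 36)
    delta = int(header.group(3), 36)
    if header.group(2) == "<":
        delta = -delta

    # Everything after the first '$' is the bank of inserted characters.
    sep = cs.find("$")
    if sep == -1:
        ops_part, bank = cs[header.end():], ""
    else:
        ops_part, bank = cs[header.end():sep], cs[sep + 1:]

    ops = []
    bank_pos = 0
    for m in OP.finditer(ops_part):
        opcode = m.group(3)
        n_chars = int(m.group(4), 36)
        text = ""
        if opcode == "+":
            # Inserted text is read sequentially from the character bank.
            text = bank[bank_pos:bank_pos + n_chars]
            bank_pos += n_chars
        ops.append((opcode, n_chars, text))

    return {"old_len": old_len, "new_len": old_len + delta, "ops": ops}


# Sample changeset string used for illustration.
result = parse_changeset("Z:5g>1|5=2p=v*4*5+1$x")
print(result)
```

Pairing each parsed changeset with its timestamp would then give a sequence of timed insertions and deletions, which is the kind of input that pause and revision analyses need.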

Requirements

Ideally, you have an interest in natural language processing and experience working with language data.