UROP Project

Automatic Assessment of Text Complexity

Contact

Name

Daniel Holder

Program Director UROP

Telephone

workPhone
+49 241 80-90695

E-Mail

Key Info

Basic Information

Project Offer-Number:
1001
Category:
UROP International, RWTH UROP
Field:
Computer Science
Faculty:
7
Organisation unit:
Department of English Linguistics
Language Skills:
fluency in English or German
Computer Skills:
JavaScript or Python

Advances in natural language processing have paved the way for the development of computational tools designed to automatically assess the complexity of spoken and written language samples. There are a variety of computational tools available that afford speed, flexibility and reliability and permit the direct comparison of numerous indices of language complexity. Considerable gains have been made from the use of such tools with respect to identification of reliable and valid measures of linguistic complexity to predict text readability, writing quality or speaking proficiency. However, all these available tools only generate for each complex measure a single score (summary statistics) that represents the global complexity of a language sample. We have developed a computational tool that implements a sliding-window approach to track changes in complexity within a language sample derived from a series of measurements obtained by the tool.

Task

In its current version, the tool supports 32 complexity measures derived from language acquisition and processing research. The tool was designed with extensibility in mind, so that additional complexity measures can easily be integrated. Your task is to work on the further development of the tool including the integration of new complexity measures and the development of a platform independent desktop application.

Requirements

Interest in natural language processing, good programming skills ideally in JavaScript or Python