UROP Project
Automatic Assessment of Text Complexity
Contact
Key Info
Basic Information
- Project Offer-Number:
- 1001
- Category:
- UROP International, RWTH UROP
- Field:
- Computer Science
- Faculty:
- 7
- Organisation unit:
- Department of English Linguistics
- Language Skills:
- fluency in English or German
- Computer Skills:
- JavaScript or Python
Advances in natural language processing have paved the way for the development of computational tools designed to automatically assess the complexity of spoken and written language samples. There are a variety of computational tools available that afford speed, flexibility and reliability and permit the direct comparison of numerous indices of language complexity. Considerable gains have been made from the use of such tools with respect to identification of reliable and valid measures of linguistic complexity to predict text readability, writing quality or speaking proficiency. However, all these available tools only generate for each complex measure a single score (summary statistics) that represents the global complexity of a language sample. We have developed a computational tool that implements a sliding-window approach to track changes in complexity within a language sample derived from a series of measurements obtained by the tool.
Task
In its current version, the tool supports 32 complexity measures derived from language acquisition and processing research. The tool was designed with extensibility in mind, so that additional complexity measures can easily be integrated. Your task is to work on the further development of the tool including the integration of new complexity measures and the development of a platform independent desktop application.
Requirements
Interest in natural language processing, good programming skills ideally in JavaScript or Python