SEQCLAS

Key Info

Basic Information

Coordinator:
Prof. Dr. Hermann Ney
Faculty / Institution:
Mathematics, Computer Science and Natural Sciences
Organizational Unit:
Computer Science 6 (Human Language and Pattern Recognition)
Pillar:
Excellent Science
Project duration:
01.08.2016 to 31.07.2021
EU contribution:
2.500.000 euros
  EU flag and ERC logo This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 694537)  

Title

A Sequence Classification Framework for Human Language Technology

Concept

This project will develop a unifying framework of novel methods for sequence classification and thus make a major break-through in automatic speech recognition and machine translation, advancing these areas of human language technology (HLT) beyond state-of-the-art. Despite the huge progress made in the field, the specific aspect of sequence classification has not been addressed adequately in the past research in these disciplines and remains a big challenge. The proposed project will provide a novel framework under consistent consideration of the leading aspect of sequence classification. It will break the ground for a deeper, more comprehensive foundation for sequence classification and pave the way for a new generation of algorithms that will put human language technology on a more solid basis and that will accelerate progress in the field across several disciplines.

The leading research objectives are: 1. A novel theoretical framework for sequence classification. 2. Consistent sequence modeling across training and testing, which is specifically lacking in machine translation. 3. Adequate sequence-level performance-aware training criteria to learn the free parameters of the models. 4. Investigation of (true) unsupervised training for HLT sequence classification: its principles, its prerequisites, its limitations and its practical usage. The study of these four problems will provide key enabling techniques for HLT sequence classification in general that will carry over to and create high impact on the areas of speech recognition, machine translation and handwritten text recognition. Using our top-ranking research prototype systems, we will verify the validity and effectiveness or our research on public international benchmarks.