ARM-1 2nd in PolEval 2019 ASR competition

ARM-1 engine was one of the systems competing within Task 5: Automatic Speech Recognition at PolEval 2019. PolEval is a series of competitions organized every year for natural language processing tools for Polish. The event was inspired by SemEval International Workshop on Semantic Evaluation.


Various tools and system compete against each other within certain tasks, selected each year by the organizers. This year one of the tasks was to transcribe audio recordings of speech in a noisy environment and was designed for Large-Vocabulary Continuous Speech Recognition (LVSCR) systems. ARM-1 was submitted in open competition subtask for systems that could use any training data, as opposed to fixed competition for systems which could use only a few corpora listed by the organizers. In both cases, this year speech recordings from lower and higher house of the Polish Parliament were excluded from data used for system training, since a subset of these recordings was used as a test set. During evaluation phase only systems developed by the competitors could have been used.

The test set consisted of 29 recordings of various speakers with a total duration of nearly 48 minutes and the results were evaluated by determining WER (Word Error Rate). ARM-1 had the best result in its subtask with WER = 26,4% and correctness of 77% over all recordings, and took second place among all competing systems.

news