---
language:
- da
license: cc0-1.0
tasks:
- automatic-speech-recognition
datasets:
- common_voice_8_0
metrics:
- wer
model-index:
- name: kblab-voxrex-wav2vec2-large-cv8-da
  results:
  - task: 
      type: automatic-speech-recognition
    dataset:
      type: mozilla-foundation/common_voice_8_0
      args: da
      name: Danish Common Voice 8.0
    metrics:
      - type: wer
        value: 30.51
  - task: 
      type: automatic-speech-recognition
    dataset:
      type: Alvenir/alvenir_asr_da_eval
      name: Alvenir ASR test dataset
    metrics:
      - type: wer
        value: 28.33
---

# KBLab-VoxRex-Wav2vec2-large-CV8-da

## Model description

This model is the Swedish acoustic model [KBLab/wav2vec2-large-voxrex](https://huggingface.co/KBLab/wav2vec2-large-voxrex), fine-tuned on the Danish part of [Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0), which contains roughly 6 hours of crowdsourced, read-aloud Danish speech.


## Performance

The model achieves the following word error rates (WER, in percent; lower is better):

| **Dataset** | **WER without LM** | **WER with 5-gram LM** |
| :---:   | ---: | ---: |
| [Danish part of Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0/viewer/da/train) | 37.63 | 30.51 |
| [Alvenir test set](https://huggingface.co/datasets/Alvenir/alvenir_asr_da_eval) | 35.75 | 28.33 |
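
For readers unfamiliar with the metric, the following is a minimal sketch of how a word error rate can be computed and how the two columns above compare. It is not the evaluation script used for this model: the function names `word_error_rate` and `relative_wer_reduction` are hypothetical, the Danish example sentence is made up for illustration, and text normalisation (casing, punctuation), which affects real WER scores, is assumed to have been done beforehand.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative helper only; assumes both strings are already normalised.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Dynamic-programming row: distance from an empty reference prefix
    # to each prefix of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_reduction(wer_without_lm: float, wer_with_lm: float) -> float:
    """Fraction of the no-LM error rate that the language model removes."""
    return (wer_without_lm - wer_with_lm) / wer_without_lm


# One dropped word out of four reference words gives a WER of 0.25 (25 %).
example_wer = word_error_rate("jeg bor i danmark", "jeg bor danmark")

# Relative reductions computed from the table above.
cv8_reduction = relative_wer_reduction(37.63, 30.51)      # about 0.189
alvenir_reduction = relative_wer_reduction(35.75, 28.33)  # about 0.208
```

Under these assumptions, the 5-gram LM removes roughly 19 % of the errors on Common Voice 8.0 and roughly 21 % on the Alvenir test set, relative to decoding without an LM.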