Commit 097ffc5
Parent(s): de27b68
Update README.md
README.md CHANGED
@@ -240,7 +240,7 @@ The tokenizers for these models were built using the text transcripts of the tra
 
 The model was trained on 64K hours of English speech collected and prepared by NVIDIA NeMo and Suno teams.
 
-The training dataset consists of private subset with 40K hours of English speech plus
+The training dataset consists of private subset with 40K hours of English speech plus 24K hours from the following public datasets:
 
 - Librispeech 960 hours of English speech
 - Fisher Corpus