OpenGPT-X research project
Large AI language model published
The large AI language model developed in the OpenGPT-X research project is now available for download on Hugging Face: "Teuken-7B" was trained from scratch on all 24 official languages of the EU and has 7 billion parameters.
The bar chart shows the performance of Teuken-7B-instruct-research-v0.4 on the multilingual benchmarks ARC, HellaSwag and TruthfulQA in comparison with other open-source models of similar size. The bars show the score for each benchmark averaged over 21 European languages, together with the mean across all three benchmarks. On that mean, Teuken-7B-instruct-research-v0.4 is ahead of all other models in this selection. On the individual benchmarks, Teuken ranks second behind Salamandra-7b-instruct on ARC and HellaSwag, and second behind Mistral-7B-instruct-v0.3 on TruthfulQA. © Fraunhofer IAIS

