OELLM's Deliverables
Find the project's structure and organization plan.
- 31JUL2025Initial catalogue and analytics reports for existing training datasets
A catalogue of training datasets per language along with automatic analytics reports.
- 31JUL2025Communication, Dissemination and Exploitation Strategy
Full communication and dissemination and exploitation strategy and schedule of activities to implement during the project duration.
- 31JUL2026Initial dataset release
Texts (with metadata) used to train the OpenEuroLLM model available at mid-project.
- 31JUL2026First models
Initial release of LLM models (tokenizers and model weights).
- 31JUL2026Evaluation Code package
Code package in Python containing model evaluation procedures. The package will be open-sourced on the proposed due date after having iterated on feedback provided by other WPs.
- 31JUL2027Final dataset release
Texts (with metadata) used to train the final OpenEuroLLM model(s).
- 31JAN2028Stakeholder Report
Written report on strategic advice of the OSPB and community feedback on the development of OpenEuroLLM.
- 31JAN2028Final models
Final release of LLM models (tokenizers and model weights).
- 31JAN2028LLM training report (other tasks)
Final report on model training, including all necessary details for open publishing and regulatory compliance.
- 31JAN2028Evaluation Report
Technical report on the work made in the Evaluation workpackage. It will include our findings in evaluating LLMs on multilingual and regulatory aspects.
- 31JAN2028Evaluation Report of Communication, Dissemination and Exploitation Strategy
Evaluation of the impact of the overall strategy in aspects of dissemination, exploitation and communication.