Alliance for Language Technologies (SG)

Opened

Programme Category

EU Competitive Programmes

Programme Name

Digital Europe Programme

Programme Description

The Digital Europe Programme is the first EU programme that aims to accelerate the recovery and drive the digital transformation of Europe.

Worth €7.6 billion (in current prices), the Programme is part of the long-term EU budget (the Multiannual Financial Framework) and covers 2021 to 2027. It will provide funding for projects in five crucial areas: supercomputing, artificial intelligence, cybersecurity, advanced digital skills, and ensuring the wide use of digital technologies across the economy and society.

The Programme is designed to bridge the gap between research on digital technologies and their deployment, and to bring the results of research to the market for the benefit of Europe’s citizens and businesses, in particular SMEs. Investments under the Digital Europe Programme support the Union’s twin objectives of a green transition and digital transformation and strengthen the Union’s resilience and strategic autonomy.

Programme Details

Identifier Code

DIGITAL-2024-AI-06-LANGUAGE-01

Call

Alliance for Language Technologies (SG)

Summary

This action will directly contribute to preserving linguistic and cultural diversity in Europe while effectively implementing the objectives of the European Common Data Infrastructure and Service multi-country project (MCP) in the area of language technologies. It will have a strong impact on the deployment of large language foundation models and their applications, such as generative AI.

Detailed Call Description

This call addresses the first work strand which will support the language data collection and the adaptation of existing large language foundation models to specific languages, domains or industries, so as to support the onboarding of the latest language technologies by European actors.

Data: Leveraging the Common European Language Data Space and other relevant Data Spaces, this activity will, in compliance with the applicable legislation (e.g. the Copyright Directive (EU) 2019/790 and the GDPR, Regulation (EU) 2016/679), gather the necessary language data (text, audio, image and other modalities) from a broad array of European industrial, academic and institutional actors. It will provide data in sufficient quality and quantity to build large language foundation models, ensuring coherent coverage of all the official languages of the Member States as well as the most socially and economically relevant ones. This will also include providing the data required to adapt such large language foundation models to specific languages, domains or industries. The action will also provide a repository of existing European large language foundation models as well as models adapted to specific languages, domains or industries. Once sufficiently advanced, the consortium may consider working on a future copyright infrastructure and related issues to allow efficient use of language and other data, while taking into account the interests of the rightsholders.

Fine-tuning:

  • This activity will also provide large language models fine-tuned to specific languages, domains or industries as a result of further training of large language foundation models on specific language data. This process involves adapting, evaluating and optimizing foundation models for specific languages, domains or industries. It will facilitate the efficient deployment of these models across various industries, requiring less task-specific data compared to building models from scratch, which is particularly advantageous for SMEs. The action will also include the support for the ongoing maintenance and enhancement of these models, ensuring their adaptability to evolving tasks and domains over time.
  • This activity will also provide, including through Financial Support to Third Parties, dedicated support and services, in particular for SMEs, to facilitate the fine-tuning of available models. These support and services will give third parties an infrastructure to fine-tune and evaluate existing models for their own purposes.

The EuroHPC Joint Undertaking would provide access to their facilities for the adaptation and fine-tuning of the models when necessary.

Call Total Budget

€20,000,000

Financing percentage by EU or other bodies / Level of Subsidy or Loan

50%

Maximum grant amount per project: €20,000,000

Thematic Categories

  • Research, Technological Development and Innovation

Eligibility for Participation

  • Large Enterprises
  • Legal Entities
  • NGOs
  • Non Profit Organisations
  • Other Beneficiaries
  • Private Bodies
  • Researchers/Research Centers/Institutions
  • Semi-governmental organisations
  • Small and Medium Enterprises (SMEs)
  • State-owned Enterprises

Eligibility For Participation Notes

In order to be eligible, the applicants (beneficiaries and affiliated entities) must:

  • be legal entities (public or private bodies)
  • be established in one of the eligible countries, i.e.:
    • EU Member States (including overseas countries and territories (OCTs))
    • non-EU countries:

Targeted stakeholders: The consortium that will carry out this action should be composed of representatives of Member States; public and private organisations, SMEs and RTOs; entities with access to large compute capacities; and public and/or private data providers, such as the media or publishing industry.

Consortium composition: minimum 3 independent applicants (beneficiaries; not affiliated entities) from 3 different eligible countries OR minimum 1 European digital infrastructure consortium (EDIC) composed of at least 3 Member States.

Call Opening Date

29/02/2024

Call Closing Date

29/05/2024

National Contact Point(s)

Ministry of Research, Innovation and Digital Policy
Directorate of Research and Innovation
Eleana Gabriel
Telephone: +357 22 691918
Email: egabriel@dmrid.gov.cy

EU Contact Point

For help related to this call, please visit the following link: Write to us | European Union (europa.eu)