
Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning
Name of the Project: Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning
Acronym: DAPHNE
Project fund: EU programme H2020 EU.2.1.1. – INDUSTRIAL LEADERSHIP – Leadership in enabling and industrial technologies – Information and Communication Technologies (ICT)
- Project Website at project funder (in CORDIS):
https://cordis.europa.eu/project/id/957407- Reference number: 957407
- Topic: ICT-51-2020 – Big Data technologies and extreme-scale analytics
- Funding Scheme: RIA – Research and Innovation action
- Call for proposal: H2020-ICT-2018-20
- Field of science:
- /natural sciences/chemical sciences/analytical chemistry/quantitative analysis
- /humanities/languages and literature/languages – general
- /natural sciences/computer and information sciences/artificial intelligence/machine learning
- /natural sciences/computer and information sciences/data science/data analysis
- Time frame: 01. 12. 2020 – 30. 11. 2024
- Total costs: 6,609,665.00 €
- Co-funding rate (in %): 100 %
- The amount of co-financing at UM FERI (UM FERI share): 244 975 €
- UM FERI Coordinator: Assoc. Prof. Dr. Aleš Zamuda
Project Coordinator: KNOW-CENTER GMBH RESEARCH CENTER FOR DATA-DRIVEN BUSINESS & BIG DATA ANALYTICS (Austria)
Project Partners:
- DEUTSCHES ZENTRUM FUR LUFT – UND RAUMFAHRT EV
- EIDGENOESSISCHE TECHNISCHE HOCHSCHULE ZUERICH
- HASSO-PLATTNER-INSTITUT FUR DIGITAL ENGINEERING GGMBH
- INSTITUTE OF COMMUNICATION AND COMPUTER SYSTEMS
- INFINEON TECHNOLOGIES AUSTRIA AG
- INTEL TECHNOLOGY POLAND SPOLKA Z OGRANICZONA ODPOWIEDZIALNOSCIA
- IT-UNIVERSITETET I KOBENHAVN
- KAI KOMPETENZZENTRUM AUTOMOBIL – UND INDUSTRIEELEKTRONIK GMBH
- TECHNISCHE UNIVERSITAET DRESDEN
- UNIVERZA V MARIBORU
- UNIVERSITAT BASEL
- TECHNISCHE UNIVERSITAT BERLIN
Project Summary
Modern data-driven applications leverage large, heterogeneous data collections to find interesting patterns, and build robust machine learning (ML) models for accurate predictions. Large data sizes and advanced analytics spurred the development and adoption of data-parallel computation frameworks like Apache Spark or Flink as well as distributed ML systems like MLlib, TensorFlow, or PyTorch. A key observation is that these new systems share many techniques with traditional high-performance computing (HPC), and the architecture of underlying HW clusters converges. Yet, the programming paradigms, cluster resource management, as well as data formats and representations differ substantially across data management, HPC, and ML software stacks. There is a trend though, toward complex data analysis pipelines that combine these different systems. Examples are workflows of distributed data pre-processing, tuned HPC libraries, and dedicated ML systems, but also HPC applications that leverage ML models for more cost-effective simulation. Major obstacles are (1) limited development productivity for integrated analysis pipelines due to different programming models, and separated cluster environments, (2) unnecessary data movement overhead and underutilization due to separate, statically provisioned clusters, and (3) lack of a common system infrastructure with good interoperability. For these reasons, DAPHNE’s overall objective is the definition of an open and extensible systems infrastructure for integrated data analysis pipelines. We aim at building a reference implementation of language abstractions (i.e., APIs and a domain-specific language), an intermediate representation, as well as compilation and runtime techniques with support for integrating and scheduling heterogeneous accelerator and storage devices. A variety of real-world, high-impact use cases, datasets, and a new benchmark will be used for qualitative and quantitative analysis compared to state-of-the-art.
UM FERI Activities
UM FERI is involved in project work from project management, system architecture, compilation and abstraction of a domain-specific language, runtime and integration, through use case preparations, benchmarking and analysis to dissemination and exploitation of project results.
DAPHNE Project Research Work Group Members at UM FERI
- assoc. prof. dr. Aleš Zamuda (project lead)
- dr. Matjaž Divjak
- assist. prof. dr. Tina Tomažič
- assoc. prof. dr. Danilo Korže
- full prof. dr. Janez Brest
- assist. prof. dr. Borko Bošković
- Laura Horvat
- Matej Moravec (past member)
- assoc. prof. dr. Tomaž Kosar (past member)
- dr. Milan Ojsteršek (past member)
- full prof. dr. Aleš Holobar (past member)
- full prof. dr. Marjan Mernik (past member)
- Klemen Berkovič (past member)
- Sašo Pečnik (past member)
- dr. Danijel Žlaus (past member)
Project data on SICRIS: https://cris.cobiss.net/ecris/si/sl/project/22513

News updates (X)
Annual meeting and report:
https://r8.ieee.org/slovenia-cis/2026/02/11/2026-ieee-slovenia-meeting-at-vransko-cis11-annual-reporting-available/
Special Session @ WCCI 2026 on "Data-Driven Surrogate-Assisted & Knowledge-Informed Evolutionary Optimization for Complex Systems". To explore how machine learning, surrogate modelling, and evolutionary optimisation can together solve challenging problems.
https://cs.ijs.si/project/wcci2026/
34th International Electrotechnical and Computer Science Conference ERK 2025
https://erk.fe.uni-lj.si/erk25.html
The ERK 2025 conference starts today, on September 25, 2025 in Congress Center Bernardin, Portorož, Slovenia.
From the Agenda on CS:
RAČUNALNIŠTVO IN INFORMATIKA / COMPUTER AND
June TOP500 List is out today, just presented at ISC 2025: all top three HPC systems in #supercomputing on #GREEN500 are in #EU, half of top 10 systems in 65th #Top500 HPC list from Europe, HPCG & TOP500 topped by US.
#energyefficiency #ranking #HPC #ISC25 #AI
🇸🇮 VEGA HPC

65th #Top500 is out!
o Top 3 #Exascale systems remain unchanged:
#1 El Capitan
#2 Frontier
#3 Aurora
o JUPITER Booster (@EuroHPC_JU #Exascale system being commissioned – hence partial system) @fzj_jsc in Germany at #4 is the only new system in the #Top10
#HPC #AI #ISC25
100+ reads in a week for the new article with Teo Prica in @MathematicsMDPI (Special Issue Innovations in High-Performance Computing):
"High-Performance Deployment Operational Data Analytics of Pre-Trained Multi-Label Classification Architectures with
High-Performance Deployment Operational Data Analytics of Pre-Trained Multi-Label Classification Architectures with Differential-Evolution-Based Hyperparameter Optimization (AutoDEHypO) https://www.mdpi.com/3320652 #mdpimathematics via @MathematicsMDPI
Thanks @IntechOpen for publishing "Foundational Concepts and Real-World Applications of Self-Adaptive Differential Evolution and Success History" last week as #OpenAccess. #DOI: 10.5772/intechopen.1010630
Acks. @daphne_eu and many more.
More: Grok's further "DeepSearch"
#OpenAcces chapter by @IntechOpen: Foundational Concepts and Real-World Applications of Self-Adaptive Differential Evolution and Success History https://www.intechopen.com/online-first/1222844
High-Performance Deployment Operational Data Analytics of Pre-Trained Multi-Label Classification Architectures with Differential-Evolution-Based Hyperparameter Optimization (AutoDEHypO) #mdpimathematics via @MathematicsMDPI

High-Performance Deployment Operational Data Analytics of Pre-Trained Multi-Label Classification...
This article presents a high-performance-computing differential-evolution-based hyperparameter optimization automated workflow (Au...
www.mdpi.com
#OpenAcces chapter by @IntechOpen: Foundational Concepts and Real-World Applications of Self-Adaptive Differential Evolution and Success History

Foundational Concepts and Real-World Applications of Self-Adaptive Differential Evolution and...
This chapter describes a range of foundational concepts in differential evolution (DE) algorithm, including distance-based...
www.intechopen.com
Sharing slides from the summarizing talk at #Final DAPHNE Review Meeting yesterday (January 15, 2025). Thank you DAPHNE EU Project (@daphne_eu).
Slides:
Link to event post:
https://www.linkedin.com/posts/daphne-eu-project-695735230_daphne-eu-project-team-is-happy-to-announce-activity-7285513332260753409-CAVF?utm_source=social_share_send&utm_medium=member_desktop_web
Randomised Optimisation Algorithms Pipelines @ DAPHNE FRM 2025
The document details the agenda and content of the DAPHNE final review meeting scheduled for January 15, 2025, foc...
www.slideshare.net
Presenting "Randomised Optimisation Algorithms" at DAPHNE @daphne_eu General Assembly Meeting, October 8-9 2024, Athens
Slides ➡️
Randomised Optimisation Algorithms @ DAPHNE GAM 2024
The document provides an overview of the randomized optimization algorithms (ROA) discussed during the DAPHNE general ...
www.slideshare.net
Computational Intelligence (IEEE Slovenia CIS, CIS11) at ERK 2024 track Computer and Information Science (sessions CS) and Technical Presentations
https://events.vtools.ieee.org/m/414440
#ERK #CIS #Computational #Intelligence #Računska #inteligenca #slovenia #evolutionary #erk #cis
Computational Intelligence (IEEE Slovenia CIS, CIS11 @IEEESloveniaCIS ) at ERK 2024 track Computer and Information Science (sessions CS) and Technical Presentations
https://events.vtools.ieee.org/m/414440
IEEE CIS Slovenia website link: https://r8.ieee.org/slovenia-cis/
IEEE Slovenia CIS annual…
Newly available #job offer by University of Maribor in project @daphne_eu
#hiring #HPC #ML #BigData #algorithms #programming #supercomputing #DaphneDSL #GitHub #ICT
➡️➡️➡️

Projekt DAPHNE https://daphne-eu.eu Integrated Data Analysis Pipelines for Large-Scale Data...
Projekt DAPHNE https://daphne-eu.eu Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC and Machine Le...
www.linkedin.com
Newly available #job offer for project @daphne_eu at University of Maribor #hiring #HPC #ML #BigData #algorithms #programming #supercomputing #DaphneDSL #GitHub #ICT

Projekt DAPHNE https://daphne-eu.eu Integrated Data Analysis Pipelines for Large-Scale Data...
Projekt DAPHNE https://daphne-eu.eu Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC and Machine Le...
www.linkedin.com
Still available #job positions for project @daphne_eu at University of Maribor #hiring #HPC #ML #BigData #algorithms #programming #supercomputing #DaphneDSL #GitHub #ICT

Projekt DAPHNE https://daphne-eu.eu Integrated Data Analysis Pipelines for Large-Scale Data...
Projekt DAPHNE https://daphne-eu.eu Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC and Machine Le...
www.linkedin.com
“IEEE ISN GRSS/CIS/MTT-S Workshop Maribor 2024”
https://events.vtools.ieee.org/m/426281
Sharing my slides from CEEPUS mobility at TU Graz hosted by prof. Erich Leitgeb. ➡️➡️➡️
Aleš Zamuda.
Modelling, Simulation, and Computer-aided Design in Computational, Evolutionary, Supercomputing, and Intelligent Systems.
Central European Exchange Program for University
Slides from talk:
Aleš Zamuda, Mark Dokter:
Deploying DAPHNE Computational Intelligence on EuroHPC Vega for Benchmarking Randomised Optimisation Algorithms.
2024 International Conference on Broadband Communications for Next Generation Networks and Multimedia Applications
132 attendees, 42 talks, 15 posters & lightning talks, workshops, meetings, and important conversations on HPC, AI, latest research, and emerging technologies - this was #ASHPC24. Visit the website for photos, a book of abstracts, and a program: https://ashpc.eu/
After four days and 40+ talks, #ASHPC24 has come to an end. It's been a great week in the incredible company of #HPC users and providers. Thanks to all the participants & partners @VSCluster @EuroCC_SLING @uniinnsbruck👋
Save the date for #ASHPC25 ➡️ 19-22 May in Slovenia🇸🇮
Members of the Slovenian consortium for high-performance computing #SLING significantly supported and contributed to the 10th #ASHPC24 Conference, which is taking place from 10th June to 13th June in Grundlsee, Austria 🙌
Euro-Par 2023: Parallel Processing Workshops (LNCS book series volume 14352) is available online, with our conference paper:
"DAPHNE Runtime: Harnessing Parallelism for Integrated Data Analysis Pipelines",
by Aristotelis Vontzalidis, Stratos Psomadakis (@ps0mas), Constantinos
“Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems” talk at IcETRAN 2024 Panel Session.
Slides:
@daphne_eu
Presenting today: “Randomised Optimisation Algorithms in DAPHNE”
Austrian-Slovenian HPC Meeting 2024 – ASHPC24, Seeblickhotel Grundlsee in Austria, 10–13 June 2024
@daphne #HPC #algorithms #ASHPC
Slides ➡️ https://www.slideshare.net/slideshow/randomised-optimisation-algorithms-in-daphne/269638046
Randomised Optimisation Algorithms in DAPHNE
The document outlines a presentation by Aleš Zamuda on randomized optimization algorithms (ROA) taking place at the ...
ashpc.eu
It's this time of the year again: Austrian-Slovenian HPC Meeting #ASHPC24 has started!
With @EuroCC_SLING @VSCluster & @uniinnsbruck 🇦🇹 🇸🇮 https://ashpc.eu
“Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems” talk at IcETRAN 2024 Panel Session.
Slides:
@daphne_eu
Remote Sensing and Computational, Evolutionary, Supercomputing, and...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems (IcETRAN2024-IEEE-CIS-ISN-Panel-Lecture-Ales...
www.slideshare.net
“Presentation of @IEEESloveniaCIS (Computational Intelligence Society) Chapter and Networking”
Slides:
11th International Conference on Electrical, Electronic, and Computing Engineering (IcETRAN 2024).
Presentation of IEEE Slovenia CIS (Computational Intelligence...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapter and Networking - Download as a PDF or view online for free
www.slideshare.net
Presenting the paper today:
“Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Representations from Transformers” co-authored with Jani Dugonik & Elena Lloret @elloretpastor at:
IcETRAN track “Artificial Intelligence”
(11th International Conference on…
We are thrilled to have welcomed Matthias Pohl to the Daphne team! 🚀 As Group Lead for Data Access and Processing at the @DLR_en, Mathias brings invaluable expertise to our endeavors. Welcome to the crew! #EUProject #NewAddition
Euro-Par 2023: Parallel Processing Workshops (LNCS book series volume 14352) is available online, with our conference paper:
"DAPHNE Runtime: Harnessing Parallelism for Integrated Data Analysis Pipelines",
by Aristotelis Vontzalidis, Stratos Psomadakis (@ps0mas), Constantinos…
Report available from today’s Public presentation of individual work with a seminar by Teo Prica: “Exploiting the potential of the National Supercomputing Network”
#CIS #HPC
https://events.vtools.ieee.org/m/413267
