Data Management & Engineering

Data Management is one of the central challenges in developing modern software systems. The need for sophisticated Data Management has grown further with the rise of Artificial Intelligence and Big Data-based systems, whose data requirements are more demanding than those traditional Data Management had to consider.

Data Engineering

In Data Engineering, we focus on preparing data for deployment or use in complex AI and data-driven systems. This covers, for example, discovering, cleaning, and transforming data, as well as integrating data from heterogeneous sources. We also focus on creating and managing (domain-specific) metadata. Furthermore, data biases and the societal issues that can arise from them, such as misrepresentation and unfairness, are a focus area. Data Engineering topics are often studied in the context of their application domains, such as Digital Humanities and medicine, but also business applications such as banking.

Projects

  • Amalur - Next-generation Data Integration in Data Lakes

    With the Amalur project, we believe that this is the right moment to revisit all the components of classic data integration (DI) systems and to see how they fit into modern data lakes, which are meant to support linear algebra as a first-class citizen.

  • Optimizing ML Inference Queries under Constraints

    Optimizing ML inference queries is hard, especially when constraints (e.g., on accuracy or execution time) have to be satisfied and the complexity of the inference query increases. This project aims to tackle the constraint-based ML inference query optimization problem. The proposed optimizer aims at high effectiveness and can navigate a large search space to find optimal query plans on various model zoos.
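
To make the problem statement concrete, here is a minimal sketch of constraint-based plan selection. It is not the project's optimizer: the `Model` class, the `best_plan` function, the example zoo, and all numbers are hypothetical. It also assumes that a plan's accuracy is the product of the accuracies of its models and that its execution time is their sum, which are simplifications. The sketch enumerates every plan exhaustively, which shows why the search space grows quickly with the number of tasks and models.

```python
from dataclasses import dataclass
from itertools import product
from typing import Optional


@dataclass(frozen=True)
class Model:
    """Hypothetical model-zoo entry with estimated quality and cost."""
    name: str
    accuracy: float  # estimated accuracy in [0, 1]
    cost_ms: float   # estimated execution time in milliseconds


def best_plan(zoo: dict[str, list[Model]],
              min_accuracy: float) -> Optional[tuple[Model, ...]]:
    """Return the cheapest plan (one model per task) meeting the accuracy bound.

    Assumption: plan accuracy is the product of model accuracies,
    and plan cost is the sum of model costs.
    """
    best, best_cost = None, float("inf")
    # Exhaustive search: one candidate plan per combination of models.
    for plan in product(*zoo.values()):
        accuracy = 1.0
        for model in plan:
            accuracy *= model.accuracy
        cost = sum(model.cost_ms for model in plan)
        if accuracy >= min_accuracy and cost < best_cost:
            best, best_cost = plan, cost
    return best  # None if no plan satisfies the constraint


# Hypothetical zoo: two tasks, each with a fast and an accurate model.
zoo = {
    "detect": [Model("det_small", 0.90, 10.0), Model("det_large", 0.98, 40.0)],
    "classify": [Model("cls_small", 0.92, 5.0), Model("cls_large", 0.99, 25.0)],
}

plan = best_plan(zoo, min_accuracy=0.90)
print([m.name for m in plan] if plan else "no feasible plan")
```

With these made-up numbers, the cheapest feasible plan pairs the large detector with the small classifier. Exhaustive enumeration is only workable for tiny zoos; a realistic optimizer would need to prune or search the plan space more cleverly.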