Focus

Data Science

Articles, podcasts, talks, and more about Data Science.
Article

Data Inventories in the EU Data Act: The Democratization of IoT Devices

Starting in September 2025, the EU Data Act (Regulation (EU) 2023/2854) will require companies that collect or process data from connected devices to maintain comprehensive data inventories.

Blog Post

LLM-assisted Abbreviation Mining for Legacy Systems

This blog post shows the process of mining abbreviations and discovering first concepts a COBOL legacy mainframe codebase is made of with the help of Large Language Models. It uses Python, pandas and Claude 3.5 Sonnet to generate insights that can be gathered from such a simple thing like a list of files.

Podcast

Data Contracts

API Spezifikationen, aber für Datensätze

Article

How To Build a Data Product with Databricks

In today’s data engineering, the focus is primarily on developing modular data products. This article outlines the advantages of modularity over monolithic data pipelines and explains, step-by-step, how to develop data products using Databricks – from defining a data contract to creating and implementing Databricks Asset Bundles, setting up a CI/CD pipeline, and publishing metadata.

Article

Creating data products with Terraform on AWS

Have you heard of data mesh? Are you intrigued by its potential but uncertain how to get started building data mesh and data products? If so, this article outlines a potential approach and delves into the key concepts behind it!

Article

Processing medical study data with Data Mesh technologies

Revisiting the tech stack of a self-serve data platform

Article

Data Mesh: Decentralized Data Analytics for Software Engineers

The decentralized data architecture approach Data Mesh is designed to enable developers to independently perform cross-domain data analysis.

Blog Post

Defect Analysis using pandas

Defect Analysis is a classic analysis technique to get insights into how buggy your system might be. In this blog post, we explore how Defect Analysis works and how we can implement it with a standard data analysis tool from Python: pandas.

Podcast

Software Analytics

Mit Data Science Probleme in der eigenen Software finden

Talk
Talk

Data Governance mit GenAI automatisieren

data2day / 10:15 - 11:00

Talk
Talk

Data Governance für Entwickler:innen: Zwischen Pflicht, Portal und Purpose

data2day / 15:30 - 16:15

Talk
Talk

MCP für Datenprodukte

data2day / 10:30 - 11:15

Talk
Talk

Data Contracts in der Praxis (Workshop)

data2day / 10:00 - 17:00

News

Now Live: The Women+ in Data and AI Festival Schedule

News

INNOQ launches Data and AI Consulting Services

News

Neu bei INNOQ: Beratung und Entwicklung im Bereich Data und AI

News

Women+ in Data and AI Summer Festival 2024

News

Women+ in Data and AI Summer Festival

News

Women+ in Data and AI Summer Festival