Hyperight

How We Build a Data Lakehouse to Manage PB’s of Data – Joachim Zetterman, Scania

Session Outline

Building autonomous vehicles is a complex task that requires a flexible and efficient data platform. In this session at the NDSML Summit 2023, Joachim Zetterman from Scania shows how Scania has used the Data Lakehouse concept to manage and analyse huge volumes of sensor data from different sources and in different formats. A Data Lakehouse is a new architecture that combines the advantages of data lakes and data warehouses, such as scalability, reliability, and performance. It allows Scania to store, process, and query sensor data in a consistent and unified way. The session covers experience and lessons learned in the following areas:

Key Takeaways

  • How we built and deployed our data platform architecture using AWS and Databricks.
  • How we fostered user adoption and collaboration among various teams and stakeholders.
  • How we are preparing for the future challenges of MLOps, such as data quality, model management, and deployment.
