As data and AI became increasingly central to HEMA’s operations, it’s been relying on a system to collect and process data to carry out several tasks. It is an on-premise cloud infrastructure where data is collected and reports are processed to be used by the sales, stock, and managerial teams, among others. This legacy data system, which performs tasks such as tracing revenue from online and offline sales and performing stock and promotion forecasting, is no longer able to scale up to HEMA’s needs and different data sources. For this reason, the retailer chain created a dedicated team to build a new platform that is more adequate, cost efficient, and able to accommodate data coming from multiple sources. Xomnia’s data engineer Daniel Galea is part of the team working to build, install, and configure the new system. Besides building the new system, Daniel’s role is to also migrate the existing data, code, and technology from the legacy system to the new system, and to be an intermediary between the development team and business team to clearly communicate the interests of the business, such as specific reports and the datasets that they require. Xomnia can help you all the way - from setting data strategies to executing them. Get in touch for a consultation.
The solution, called the Cloud Data Analytics Platform (CDAP), can almost meet all the business needs at HEMA, such as ingesting, processing, aggregating/ combining, and exposing/serving different datasets. As migrating the data and data sources from the several departments that have for years used the legacy system is a time-consuming process, the solution is expected to be fully built by the end of 2021.
Going over the technical details, the CDAP is built in the AWS cloud, using open-source system Kubernetes. In Kubernetes, the data engineers are installing tools like Spark for processing, Airflow for orchestration, and Prometheus/Grafana for monitoring. The data engineering team is working on building, installing, and configuring this.
The new system has been partially put in use already. When fully operational, it will enable HEMA to run analytical dashboards and machine learning features, and perform tasks that require running round the clock. The system will also efficiently accommodate new sources of data, and accept API’s for multiple purposes.
By developing this data processing and service system, running almost entirely on AWS managed Kubernetes, HEMA aims to achieve four impacts:
Don't miss Xomnia's latest business cases and blogs. Subscribe to our newsletter.