Img-Reboot

Transforming Data Management for a Global Hospitality Leader

Summary

A large global hospitality and entertainment firm specializes in hotels, resorts, dining services, sports venues, gaming, and retail. Due to the diversity of data sources, the firm's technical environment lacked a unified perspective on transactional data. Additionally, data quality challenges inhibited effective data consumption by business units.

To address these issues, Tredence developed a modern data platform utilizing Databricks and Alation to enhance data quality, categorization, and governance.

Goal

The customer needed a consolidated view of transactional data, which was dispersed across various systems, hindering effective reporting. There were data quality challenges, and the existing design wasn't scalable enough for large-scale analytical solutions or data storage. Additionally, there was a lack of data governance and stewardship.

The objective was to develop a fault-tolerant, uniform, and scalable ingestion framework that would be easy to configure.

To achieve this, Tredence utilized Databricks and AWS services to build a unified data platform, and Alation to enhance data governance, quality, and accuracy.

Tools and AWS Services Used

The platform is primarily built around AWS services, with some exceptions such as Databricks, Terraform, and Alation:

  • AWS IAM: Identity and Access Management
  • AWS CloudWatch: Monitoring
  • AWS SNS: Notifications and Alerts
  • Amazon S3: Object Storage
  • Databricks
  • Delta Lake
  • Terraform: Automate Process
  • Alation: Data Catalog and Data Governance

Approach

Tredence collaborated with the customer to develop and build a modern data platform utilizing AWS cloud services and Databricks. 

Tredence designed medallion architecture with three layers: raw, core, and presentation layer. The process began with ingesting data from the landing zone in S3, including various file patterns (CSV, JSON, XML, Text) from various source systems into the raw layer. This layer effectively flattens various files into a row and column format. The core layer uses a star schema to simplify the data model and accelerate aggregation. The final layer, known as the presentation layer, includes several materialized views based on the consumption requirements of specific groups.

The solution architecture for the platform is shown in the diagram below:

Key Benefits

  • Unified data platform for analytics consumption integrating 17 POS systems with different file patterns
  • Automatic schema evolution
  • Dynamic and configurable, ready-to-use ingestion framework
  • Immediate downstream data consumption via Tableau for sales performance measurement
  • Complete visibility and data governance from raw to presentation layer

Results

Icon Boost

Configurable end-to-end ingestion architecture

Icon Boost

Scalable automated data consumption pipelines

Icon Boost

Minimal costs for operations and maintenance

Talk To Us