Enterprise Data Warehouse

A near realtime data warehousing system built on serverless technology for a pharma organization including an operational UI.

Industry
Enterprise Data Warehouse

Location
USA

Services
Data Warehouse

Situation

The client worked in the healthcare industry. They used a data warehousing system for storing their Pharmaceutical Sales Representatives’ data. The client did not process or refine the collected data, or there was no sorting of the data according to the business/fact/dimension criteria. Additionally, there was no system in place that allowed the company reps to use the data.

Problem Statement

In order to evaluate the data, the previous data warehousing system lacked a suitable user interface (UI). Correct functionalities weren’t utilized in the data warehousing system to deal with validations and standardization. Furthermore, Fact GPO & Dimensions were not properly implemented. A mechanism was required in the new data warehousing system that would provide them with the following three alternatives for resolving the following issues:

It was hard to validate, modify, and add new and existing data via the UI interface

There was no precise address standardization method

Fact, GPO, and Dimensions layers weren’t used.

Approach

For the new data warehousing system, the AWS Lambda service was used to execute straightforward alerts of incoming files in order to accomplish real-time processing of input data files. An input definition system had also been created, making it simple to add or change an existing input data source. As part of the input definition implementation, the team also leveraged the AWS Glue service and the idea of an external schema. By leveraging the SmartyStreets API for address verification and data collection surrounding a specific address, accurate address standardization was made possible. Using a user interface to edit, update, add, or remove existing data simplified the data validation process. Additionally, the processed data was synchronized with the existing CRM system to make it simple for the marketing and sales teams to check data. The client worked in the healthcare industry. They used a data warehousing system for storing their Pharmaceutical Sales Representatives’ data. The client did not process or refine the collected data, or there was no sorting of the data according to the business/fact/dimension criteria. Additionally, there was no system in place that allowed the company reps to use the data.

Outcome

Once the new data warehousing system was finished, the Incentius team found that the data accuracy had increased and that the data validation had become simple with the new UI. With the assistance of the AWS Lambda service, real-time data processing was accomplished. Since the data warehouse system was used to develop the data needed for the client’s existing CRM system, data synchronization between the two systems was accomplished easily. Sequential lambda functions were used to apply business, fact, and dimensions rules to the incoming data with simplicity.