VuNet Docs

Data Enrichment Techniques

Introduction

Data enrichment is a powerful process that takes your raw data and makes it even more valuable by adding relevant contextual information. It’s like enhancing your data’s context and depth.

To make data enrichment work, you need a unique key that exists in both your incoming raw data and the enrichment table, usually created manually. This key serves as the magic link that connects them.  For example, it can transform IP addresses into branch names or decode postal codes into geographical information. In these cases, IP addresses/ postal codes will act as unique key. If the unique key present in raw data is also available in the addition source, the raw data is enriched with datasets available in the additional source. If the unique key is not present, raw data is not enriched. If there’s no match, the data remains unchanged and isn’t enriched. So, having that unique key in both places is essential for success.

Data enrichment is incredibly versatile. It can turn cryptic codes into easily understandable names. By using data enrichment, you’ll gain deeper insights and harness your data’s full potential. The extra context it provides makes data analysis a breeze. By applying data enrichment, users can gain deeper insights and make better use of their data. The enriched information provides additional context and enhances the understanding and analysis of the data.

Prerequisites

Ensure that ContextStream is configured to implement data enrichment details using plugins in the data pipeline.

Data Enrichment Example

A real-world example of data enrichment is upgrading geolocation data using an Enrichment Table. By linking a pincode (unique key) with values such as address, city name, and geo-IP, users can supercharge their geolocation data. When you provide a Key (i.e. pincode), the enrichment process fetches all the contextual data (i.e.ddress, city name, and geo-IP). This contextual data further helps in enabling a dynamic geographic map. A sample of geolocation data demonstrating the relationship between pincodes and geographical information is shown in the image below:

Here’s how data enrichment is done:

  1. Data Setup: The pincode, along with its corresponding data (like address, city name, and geo-IP), is stored in an Enrichment Table. Think of it as a lookup table where the pincode is the key, and the other details are the values.
  2. Easy Enrichment: When the pincode comes into the input stream, the magic of data enrichment happens. It automatically fetches the associated information like address, city name, and GeoIP (latitude and longitude). This enriches your data with meaningful context.
  3. Map Magic: This enriched data can be used to create dynamic geographic maps in the user interface. Imagine pincode inputs coming from a data source, and the map showing the exact locations.

In essence, the key-value pair system enriches your data by adding contextual details based on the provided key (in this case, the pincode). It’s like giving your data a power-up, and it opens up exciting possibilities, like creating interactive maps with ease.

Workflow for Data Enrichment

Performing data enrichment in vuSmartMaps™ is made simple with a clear step-by-step process. Let’s break it down:

  1. Enrichment Configuration:
    Enrichment Configuration is the process where we create a lookup with the keys and values along with their corresponding data stored in it.
    • Creating Enrichment Table
      Begin by establishing a new enrichment table. This is where you’ll store the key-value pairs that will enhance your data. Clearly specify the keys (identifiers) and their corresponding values (information) that are relevant to your data. These keys and values act like a schema for your data enrichment.
    • Updating Data in Enrichment Table
      You can update data in the enrichment table through either manual entry or by uploading a spreadsheet. For larger data updates or when you have data stored in a spreadsheet, you can effortlessly upload the spreadsheet.
  2. Using Enrichment in Data Pipeline:
    After the enrichment configuration, you can incorporate the enrichment feature into your data pipeline. This ensures that the data flowing through the pipeline undergoes the enrichment process.
    • Creation of Input-Output Streams:
      To enable the enrichment pipeline, you need to create the data streams that act as channels for data flow, to specify input and output streams for the enrichment process.
    • Creation of Pipeline:
      The next step involves configuring the pipeline within the application. This includes defining the transformation details from the input data to the output data and incorporating the enrichment step into the pipeline flow.

The visual representation below illustrates the enrichment pipeline. Input data is transformed using configured enrichment settings, and the enriched output data is stored in an output stream. This allows users to enhance their data with additional context and insights for improved analysis and decision-making.

This simplified workflow guide ensures that you can easily enrich your data, making it more valuable and insightful for your analyses and decision-making processes.

Further Reading

  1. Explore ContextStreams to set up the data pipeline
  2. Learn Data Model to structure the data

Resources

Browse through our resources to learn how you can accelerate digital transformation within your organisation.

Unveiling our all powerful Internet and Mobile Banking Observability Experience Center. Click Here