Row-level Duplicates

Overview

The Sifflet Duplicates Monitor is a table-level metadata monitor. It scans a table for row-level duplicates and calculates the table's duplicate rate. The expected values for that rate are computed by machine learning models based on the historical behavior of the data.
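To make the idea of a duplicate rate concrete, here is a minimal Python sketch. It is not Sifflet's implementation, and the exact formula Sifflet uses is not stated on this page. The sketch assumes the rate is the share of rows that repeat another, fully identical row; the function name `duplicate_rate` and the sample data are hypothetical.

```python
def duplicate_rate(rows):
    """Return the fraction of rows that are redundant copies of another row.

    Assumed definition (not confirmed by Sifflet):
        (total rows - distinct rows) / total rows
    Two rows count as duplicates only if every column value matches.
    """
    # Convert each row to a tuple so it can be placed in a set.
    rows = [tuple(row) for row in rows]
    if not rows:
        # An empty table has no duplicates; avoid dividing by zero.
        return 0.0
    distinct = len(set(rows))
    return (len(rows) - distinct) / len(rows)


# Hypothetical table with four rows, one of which repeats another.
sample = [
    ("a1", "2024-01-01", 10),
    ("a1", "2024-01-01", 10),
    ("b2", "2024-01-02", 7),
    ("c3", "2024-01-03", 5),
]

print(duplicate_rate(sample))  # 0.25
```

Under this assumed definition, one redundant row out of four gives a rate of 0.25. A monitor of this kind would then compare each new rate against the range expected from past runs.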

📘

Metadata Monitoring

Metadata is information about data, including its structure and the transformations applied to it. Metadata monitoring helps identify and address issues related to data integration and data transformations.

As data volume and complexity continue to grow, metadata monitoring is becoming increasingly important for maintaining a reliable and trustworthy data ecosystem.