Skip to content

Unification Waterfall

The FirstHive CDP (Customer Data Platform) Unification Dashboard provides a comprehensive view of data ingestion, processing, and unification metrics across multiple customer touchpoints. This waterfall visualization demonstrates how raw data from various sources is processed and unified to create a single customer view.

Dashboard Components

Key Metrics Summary

  • Total Unified Users:
  • Total Source Identities
  • Total Channel Level Unique Identities
  • Total FHIDs generated

Waterfall Flow Architecture

First Hive Ingestion Layers (Top Row)

The dashboard shows data flow through various ingestion channels with their respective volumes:

ChannelIcon
Enterprise WarehouseDatabase
Website InteractionsGlobe
AndroidMobile Device
iOSTimer/Clock
FacebookFacebook
TwitterTwitter
Data UploadCloud Upload
EmailEmail
SMSSMS
WhatsAppWhatsApp
Push NotificationsBell
Audience PushUsers

First Hive Channel Level Identity Layer (Bottom Row)

After processing through the unification engine, the data is consolidated:

ChannelReduction Ratio
Enterprise Warehouse~90.7% reduction
Website Interactions~93.9% reduction
Android~99.4% reduction
iOS~99.2% reduction
Facebook~0.3% reduction
Twitter~55.6% reduction
Data Upload~46.3% reduction
Email~94.8% reduction
SMS~63.9% reduction
WhatsApp~62.5% reduction
Push Notifications~96.2% reduction
Audience Push~61.2% reduction

Data Ingestion & Unification Chart

The bar chart visualization shows the comparison between:

  • Blue bars: Ingested Users (raw data volume)
  • Purple bars: Unified Users (post-deduplication)

Key insights from the chart:

  • Email has the highest ingestion volume but significant deduplication
  • Enterprise Warehouse shows substantial data processing with good unification rates
  • Mobile channels (Android/iOS) show high deduplication ratios
  • Social media channels have varying unification patterns

Key Performance Indicators

Unification Efficiency Metrics

  • Overall Unification Ratio
  • Best Performing Channels
  • Highest Deduplication Channels:

Data Quality Insights

  • Total Data Reduction: From 1.5B identities to 185M channel-level unique identities (87.7% reduction)
  • Final Unification: From 185M to 30.8M unified users (83.3% additional consolidation)
  • Cross-channel Identity Resolution: Demonstrates effective matching across multiple touchpoints

Business Implications

Customer Journey Completeness

The dashboard indicates comprehensive customer data collection across:

  • Digital Touchpoints: Website, mobile apps, social media
  • Communication Channels: Email, SMS, WhatsApp, push notifications
  • Enterprise Systems: Data warehouse integration
  • Marketing Platforms: Audience push capabilities

Data Quality and Accuracy

  • High deduplication rates suggest robust identity resolution algorithms
  • Cross-channel unification enables single customer view creation
  • Enterprise warehouse integration ensures data completeness

Platform Capabilities Demonstrated

  • Multi-source Ingestion: Different data sources
  • Real-time Processing: Handling 1.5B+ monthly unique source identities in single account
  • Advanced Identity Resolution: 97.95% deduplication efficiency
  • Scalable Architecture: Processing hundreds of millions of records per channel

Usage Guidelines

Dashboard Interpretation

  • Monitor unification ratios to assess data quality
  • Track channel-specific performance for optimization
  • Use metrics for capacity planning and resource allocation
  • Identify channels requiring improved identity matching

Performance Monitoring

  • Red Flags: Sudden drops in unification ratios
  • Optimization Targets: Channels with unusually high or low deduplication
  • Capacity Planning: Monitor total ingestion volumes for scaling decisions
  1. Regular Review: Weekly assessment of unification metrics
  2. Channel Optimization: Focus on improving identity matching for high-volume, low-efficiency channels
  3. Data Quality Maintenance: Monitor for anomalies in source data patterns
  4. Business Alignment: Use metrics to demonstrate CDP value and ROI

Technical Specifications

Data Processing Flow

  1. Ingestion Layer: Multi-source data collection
  2. Identity Resolution Engine: Cross-reference and matching algorithms
  3. Unification Process: Consolidation into single customer profiles
  4. Output Layer: Unified customer database with 30.8M profiles

Performance Metrics

  • Processing Capacity: 1.5B+ source records per month per account
  • Unification Accuracy: 97.95+% deduplication rate
  • Channel Coverage: Multiple integrated data sources
  • Real-time Capability: Live dashboard updates

This dashboard serves as a critical tool for understanding data flow, quality, and unification effectiveness within the FirstHive CDP ecosystem.