Operational Data Layer Data loading:
β
Data must sync with source systems
β
Appropriate data loading strategy
β
Producer systems: frequency and quantity of data changes
β
Consuming systems: clear requirements for data currency
Step 1: Batch extract and load:
π Initial batch load
π Copy application database data to Operational Data Layer
π One-time operation to load data from source systems
*Step 2: Delta extract and load: *
π Starts immediately following initial batch load
π Real-time synchronization
π Incremental updates from source systems into the ODL
π Use Change Data Capture (CDC)
π Catch changes from source systems
π Matching, merging, reconciling data
Data flow and maturity model:
ποΈ Simple start
π± Grows in scope and strategic importance
π Delivering increased benefits to business
Phase 1: Simple ODL, offloading reads:
π» Serve only read operations
π° Cut costs
π High availability: takes over during source system downtime
β‘ Improve performance
π Handle long-running analytics queries
π Handle high read traffic peak
Phase 2: Enriched ODL for new use cases:
π Create single customer view
π³ Example: credit card transactions enriched by categorizing purchases
π° Determine their spend on each category (travel)
Phase 3: Offloading reads and writes:
π₯ Y-loading un both latency, new systems in parallel
Phase 4: ODL first:
βοΈ All writes are directed to Operational Data Layer (ODL)
Phase 5: System of Record:
ποΈ Operational Data Layer serve as the System of Record
π³οΈ Source system can be decommissioned for cost savings
ποΈ Architectural simplicity
MongoDB for Operational Data Layer:
π€ Ease: MongoDB's document model easily manage data
𧩠Flexibility: integrate multiple source systems into a single ODL, without pre-define schema
β‘ Speed: better performance when accessing data
π¨ Versatility: satisfy a range of application requirements by flexibility of document model
Example:
π Embedding of arrays and sub-documents
π Modeling complex relationships and hierarchical data
ποΈ Ability to manipulate deeply nested data without rewrite entire document
ποΈ Model flat, table-like structures, simple key-value pairs, text
π Geospatial data
πΈοΈ Nodes and edges used in graph processing
Processing pipelines:
π Lookups and range queries
π Data analytics
π¨ Transformations
π Faceted search
π Geospatial processing
π΅οΈ Graph traversals
Intelligently distribute (Operational Data Layer) ODL:
Availability:
π» Multiple copies of data using replica sets
π Failover and recovery is fully automated
Scalability:
π Challenge: new source systems, adding data volume, new consuming systems, increasing workload
ποΈ Large data sets
β‘ High throughput requirements
𧩠Solution - sharding:
π€ MongoDB provides horizontal scale-out on low-cost
π Automatically partitions and distributes data across multiple physical instances
Workload isolation:
π Operational Data Layer able to safely serve disparate workloads
π Analytical queries on up-to-date data without impact on production applications
Data locality:
π Allows precise control over where data is physically stored
πΊοΈ Control geographic region for latency, governance requirements
Reference:
https://www.mongodb.com/resources/basics/implementing-an-operational-data-layer
Implementing an Operational Data Layer
https://www.mongodb.com/resources/solutions/use-cases/mainframe-modernization-reference-architecture
Mainframe Modernization Reference Architecture
Editor
Danny Chan, specialty of FSI and Serverless
Kenny Chan, specialty of FSI and Machine Learning
Top comments (0)