. .

Improving Lessons Learned: Strengthening Operational Resilience Through Continuous Improvement
OR BB P2S5_LL_15

[OR] [P2] [S5] [LL] [C15] Practical Case Study (Banking Sector Example)

New call-to-action

While frameworks and methodologies provide structure, their true value is realised through practical application. This chapter presents a detailed banking sector case study showing how lessons learned:

  • Are captured following a disruption
  • Undergo Root Cause Analysis (RCA)
  • Are linked to CBS and impact tolerance
  • Lead to measurable improvements

The case demonstrates how organisations can move from:

  • Incident → Insight → Action → Improvement

New call-to-action

Moh Heng Goh
Operational Resilience Certified Planner-Specialist-Expert

New call-to-action

[P2] [S5] Chapter 15

New call-to-actionPractical Case Study (Banking Sector Example)

Introduction

0019 Cyber Attack Resilience Lab

While frameworks and methodologies provide structure, their true value is realised through practical application. This chapter presents a detailed banking sector case study showing how lessons learned:

  • Are captured following a disruption[OR] [P2] [S5] [LL] [C15] Practical Case Study (Banking Sector Example)
  • Undergo Root Cause Analysis (RCA)
  • Are linked to CBS and impact tolerance
  • Lead to measurable improvements

The case demonstrates how organisations can move from:

  • Incident → Insight → Action → Improvement

Purpose of the Chapter

To illustrate how a financial institution applies a structured Lessons Learned framework in a real-world scenario—demonstrating how incidents are analysed, lessons are derived, and improvement actions are implemented to strengthen Critical Business Services (CBS) and overall operational resilience.

 

Case Study Overview

Organisation Profile
  • Mid-to-large retail bank
  • Operates digital banking, payments, and deposit services
  • Highly dependent on technology and third-party providers
Critical Business Service (CBS) in Focus

CBS-2: Payments and Funds Transfer Services

This CBS includes:

  • Payment initiation
  • Transaction processing
  • Settlement and confirmation

 

Incident Description

Event Summary

The bank experienced a major outage in its payment processing system, resulting in:

  • Delayed transactions
  • Failed fund transfers
  • Customer complaints
Duration
  • Total disruption: 4 hours
  • Recovery time exceeded defined impact tolerance
Impact Assessment

Impact Area

Description

Customer Impact

Delayed and failed payments

Financial Impact

Compensation costs

Regulatory Impact

Breach of service expectations

Reputational Impact

Negative customer sentiment

 

Lessons Learned Process

Stage 1: Capture

Immediately after stabilisation:

  • Incident details were recorded
  • Stakeholders participated in a debrief session
  • Initial observations were documented

Key Observations

  • System performance degraded before failure
  • Delayed escalation to IT teams
  • Vendor response was slower than expected
Stage 2: Root Cause Analysis

A structured RCA was conducted using multiple techniques.

Findings

Level

Description

Immediate Cause

System overload during peak transaction period

Contributing Factors

Ineffective monitoring and delayed response

Root Cause

Lack of capacity planning and insufficient vendor resilience

Stage 3: Validation
  • Findings reviewed by:
    • IT teams
    • Operations
    • Risk management
  • Confirmed accuracy and completeness
Stage 4: Mapping to CBS

The incident was mapped to Sub-CBS components:

Sub-CBS Code

Sub-CBS

Issue

2.1

Payment Initiation

Transaction delays

2.2

Processing Engine

System overload

2.3

Settlement

Failed transactions

Stage 5: Impact Tolerance Assessment
  • Defined tolerance: 2 hours maximum downtime
  • Actual disruption: 4 hours

Conclusion:

Impact tolerance breached

 

Development of Improvement Actions

Based on lessons learned, the bank defined improvement actions.

Action Plan

Action ID

Description

Owner

Priority

Timeline

A1

Implement real-time monitoring system

IT

High

3 months

A2

Upgrade system capacity

IT

High

6 months

A3

Strengthen vendor SLAs

Procurement

High

2 months

A4

Improve escalation procedures

Operations

Medium

1 month

A5

Conduct stress testing

Risk/IT

High

Quarterly

 

Prioritisation of Actions

Actions were prioritised based on:

  • CBS criticality
  • Impact severity
  • Likelihood of recurrence

Priority Summary

  • High Priority: Capacity, monitoring, vendor management
  • Medium Priority: Process improvements

 

Implementation and Monitoring

Execution
  • Actions assigned to responsible teams
  • Progress tracked through a centralised system
Monitoring
  • Weekly progress reviews
  • Escalation of delays

 

Integration with Scenario Testing

Lessons learned were incorporated into future testing.

Updated Scenarios

  • Vendor failure scenario
  • Peak transaction load stress testing
  • Combined system and process failure

Outcome

  • Improved testing realism
  • Better preparedness

 

15.9 Results and Improvements

Measurable Outcomes

Metric

Before

After

Recovery Time

4 hours

1.5 hours

Incident Frequency

High

Reduced

Customer Complaints

High

Reduced

CBS Availability

Lower

Improved

Key Improvements
  • Enhanced system resilience
  • Improved monitoring capabilities
  • Stronger vendor management
  • Faster response times

 

Key Lessons from the Case Study

Importance of Service-Centric Approach
  • Focus on CBS enabled better prioritisation
Value of Root Cause Analysis
  • Identified systemic issues, not just symptoms
Need for Strong Governance
  • Ensured accountability and follow-through
Integration Across Functions
  • Collaboration between IT, operations, and risk was critical

 

Common Pitfalls Observed

  • Initial delay in escalation
  • Over-reliance on vendor capabilities
  • Lack of proactive monitoring

 

Best Practices Demonstrated

  • Structured lessons learned framework
  • Clear ownership and accountability
  • Integration with scenario testing
  • Continuous improvement approach

 

Extending Lessons Across the Organisation

The bank applied lessons learned beyond payments CBS to:

  • Other critical services
  • Vendor management framework
  • Risk management processes

[Banner] [Summing] [OR] [E2] [C13] Improving Lessons Learned

This case study demonstrates how a structured approach to lessons learned can:

  • Transform incidents into opportunities for improvement
  • Strengthen Critical Business Services
  • Enhance operational resilience

By following a disciplined methodology, organisations can ensure that:

  • Lessons are not lost
  • Improvements are implemented
  • Resilience capabilities are continuously enhanced

 

Transition to Next Chapter

Building on this practical case study, the next chapter will explore future trends in lessons learned, including the role of AI, predictive analytics, and evolving regulatory expectations in shaping the future of operational resilience.

 

New call-to-action

C1 C2 C3 C4 C5 C6
[OR] [P2] [S5] [LL] [C1] Introduction to Lessons Learned in OR [OR] [P2] [S5] [LL] [C2] The Role of Lessons Learned in the OR Lifecycle [OR] [P2] [S5] [LL] [C3] Governance and Ownership of Lessons Learned [OR] [P2] [S5] [LL] [C4] Sources and Triggers for Capturing Lessons Learned [OR] [P2] [S5] [LL] [C5] Lessons Learned Framework and Methodology [OR] [P2] [S5] [LL] [C6] Root Cause Analysis (RCA) Techniques
C7 C8 C9 C10 C11 C12 
[OR] [P2] [S5] [LL] [C7] Linking Lessons Learned to CBS [OR] [P2] [S5] [LL] [C8] Integration with Scenario Testing and Impact Tolerance [OR] [P2] [S5] [LL] [C9] Developing and Prioritising Improvement Actions [OR] [P2] [S5] [LL] [C10] Embedding Continuous Improvement [OR] [P2] [S5] [LL] [C11] Communication of Lessons Learned [OR] [P2] [S5] [LL] [C12] Technology and Tools for Lessons Learned Management
C13 C14 C15 C16 C17 C18
[OR] [P2] [S5] [LL] [C13] Regulatory Expectations and Compliance [OR] [P2] [S5] [LL] [C14] Common Challenges and Pitfalls [OR] [P2] [S5] [LL] [C15] Practical Case Study (Banking Sector Example) [OR] [P2] [S5] [LL] [C16] Future Trends in Lessons Learned [OR] [P2] [S5] [LL] [C17] Key Takeaways and Call to Action [OR] [P2] [S5] [LL] [C18] Back Cover

 

More Information About OR-5000 [OR-5] or OR-300 [OR-3]

To learn more about the course and schedule, click the buttons below for the OR-300 Operational Resilience Implementer course and the OR-5000 Operational Resilience Expert Implementer course.

BL-OR-3 Register Now BL-OR-3_Tell Me More BL-OR-3_View Schedule
BL-OR-5_Register Now BL-OR-5_Tell Me More  [BL-OR] [3-4-5] View Schedule
[BL-OR] [3] FAQ OR-300

If you have any questions, click to contact us.Email to Sales Team [BCM Institute]

FAQ BL-OR-5 OR-5000
OR Implementer Landing Page

New call-to-action

New call-to-action

 

Comments:

 

CTA Banner_OR

CTA Banner_ORA

CTA Banner_BCM

CTA Banner_ITDR

CTA Banner_CM