Get to know how to create a data story for Fraud Analytics to reduce fraudulent transactions.
Fraud is a big problem for banks and financial institutions. Every year, billions of dollars are lost to fraudsters who find ways to exploit weaknesses in the system.
Digital fraud is the banking sector's primary challenge, leading to immense losses every year. As per McAfee's reports, cyber fraud currently damages the economy by USD 600 billion of GDP on a global basis.
But, How does this number feel so relevant? Because tons of data flows in daily, and guess whose job it is to track and report it to stakeholders like the Chief Financial Officer?
It’s us Data Analysts!
Here, Data Analysis is not about just a Number; But a proven business solution which talks about the Charts, real time numbers and data driven insights.
In this python use case, we’ll talk about the Goals, Challenges and Objectives of the Chief Financial Officer to provide meaningful insights they crave for.
Let’s get started!
About Dataset
Let’s Inspect our Fraud Analytics dataset. This dataset got all the deeds on transaction – transaction IDs, completion dates, types, subtypes, payment service providers (PSP), cities, amounts.
This dataset encompasses a variety of dimensions including temporal, financial, geographical, technical, and behavioral aspects of transactions, making it a potent resource for identifying and understanding fraudulent activities in financial systems. Here are some sample Rows & Columns of the Fraud Analytics Dataset.
Transaction Table
txn_id
dt_txn_comp
txn_comp_time
txn_type
txn_subtype
initiating_channel_id
txn_status
error_code
payer_psp
payee_psp
payee_state
1
2021-06-22
8:40:00 PM
Fee
Transaction Fee
6
Successful
Axis Pay
Google Pay for Business
Uttar Pradesh
2
2023-06-02
8:36:00 AM
Payment
Peer-to-Peer (P2P)
18
Successful
PhonePe
PhonePe for Merchants
Karnataka
3
2021-12-12
10:26:00 PM
Reversal
Transaction Error Correction
16
On Hold
U66
Amazon Pay
HDFC Merchant Services
Uttarakhand
4
2023-05-05
8:34:00 AM
Withdrawal
ATM Withdrawal
16
Successful
Apple Pay (for international transactions)
ICICI Merchant Services
Goa
Payment Details Table
payer_os_type
payee_os_type
beneficiary_mcc_code
remitter_mcc_code
custref_transaction_ref
cred_type
cred_subtype
payer_app_id
payee_app_id
Others
Others
5945
7995
Reference 1
Credit Card
Balance Transfer Credit Card
BHIM IOB UPI
JustDial
iOS
Android
5099
7395
Reference 2
Credit Card
Balance Credit Card
MI Pay
Digibank (DBS)
Others
MacOS
5996
5598
Reference 3
Home Loan
Fixed-Rate Mortgage
BHIM Axis Pay UPI App
BHIM Baroda PAY
Windows
MacOS
7692
5963
Reference 4
Home Loan
Adjustable-Rate Mortgage (ARM)
BHIM Cent UPI(Central Bank of India)
WhatsApp Pay
Banking Transaction Details
beneficiary_bank
payer_handle
payer_app
payee_handle
payee_app
payee_requested_amount
payee_settlement_amount
payer_location
payer_city
Suryoday Small Finance Bank
ANDB
Bhim Andhra Bank One- UPI App
EQUITAS
Equitas UPI
19934
19934
700124
Barasat
Bank of India
POCKETS
ICICI Pockets
YESPAY
JusPay Technologies
62332
62332
332001
Sikar
Karur Vysya Bank
DNSBANK
DNS Pay (Dombivli Nagrik Sahakari Bank Ltd)
FINOBANK
Fino Bpay(Fino Payments Bank)
56336
56336
332001
Sikar
Cosmos Co-operative Bank
YBL
PhonePe
UCO
BHIM UCO UPI
89722
89722
825301
Hazaribagh
Here's a brief overview of the first few columns to set the stage for our analysis:
Transaction Information
Transaction Information
Field Name
Data Type
Description
txn_id
String
Unique identifier for each transaction.
dt_txn_comp
Date
Date when the transaction was completed.
txn_comp_time
Time
Time when the transaction was completed.
txn_type
String
Type of transaction (withdrawal, deposit, etc.).
txn_subtype
String
Further classification of the transaction type.
initiating_channel_id
String
ID of the channel used to initiate the transaction.
txn_status
String
Status of the transaction (completed, pending, failed).
error_code
String
Error codes encountered during the transaction.
payer_psp
String
Payment Service Provider for the payer.
payee_psp
String
Payment Service Provider for the payee.
remitter_bank
String
Bank of the sender.
beneficiary_bank
String
Bank of the recipient.
payer_handle
String
Identifier for the payer, potentially anonymized.
payer_app
String
Application through which the payer initiated the transaction.
payee_handle
String
Identifier for the payee, potentially anonymized.
payee_app
String
Application used by the payee.
Check Fraud Analytics Dataset to view all Columns 👉 Click Here
The Fraud Analytics Dataset contains over 50,000 Rows of transactions and 34 Distinct Columns. 30 of our Columns have String Data type, 2 of our Columns have Numeric Data Type and 2 of the columns have Date & Time Data Type.
How to Improve Real-time Fraud Detection & Prevention System?
To start with the analysis of fraud detection & prevention system, it is necessary to follow the 4 factors of Data Analysis and that are :
1. Identify the users or stakeholders for the dashboard.
2. Design Empathy Map to define Users' Goals and Challenges or pain points.
3. Identify Metrics or KPIs Matter the Most.
4. Understand the Objectives and Goals.
5. Ask Business Questions
In this article, we will explore how to improve such a system through a user-focused approach, taking the Chief Finance Officer (CFO) as a primary stakeholder.
Step 1: Identify the users or stakeholders for the analysis.
Define the User or stakeholder who will use the supply chain data story. In our case, We’ll use a Chief Finance Officer (CFO). Recognizing their diverse needs, challenges, and priorities becomes the cornerstone for tailoring an effective data story.
User Persona : Chief Finance Officer
Responsibilities:
Ensure the overall financial integrity of the organization & Safeguard financial assets against fraud activities.
Make informed and strategic decisions based on accurate financial information.
Evaluate risks and opportunities to support the organization's financial goals.
Implement policies and procedures to mitigate legal and financial risks.
Optimize resource allocation to maximize financial efficiency.
Allocate budgets strategically to support business objectives.
Needs:
Access to real-time and accurate financial data for effective decision-making.
Intuitive and user-friendly dashboards and tools for efficient monitoring and analysis.
Advanced fraud detection systems that provide proactive alerts and mitigation strategies.
Insights into emerging financial trends, risks, and opportunities to support strategic planning.
Challenges:
Constantly evolving fraud tactics and increasing sophistication of cyber threats.
Minimizing false positives in fraud detection to avoid unnecessary disruption to legitimate transactions.
Managing financial security within budgetary constraints and ensuring cost-effectiveness.
Since, we have defined the User Persona & have mapped the needs, challenges and responsibilities. Our Next step will be to design an Empathy Map which will map the pain points of the user.
Step 2: Design Empathy Map
Understanding the CFO's perspective is crucial for designing an effective fraud detection system. An empathy map helps us delve into their thoughts and feelings, allowing us to tailor the system to meet their specific needs.
Step 3: Identify the Key Performance Indicators (KPI’s)
Numbers tell a story. To gauge the effectiveness of a system, focus on metrics and Key Performance Indicators (KPIs) that truly matter. For fraud detection and prevention, consider:
KPI/Measure
Drill Down Dimensions
Purpose
Fraud Loss Value
Transaction type, Geographical location, Time period
Measure monetary impact, quantify financial losses due to fraud.
Number of Fraud Incidents
Transaction Type, Remitter Bank, Geographical location
Quantify frequency of fraudulent incidents, assess scale of the problem.
Fraud Transaction Rate
Type of transactions
Calculate percentage of fraudulent transactions, identify trends.
Analyze geographical spread of fraud, target high-risk areas.
User Behavior
Payer OS Type, Time of day, Payer payment service provider (PSP)
Monitor patterns in user behavior, detect anomalies for improved security.
Step 4: Understand the Goals & Objectives of User
Now, on the basis of Empathy Map and KPI’s, we need to define our goals and objectives of the Users. So, that it will align with the data story functionalities to ensure decision making. Here are the Key Objectives & Goals :
Objective :
To enhance the security of financial transactions by employing advanced data analysis techniques. This includes real-time monitoring, anomaly detection, and proactive measures to identify, investigate, and mitigate potential instances of credit card fraud, thereby safeguarding the financial system's integrity.
Goals :
Tooptimize resource utilization for enhanced operational efficiency and profitability, support strategic initiatives by planning finance and investment decisions, minimize financial, fraud, and operational risks to safeguard assets and reputation.
Step 5 : Ask Business Questions
Beyond KPI’s, organizations must engage in business-driven inquiry. This involves asking strategic questions that directly align with overarching business objectives.
Ask the following questions:
1. How have fraud incidents fluctuated over the years?
Python Code :
Visualization:
Metrics : Time Period, Volume Metric
Actionable Insight :
The graph shows a clear pattern of fraud incidents with constant spikes in Julyand August. This suggests the need for increased vigilance and preventive measures during these months.
The sharp decline after June may indicate effective countermeasures, which could be replicated to mitigate future spikes.
2. Which types of Credit fraud are causing the most financial damage?
Python :
Visualization:
Metrics : Credit Type, Total Amount of Fraudulent Transactions
Actionable Insight :
Credit card and Personal Loan fraud represent the highest total amounts, indicating these areas are particularly high-risk.
Given the high amount of fraud in credit cards, there could be an opportunity to educate customers on safe credit card practices.
3. Are there certain regions more prone to fraud incidents than others?
Python :
Visualization :
Metrics :Total Fraudulent Transaction Amount, Payer State
Actionable Insight :
States with higher fraudulent transaction amounts like Punjab, Bihar, and West Bengal may benefit from state-specific fraud prevention initiatives.
Punjab, showing the highest total fraudulent transaction amount, should be subjected to more rigorous monitoring and investigative activities.
4. What is the financial impact of fraud across different regions?
Python Code :
Visualization :
Metrics : Total Fraudulent Transaction Amount, Payer State
Actionable Insight :
Haryana, Tamil Nadu, and Odisha are the top states for fraudulent transactions, indicating the need for targeted anti-fraud initiatives in these regions.
5. How does user behavior vary by time of day in terms of number of fraudulent transactions and fraudulent transaction amount?
Python Code :
Visualization :
Metrics : Time of Day, Percentage of Total Fraudulent Transaction Amount
Actionable Insight :
The largest share of fraudulent transactions occurs at night (31.0%), suggesting that fraudsters may prefer times when oversight may be lower and victims less vigilant.
Conclusion
Enhancing our real-time fraud detection and prevention system transcends the realm of technical challenges—it's a collaborative effort.
We need to understand what the CFO worries about and ask the right questions. This way, we can create a system that not only meets but beats the CFO's expectations. That's how we keep our organization's finances safe in today's digital world.
Ready to get started?
Join Data Analysts who use Super AI to build world‑class real‑time data experiences.