Blockchain Fraud Detection
Fraud Detection in Blockchain Networks
Research Lead: Prof. Yuanyuan Zhang & Prof. Stephen Chan Primary Focus: Graph-based methods for detecting fraudulent activities in blockchain transactions
Research Overview
Blockchain networks generate massive transaction graphs where fraudulent actors leave detectable patterns. Our research develops advanced graph neural network methods to identify fraud in cryptocurrency ecosystems.
Key Research Areas
Ethereum Transaction Analysis
Detecting fraudulent smart contracts and Ponzi schemes in the Ethereum network.
Multi-Layer Graph Learning
Combining different graph views (transaction, contract, address) for improved detection.
Topology-Aware Contrastive Learning
Novel self-supervised methods that leverage blockchain graph structure.
The Blockchain Fraud Problem
Types of Blockchain Fraud
| Type | Description | Detection Challenge |
|---|---|---|
| Ponzi Schemes | Pyramid-style smart contracts | Complex fund flows |
| Phishing | Fake addresses/contracts | Address similarity |
| Exit Scams | Rug pulls on DeFi | Timing prediction |
| Wash Trading | Artificial volume | Transaction pattern |
| Money Laundering | Fund obfuscation | Multi-hop analysis |
Scale of the Problem
- Over $14 billion lost to crypto fraud in 2022
- Ethereum processes 1M+ transactions daily
- Manual review impossible at blockchain scale
- Fraudsters constantly evolve tactics
Methodology: Multilayer Topology-Aware GCL
Overview
Our published research introduces Multilayer Topology-Aware Graph Contrastive Learning (MT-GCL) for fraud detection.
Architecture
Input: Ethereum Transaction Graph
|
[Layer 1: Transaction Subgraph]
[Layer 2: Contract Call Graph]
[Layer 3: Token Transfer Graph]
|
[Topology-Aware Augmentation]
|
[Contrastive Learning Encoder]
|
[Multi-Layer Fusion]
|
Output: Fraud Probability per Address
Key Innovations
- Multi-Layer Representation: Captures different relationship types simultaneously
- Topology-Aware Augmentation: Preserves important graph structures during learning
- Self-Supervised Pretraining: Reduces need for labeled fraud examples
- Scalable Architecture: Handles millions of addresses
Experimental Results
Dataset: Ethereum Transaction Network
| Statistic | Value |
|---|---|
| Addresses | 2.9M |
| Transactions | 13.5M |
| Smart Contracts | 58K |
| Labeled Frauds | 4,328 |
| Time Period | 2015-2021 |
Performance Comparison
| Method | AUC-ROC | Precision | Recall | F1 |
|---|---|---|---|---|
| Random Forest | 0.721 | 0.189 | 0.312 | 0.235 |
| XGBoost | 0.756 | 0.214 | 0.356 | 0.267 |
| GCN | 0.812 | 0.287 | 0.423 | 0.342 |
| GAT | 0.834 | 0.312 | 0.456 | 0.371 |
| Graph-SAGE | 0.841 | 0.324 | 0.467 | 0.383 |
| MT-GCL (Ours) | 0.891 | 0.398 | 0.534 | 0.456 |
Ablation Study
| Variant | AUC-ROC | Change |
|---|---|---|
| MT-GCL (Full) | 0.891 | - |
| Single Layer | 0.856 | -3.9% |
| No Contrastive | 0.847 | -4.9% |
| Random Augmentation | 0.863 | -3.1% |
| No Pretraining | 0.858 | -3.7% |
Detection Examples
Case Study: Ponzi Scheme Detection
Identified Pattern
Star topology with central contract receiving funds from many addresses and distributing to few early investors.
Case Study: Phishing Attack Detection
Identified Pattern
Address clusters with similar naming patterns targeting legitimate DeFi protocols.
Publications
Multilayer Topology-Aware Graph Contrastive Learning for Fraud Detection in the Ethereum Transaction Network
Journal of the Royal Statistical Society Series A - In Press
Practical Applications
For Exchanges
| Application | Benefit |
|---|---|
| Deposit Screening | Block fraudulent funds |
| Account Monitoring | Detect suspicious behavior |
| Compliance | Meet regulatory requirements |
For Regulators
| Application | Benefit |
|---|---|
| Market Surveillance | Identify market manipulation |
| Investigation Support | Trace illicit funds |
| Risk Assessment | Evaluate platform risks |
For DeFi Protocols
| Application | Benefit |
|---|---|
| Contract Auditing | Identify risky contracts |
| User Protection | Warn about suspicious addresses |
| Insurance | Risk-based pricing |
Code & Resources
| Resource | Status |
|---|---|
| Paper Code | Available on GitHub |
| Trained Models | Available on request |
| Dataset | Ethereum public data |
| Documentation | Paper supplementary |
Future Directions
- Real-Time Detection: Streaming graph analysis for instant fraud alerts
- Cross-Chain Analysis: Detecting fraud across multiple blockchains
- Privacy-Preserving: Detection without revealing sensitive patterns
- Explainable AI: Human-interpretable fraud explanations
Related Research Themes
(c) Joerg Osterrieder 2025