A Specialized and Secure AI Orchestrator for Swiss Financial Compliance
View the Project on GitHub Digital-AI-Finance/wecan-innosuisse-ai-draft
Home > Overview
Develop an on-premise AI system that automates document processing for Swiss financial institutions while ensuring data sovereignty. The solution combines advanced OCR with large language models to transform manual document handling into automated workflows.
Swiss financial institutions process thousands of compliance documents annually:
| Document Type | Typical Length | Current Processing |
|---|---|---|
| KYC Forms | 20-50 pages | 3-5 days manual |
| Regulatory Filings | 50-100 pages | 1-2 weeks manual |
| Compliance Questionnaires | 30-80 pages | 1 week manual |
| Annual Reports | 100+ pages | 2-3 weeks manual |
+------------------+ +------------------+ +------------------+
| DOCUMENT | | DYNAMIC | | INTELLIGENT |
| DIGITIZATION | --> | FIELD MAPPING | --> | PRE-FILLING |
+------------------+ +------------------+ +------------------+
| - OCR extraction | | - Zero-shot | | - PDF population |
| - LLM processing | | schema mapping | | - Excel filling |
| - 50-100 pages | | - CRM integration| | - Variable forms |
| - Scanned docs | | - F1 > 85% | | - Checkboxes |
+------------------+ +------------------+ +------------------+
| Feature | Description |
|---|---|
| OCR Engine | PyMuPDF + Tesseract + PaddleOCR ensemble |
| LLM Processing | 7-13B parameter domain-adapted models |
| Document Types | PDF, scanned images, mixed layouts |
| Languages | German, French, Italian, English |
| Page Handling | Hierarchical attention for 50-100 pages |
| Feature | Description |
|---|---|
| Schema Discovery | Automatic detection of target fields |
| Zero-Shot Mapping | No manual template configuration |
| CRM Integration | Direct API connection to client systems |
| Validation | Confidence scoring per field |
| Learning | Improves from corrections |
| Feature | Description |
|---|---|
| Output Formats | PDF, Excel, Word |
| Field Types | Text, checkboxes, dropdowns, tables |
| Variable Layouts | Adapts to document structure |
| Review Interface | Human-in-the-loop verification |
| Export | Direct to CRM or file download |
| Component | Technology | Purpose |
|---|---|---|
| Base Models | Llama 3, Mistral, Qwen | Foundation models (7-13B) |
| Fine-tuning | LoRA, QLoRA adapters | Domain adaptation |
| OCR | Tesseract, PaddleOCR, Docling | Text extraction |
| Embeddings | Sentence-BERT, E5 | Semantic matching |
| Orchestration | LangChain, LlamaIndex | Pipeline management |
| Component | Technology | Purpose |
|---|---|---|
| Compute | NVIDIA A100/RTX 4090 | On-premise inference |
| Storage | Local NVMe + encrypted backup | Document storage |
| API | FastAPI, REST | Integration layer |
| Monitoring | Prometheus, Grafana | Performance tracking |
| Security | TLS 1.3, AES-256 | Data protection |
| Metric | Current | Target | Improvement |
|---|---|---|---|
| Processing time | 2-3 weeks | 1-2 hours | 99% reduction |
| Error rate | 20-30% | <10% | 66% reduction |
| Staff efficiency | 1 doc/week | 10+ docs/day | 50x increase |
| Cost per document | CHF 500-1000 | CHF 50-100 | 90% reduction |
| Year | Clients | Revenue | Notes |
|---|---|---|---|
| Year 1 | 10-15 | CHF 0.5M | Design partners |
| Year 2 | 15-25 | CHF 0.8M | Beta customers |
| Year 3 | 40-50 | CHF 1.5M | Market entry |
| Year 4 | 50-60 | CHF 2M | Scale-up |
| Year 5 | 80-90 | CHF 3M | Market leadership |
| Risk | Likelihood | Impact | Mitigation |
|---|---|---|---|
| Model accuracy insufficient | Medium | High | Ensemble methods, human review |
| Document quality issues | Medium | Medium | Quality filtering, preprocessing |
| Integration complexity | Low | Medium | API-first design, staging |
| Regulatory changes | Low | High | Modular architecture, rapid adaptation |
| Resource constraints | Low | Medium | Efficient fine-tuning (LoRA) |
The project will be considered successful if:
| Back to Home | Next: Objectives |