Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

PRAGMA: Revolut's Foundation Model for Banking Event Sequences
AI ResearchScore: 72

PRAGMA: Revolut's Foundation Model for Banking Event Sequences

A new research paper introduces PRAGMA, a family of foundation models designed specifically for multi-source banking event sequences. The model uses masked modeling on a large corpus of financial records to create general-purpose embeddings that achieve strong performance on downstream tasks like fraud detection with minimal fine-tuning.

GAla Smith & AI Research Desk·14h ago·6 min read·3 views·AI-Generated
Share:
Source: arxiv.orgvia arxiv_irSingle Source

What Happened

On April 9, 2026, researchers associated with digital banking platform Revolut published a preprint paper on arXiv titled "PRAGMA: Revolut Foundation Model." The paper presents a specialized foundation model architecture designed specifically for processing the complex, heterogeneous sequences of events generated by modern financial systems.

PRAGMA (which appears to be an acronym though not explicitly defined in the abstract) represents a significant departure from generic large language models. Instead of training on web-scale text corpora, the model is pre-trained on what the authors describe as "a large-scale, heterogeneous banking event corpus" using a self-supervised objective tailored to financial data's discrete, variable-length nature.

The core innovation lies in treating banking events—transactions, logins, account changes, and other financial activities—as sequences that can be modeled using transformer architectures with masked modeling techniques. This approach allows the model to learn rich representations of financial behavior patterns without explicit labeling for specific tasks.

Technical Details

The PRAGMA architecture builds on transformer-based foundations but adapts them specifically for financial event sequences. The key technical contributions appear to be:

  1. Domain-Specific Pre-training: Unlike general-purpose LLMs, PRAGMA is trained exclusively on banking event data, allowing it to develop specialized understanding of financial patterns, temporal dependencies, and economic signals encoded in transaction sequences.

  2. Masked Modeling for Financial Events: The researchers developed a self-supervised objective specifically designed for the discrete, variable-length nature of financial records. This likely involves masking certain events in sequences and training the model to predict them based on context—similar to how BERT masks tokens in text but adapted for financial event types, amounts, timestamps, and metadata.

  3. Multi-Source Integration: The model handles "heterogeneous" banking events, suggesting it can process data from multiple sources within a financial ecosystem—potentially including transaction data, customer service interactions, app usage patterns, and external economic indicators.

  4. Efficient Downstream Adaptation: The paper claims that "strong performance can be achieved by training a simple linear model on top of the extracted embeddings" and can be further improved with "lightweight fine-tuning." This suggests the model produces high-quality, task-agnostic representations that transfer well to specific applications.

The researchers conducted extensive evaluations on downstream tasks including credit scoring, fraud detection, and lifetime value prediction, demonstrating superior performance compared to existing approaches.

Retail & Luxury Implications

While PRAGMA is explicitly designed for banking applications, the underlying methodology has significant implications for luxury and retail companies that operate their own financial services or handle complex customer transaction data.

Figure 3:Tokenisation overview.A raw event record is decomposed into a temporal coordinate, semantic types (keys), an

Direct Applications for Retail Financial Services:
Luxury groups with captive financial arms—such as credit cards, installment payment plans, or branded banking services—could leverage similar foundation models to:

  • Enhanced Fraud Detection: Model customer transaction patterns across both retail purchases and financial activities to identify sophisticated fraud schemes that might bypass traditional rule-based systems.
  • Personalized Credit Scoring: Develop more nuanced creditworthiness assessments that consider not just traditional financial metrics but also luxury purchase patterns, brand loyalty, and customer lifetime value.
  • Customer Value Prediction: Better predict which customers will generate the highest lifetime value across both retail and financial dimensions.

Broader Pattern Recognition Applications:
The sequence modeling approach could be adapted for:

  • Customer Journey Modeling: Treating customer interactions (website visits, store visits, purchases, returns, customer service contacts) as event sequences to predict future behavior and optimize engagement strategies.
  • Supply Chain Event Prediction: Modeling sequences of supply chain events (production milestones, shipping updates, customs clearances) to predict delays and optimize inventory management.
  • Multi-Channel Behavior Analysis: Integrating events from physical stores, e-commerce platforms, mobile apps, and customer service into unified sequence models for holistic customer understanding.

Technical Transfer Potential:
The paper demonstrates that domain-specific foundation models pre-trained on proprietary event data can outperform general-purpose models. This validates an approach luxury retailers could adopt: building specialized foundation models on their unique customer interaction data rather than relying solely on generic AI solutions.

Implementation Considerations

For retail and luxury companies considering similar approaches:

Figure 2:Event timeline overview.After account creation, users generate a sequence of platform interactions over time

Data Requirements: Building such models requires large-scale, high-quality event sequence data. Companies would need robust data infrastructure to collect, clean, and sequence customer interactions across all touchpoints.

Privacy and Compliance: Financial event data is highly sensitive. Any implementation would require stringent privacy protections, potentially including federated learning approaches or differential privacy techniques.

Technical Expertise: Developing and maintaining foundation models requires significant ML engineering resources. Most luxury brands would likely partner with specialized AI firms rather than building such capabilities in-house.

Integration Complexity: Deploying these models into production systems—particularly legacy retail and financial systems—presents significant integration challenges.

Business Impact Assessment

The PRAGMA research suggests several potential business impacts for retail and luxury:

Figure 1:A single architecture from 10M to 1B parameters that outperforms task-specific models across tasks.

  1. Improved Risk Management: Better fraud detection could reduce losses from fraudulent transactions while minimizing false positives that inconvenience legitimate customers.

  2. Enhanced Customer Insights: Sequence models could reveal previously unrecognized patterns in customer behavior, enabling more personalized marketing and service offerings.

  3. Operational Efficiency: Automated analysis of event sequences could reduce manual review processes in fraud detection, credit assessment, and customer service.

However, it's important to note that this is a research paper, not a production system. The actual business impact would depend on successful implementation, which involves significant technical and organizational challenges beyond what's demonstrated in the academic research.

Governance & Risk Assessment

Data Privacy Risks: Modeling financial event sequences raises significant privacy concerns. Companies would need to ensure compliance with GDPR, CCPA, and financial regulations while maintaining customer trust.

Bias and Fairness: Foundation models trained on historical data can perpetuate existing biases in credit scoring, fraud detection, and customer valuation. Rigorous bias testing and mitigation would be essential.

Model Explainability: Black-box foundation models may struggle to meet regulatory requirements for explainability in financial decisions (like credit denials). Companies would need to invest in explainability techniques or hybrid approaches.

Dependency Risk: Building critical business functions on complex foundation models creates new dependencies and potential single points of failure.

Maturity Level: This is early-stage research. While promising, the approach hasn't been proven at scale in production retail environments. Companies should consider pilot projects before large-scale deployment.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

The PRAGMA paper represents an important trend toward domain-specific foundation models—a shift we've been tracking across multiple sectors. This follows a broader pattern of specialized AI models outperforming general-purpose solutions for specific business domains. The approach aligns with what we've seen in retail-specific AI research, such as the CoDiS framework for sequential recommendations that we covered on April 10, which also focuses on modeling customer interaction sequences. For luxury retail, the most immediate relevance isn't in adopting PRAGMA itself—which is explicitly designed for banking—but in recognizing the validity of the underlying methodology. Luxury companies with proprietary customer data (purchase histories, service interactions, loyalty program activities) could develop similar sequence models tailored to their specific needs. This is particularly relevant given the trend toward integrated retail-financial services among luxury groups. The paper also reinforces the value of self-supervised pre-training on proprietary data, which we've seen in other retail AI contexts. Rather than fine-tuning generic LLMs, companies with sufficient data can build specialized foundation models that better capture their unique business patterns. However, this requires significant technical investment—an approach more feasible for large luxury conglomerates than smaller brands. Interestingly, this research emerges amid ongoing discussions about the limitations of retrieval-augmented generation (RAG) systems, which we've covered extensively. While PRAGMA isn't a RAG system, it represents an alternative approach to incorporating domain knowledge: instead of retrieving relevant information at inference time, the domain knowledge is baked into the model during pre-training. This trade-off between flexibility (RAG) and performance (specialized foundation models) is becoming a key strategic decision for AI implementation in retail.

Mentioned in this article

Enjoyed this article?
Share:

Related Articles

More in AI Research

View all