Benchmarks

Realistic benchmarks for financial AI.

We evaluate AI models on the tasks that matter most to financial institutions—using real data, realistic scenarios, and the metrics that are most relevant for the domain.

FinSpread-Bench

Updated yesterday

The first public benchmark for agentic financial spreading. Evaluates how well AI systems extract, calculate, and reason across financial documents—like bank statements, tax returns, payslips, and financial spreads—in real-world decision scenarios.

Task types

  • Extraction
  • Cross-document reasoning
  • Calculation
  • Structured output

Data source

Anonymized data from Taktile co-development partners

Evaluation method

Automated metrics and expert human evaluation

Last updated

2026-03-04

View benchmark