Snowflake · BigQuery · Databricks
Read-only · Audit-grade

Ventra

Find the queries burning your warehouse bill.

Ventra reads your query history, ranks the top money-pits, and ships a pull-request rewrite with a dollar estimate — proven byte-identical before anyone merges.

1,284 queries diagnosed this hour
Avg. customer recovers 38% of spend in 14 days
[Savings preview · acme-data/snowflake: spend drops from $142,400 to $86,200 over D1 to D30 (↓ $56,200 saved · attested on-chain)]
Notion · Ramp · Vercel · Linear · Anthropic · Datadog · Retool · Brex

Teams waste an average of 38% of their warehouse spend.

It rarely shows up as one bad query. It shows up as a thousand small patterns — unpruned scans, repeated joins, oversized compute — quietly compounding into a six-figure leak.

$532,000/yr

Based on the average mid-market warehouse bill of $1.4M/yr, 38% waste.
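
The arithmetic behind that estimate is a single multiplication. A minimal sketch using only the two numbers quoted above (the function name is illustrative, not part of Ventra):

```python
def annual_waste(annual_bill: float, waste_fraction: float) -> float:
    """Dollars per year lost to waste, given a bill and a waste fraction."""
    if not 0.0 <= waste_fraction <= 1.0:
        raise ValueError("waste_fraction must be between 0 and 1")
    return annual_bill * waste_fraction


# $1.4M/yr bill at 38% waste, as stated on this page
print(f"${annual_waste(1_400_000, 0.38):,.0f} per year")
```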

Full table scans

Dashboards re-scan terabytes when only the last 7 days are needed. Predicate pushdown wins back 80–96% of bytes scanned.
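
To see why a date filter matters so much, here is a rough sketch of how many daily partitions a 7-day window lets an engine skip. It assumes a hypothetical table partitioned by day with equally sized partitions and 90 days of history; real engines prune from partition metadata, and actual savings depend on data layout.

```python
from datetime import date, timedelta


def skipped_fraction(partition_days: list[date], today: date, window_days: int) -> float:
    """Fraction of daily partitions that a 'last N days' filter can skip."""
    if not partition_days:
        raise ValueError("need at least one partition")
    cutoff = today - timedelta(days=window_days)
    kept = sum(1 for d in partition_days if d > cutoff)
    return 1 - kept / len(partition_days)


today = date(2026, 5, 31)
# Hypothetical table holding 90 days of history, one partition per day
partitions = [today - timedelta(days=i) for i in range(90)]
print(f"{skipped_fraction(partitions, today, 7):.1%} of partitions skipped")
```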

Redundant recompute

The same aggregate runs 31× per hour for a chart that refreshes hourly. Materialized views collapse the duplicate work.
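
A materialized view does this inside the warehouse. As an application-side analogy only, the sketch below stores a result and recomputes it at most once per refresh interval. The class name and the one-hour TTL are assumptions for illustration.

```python
from typing import Callable, Optional


class CachedAggregate:
    """Recompute at most once per TTL; serve the stored result otherwise."""

    def __init__(self, compute: Callable[[], float], ttl_seconds: float) -> None:
        self._compute = compute
        self._ttl = ttl_seconds
        self._computed_at: Optional[float] = None
        self._value: Optional[float] = None
        self.compute_calls = 0

    def get(self, now: float) -> Optional[float]:
        stale = self._computed_at is None or now - self._computed_at >= self._ttl
        if stale:
            self._value = self._compute()
            self._computed_at = now
            self.compute_calls += 1
        return self._value


# 31 requests spread across one hour against an hourly refresh
agg = CachedAggregate(compute=lambda: 42.0, ttl_seconds=3600)
for i in range(31):
    agg.get(now=i * 3600 / 31)
print(agg.compute_calls, "expensive run(s) for 31 requests")
```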

Oversized warehouses

XL warehouses sit idle 71% of the time. Right-sizing and aggressive auto-suspend cut the line item without touching the SQL.
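
The effect of the auto-suspend setting can be estimated from query timestamps alone. This is a simplified sketch: it assumes non-overlapping queries, ignores any minimum billing increment or resume latency, and the sample intervals are made up.

```python
def billed_seconds(intervals: list[tuple[float, float]], auto_suspend: float) -> float:
    """Seconds a warehouse stays up: busy time plus idle time before each suspend.

    intervals: non-overlapping (start, end) query times, sorted by start.
    """
    billed = 0.0
    for i, (start, end) in enumerate(intervals):
        billed += end - start
        if i + 1 < len(intervals):
            gap = intervals[i + 1][0] - end
        else:
            gap = auto_suspend  # after the last query, idle until suspend fires
        billed += min(gap, auto_suspend)
    return billed


queries = [(0, 60), (600, 660), (3000, 3060)]  # hypothetical one-minute queries
print(billed_seconds(queries, auto_suspend=600))  # 10-minute suspend
print(billed_seconds(queries, auto_suspend=60))   # 1-minute suspend
```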

Stale clustering keys

When clustering keys drift from filter patterns, micro-partition pruning collapses. Re-clustering restores 90%+ pruning ratios.
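
A pruning ratio is simply the share of micro-partitions a query never has to read. The sketch below plugs in the partition counts from the FCT_SESSIONS example further down this page; the function name is illustrative.

```python
def pruning_ratio(scanned: int, total: int) -> float:
    """Share of micro-partitions skipped by a query."""
    if total <= 0 or not 0 <= scanned <= total:
        raise ValueError("expected 0 <= scanned <= total and total > 0")
    return 1 - scanned / total


print(f"before: {pruning_ratio(41_802, 41_802):.1%} pruned")
print(f"after:  {pruning_ratio(318, 41_802):.1%} pruned")
```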

A five-step path from query history to a merged savings PR.

01

Ingest

Read-only pull of QUERY_HISTORY, billing & credit usage. Connects in 8 minutes.

02

Attribute

Every query gets a dollar cost. Patterns are clustered and ranked by spend.
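
One plausible way to do this kind of attribution is to strip literals so that queries of the same shape share a fingerprint, then sum cost per fingerprint. The sketch below is an assumption about the general technique, not Ventra's actual method; the `q_` prefix only mimics the fingerprints shown on this page, and the $3.00 per credit price is hypothetical.

```python
import hashlib
import re
from collections import defaultdict


def fingerprint(sql: str) -> str:
    """Normalize literals and whitespace so the same query shape hashes alike."""
    text = re.sub(r"'[^']*'", "?", sql)            # string literals
    text = re.sub(r"\b\d+(\.\d+)?\b", "?", text)   # numeric literals
    text = re.sub(r"\s+", " ", text).strip().lower()
    return "q_" + hashlib.sha256(text.encode("utf-8")).hexdigest()[:6]


def rank_by_spend(runs, dollars_per_credit: float):
    """runs: iterable of (sql, credits_used). Returns [(fingerprint, dollars)], highest first."""
    totals = defaultdict(float)
    for sql, credits in runs:
        totals[fingerprint(sql)] += credits * dollars_per_credit
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)


runs = [
    ("SELECT * FROM t WHERE id = 1", 2.0),
    ("select *  from t where id = 2", 3.0),
    ("SELECT a FROM u WHERE name = 'x'", 1.0),
]
ranked = rank_by_spend(runs, dollars_per_credit=3.0)  # hypothetical price
for fp, dollars in ranked:
    print(fp, f"${dollars:.2f}")
```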

03

Diagnose

Rules engine + LLM proposes a concrete fix: rewrite, materialize, cluster, right-size.
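
The rules half of that step can be pictured as a list of pattern checks, each paired with a suggested fix. The sketch below uses naive text heuristics and made-up fix wording purely for illustration; a production diagnosis would more likely inspect query plans, and the LLM half is not modeled here.

```python
import re
from typing import Callable, NamedTuple


class Rule(NamedTuple):
    pattern: str                     # label, borrowed from the patterns named on this page
    matches: Callable[[str], bool]   # naive text check (an assumption, not a plan analysis)
    fix: str


RULES = [
    Rule("SELECT *",
         lambda q: re.search(r"\bselect\s+\*", q, re.I) is not None,
         "list only the columns the consumer reads"),
    Rule("Full Scan",
         lambda q: re.search(r"\bwhere\b", q, re.I) is None,
         "add a filter on the partition or cluster column"),
    Rule("Cartesian Join",
         lambda q: re.search(r"\bcross\s+join\b", q, re.I) is not None,
         "replace with a keyed join"),
]


def diagnose(sql: str) -> list[tuple[str, str]]:
    """Return (pattern, suggested fix) for every rule the query trips."""
    return [(rule.pattern, rule.fix) for rule in RULES if rule.matches(sql)]


for pattern, fix in diagnose("SELECT * FROM events"):
    print(f"{pattern}: {fix}")
```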

04

Ship PR

Pull request against your dbt or SQL repo with a verified dollar estimate attached.

05

Prove

Shadow-runs the rewrite against a sample. Byte-identical results before merge.

Top 20 queries. One PR each.

Hover a row to expand the before/after diff. Every fix carries a dollar estimate validated against the last 30 days of actual runs.

Rank  Fingerprint  Pattern              Runs/day  $/mo    Est. savings
#1    q_8dea9c     Missing Cluster Key  261       $26.7K  $12,793
before → after · sql.diff
SELECT *
FROM PROD.MART.FCT_SESSIONS
WHERE account_id = '8f02a1'
AND event_date BETWEEN '2026-05-01' AND '2026-05-31';
-- Bytes scanned: 1.8 TB Partitions: 41,802 / 41,802
+ALTER TABLE PROD.MART.FCT_SESSIONS
+ CLUSTER BY (account_id, event_date);
+
+SELECT *
+FROM PROD.MART.FCT_SESSIONS
+WHERE account_id = '8f02a1'
+ AND event_date BETWEEN '2026-05-01' AND '2026-05-31';
+
+-- Bytes scanned: 24 GB Partitions: 318 / 41,802
Diagnosis

Predicates on (account_id, event_date) prune 0% of micro-partitions on PROD.MART.FCT_SESSIONS. Clustering by (account_id, event_date) reduces partitions scanned from 41,802 to 318 on the p95 query.

Scan / run: 798 MB
Compute / run: 302s
Est. savings: $12,793 / mo
Confidence: Medium
Validated against last 30 days of runs
#2    q_b0f940     Redundant Recompute  292       $25.8K  $16,351
#3    q_9595b4     Redundant Recompute  64        $25.7K  $14,895
#4    q_1a3744     Redundant Recompute  412       $24.9K  $13,115
#5    q_2f3789     SELECT *             311       $23.9K  $8,899
#6    q_35edf7     Redundant Recompute  173       $22.9K  $8,476
#7    q_0ae49e     Cartesian Join       307       $22.2K  $16,970
#8    q_d7ea42     Full Scan            239       $21.8K  $11,242
#9    q_4cc2a1     Oversized Warehouse  88        $20.4K  $9,810
#10   q_61b03d     Stale Cluster Key    145       $19.7K  $7,612

Nothing merges without byte-identical proof.

Every suggestion gets a shadow-run that executes the rewrite on a sample of production data and compares results to the original, bit for bit. Only when they match does the PR open.

  • Read-only credentials. No DDL, no writes.
  • Sampled execution, never re-runs the full workload.
  • Suggestion-only. Your engineers approve the PR.
  • Cryptographic attestation per fix, auditable forever.
  • SOC 2 Type II. SSO, SCIM, and audit log export.
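
The comparison at the heart of a shadow-run can be sketched as: execute both queries on the same sample, serialize the rows, and compare digests. The example below uses an in-memory SQLite table as a stand-in for a warehouse sample. The table, the queries, and the choice to sort rows before hashing (so that row order alone does not fail the check) are all assumptions for illustration.

```python
import hashlib
import sqlite3


def result_digest(conn: sqlite3.Connection, sql: str) -> str:
    """SHA-256 over the serialized result rows, independent of row order."""
    rows = conn.execute(sql).fetchall()
    digest = hashlib.sha256()
    for line in sorted(repr(row) for row in rows):
        digest.update(line.encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


def shadow_run(conn: sqlite3.Connection, original_sql: str, rewrite_sql: str) -> bool:
    """True only when both queries return identical results on the sample."""
    return result_digest(conn, original_sql) == result_digest(conn, rewrite_sql)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sessions (account_id TEXT, event_date TEXT, n INTEGER)")
conn.executemany("INSERT INTO sessions VALUES (?, ?, ?)", [
    ("8f02a1", "2026-05-01", 3),
    ("8f02a1", "2026-05-02", 5),
    ("other", "2026-05-01", 7),
])

original = ("SELECT account_id, SUM(n) FROM sessions "
            "GROUP BY account_id HAVING account_id = '8f02a1'")
rewrite = ("SELECT account_id, SUM(n) FROM sessions "
           "WHERE account_id = '8f02a1' GROUP BY account_id")
print("match:", shadow_run(conn, original, rewrite))
```

A rewrite that drops the account filter would return an extra row, so its digest would differ and the check would fail.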
Shadow run · q_a3f8e1 · passed
10,000 sampled rows compared ✓ byte-identical

Every saving, attested.

When a fix passes its shadow-run, we cryptographically attest the measured delta on-chain. Finance gets a tamper-proof trail. Billing settles trustlessly against verified savings, never against estimates.
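
The page does not describe the attestation format, so the following is only a generic illustration of why a chained hash makes a savings record tamper-evident: each digest covers the previous one, so changing any earlier entry changes every digest after it. The record fields and function names are hypothetical; the fingerprints and dollar amounts come from the ledger on this page.

```python
import hashlib
import json


def attest(prev_digest: str, fingerprint: str, saved_usd: int) -> str:
    """Digest of one savings record, chained to the previous digest."""
    record = json.dumps(
        {"prev": prev_digest, "query": fingerprint, "saved_usd": saved_usd},
        sort_keys=True,
    )
    return hashlib.sha256(record.encode("utf-8")).hexdigest()


def build_ledger(entries: list[tuple[str, int]]) -> list[str]:
    """Chain of digests, one per (fingerprint, saved_usd) entry."""
    digests, prev = [], "0" * 64
    for fingerprint, saved_usd in entries:
        prev = attest(prev, fingerprint, saved_usd)
        digests.append(prev)
    return digests


entries = [("q_8dea9c", 12_793), ("q_b0f940", 16_351)]
ledger = build_ledger(entries)
tampered = build_ledger([("q_8dea9c", 99_999), ("q_b0f940", 16_351)])
print("later digest changes after tampering:", ledger[-1] != tampered[-1])
```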

Savings ledger · last 5 attestations · verified
0xfe6ee1…240c1b  block 21,448,044  q_8dea9c  $12,793
0xd1ebd3…a3433e  block 21,448,216  q_b0f940  $16,351
0x3c148a…023ff5  block 21,448,301  q_9595b4  $14,895
0x765871…a94735  block 21,448,448  q_1a3744  $13,115
0x9ab2c4…7710ea  block 21,448,612  q_2f3789  $8,899

Plugs into the stack you already run.

  • Snowflake
  • BigQuery
  • Databricks
  • Redshift
  • Trino
  • dbt Core
  • dbt Cloud
  • SQLMesh
  • Coalesce
  • GitHub
  • GitLab
  • Bitbucket
  • Azure DevOps
  • Datadog
  • Grafana
  • Slack
  • PagerDuty
  • Linear

What teams say after the first scan.

Ventra found six figures of waste in our first week. Two PRs, both merged, both byte-identical.
Head of Data, Series C fintech
It reads like a senior data engineer left a sticky note on every bad query — with the dollar number attached.
Analytics Engineering Lead
Finance loves the on-chain ledger. We finally stopped arguing about whether the savings were real.
VP Finance, mid-market SaaS

Questions teams ask before the first scan.

Don't see yours? Talk to an engineer.

Will Ventra ever modify our warehouse?

No. Credentials are read-only and Ventra never executes DDL or writes. Every suggestion ships as a pull request your engineers review and merge.

How accurate are the savings estimates?

Every estimate is validated by shadow-running the rewrite on a sample of the last 30 days of actual production traffic. The dollar number on the PR is the measured delta, not a model guess.

What data leaves our environment?

Query text, plan, and billing metadata. Never row data. SOC 2 Type II controls apply throughout; VPC peering and on-prem deployment are available on Enterprise.

How long does a connection take?

About 8 minutes. You create a read-only role, paste credentials, and the first report is ready in under an hour.

What is the on-chain attestation actually for?

Finance teams need a tamper-proof trail of which fixes saved how much. The attestation is the cryptographic receipt that the Ventra fee is calculated against.

Do you support BigQuery and Databricks?

Yes — Snowflake, BigQuery, Databricks and Redshift are first-class. Trino is in beta.

Find the queries burning your bill.

14-day audit. Read-only. No commitment. You only pay a slice of savings we prove.