Nodal Data — How it works

Stage I — Implementation

Stand up AI analytics on top of the systems you already have

Today's leading AI agents already work for analytics. You don't need a specialized AI analyst product — you need the context and trust layer around the agent.

Context

GitHub · Notion · Snowflake Cortex · Atlan

Agents

Claude Code · Codex · Gemini · Claude

Warehouse

Snowflake · BigQuery · Redshift

BI

Sigma · Tableau · Hex · Looker

analyst@nodal — answered

What's our revenue last quarter?

$4.2M

Q4 2025 · ↑ 12% vs Q3 2025

source: fct_revenue (dbt) · finance domain
fresh: 3h ago
trust: 94 / 100 · context match

Stage II — Safe rollout

Catch under-specified questions before they get wrong answers

Non-technical users don't ask fully-specified questions. Nodal makes the gaps visible before SQL runs — so widening access doesn't scale plausible-but-wrong answers. The move from demo to internal deployment runs through the business user; the system has to be usable in the flow of work, not just technically correct.

Question reframed with defaults from your documentation; assumptions in brackets the user can change.
Confidence score from auditable signals — entity resolution, schema grounding, doc coverage, context freshness.
You approve the interpretation, not the SQL.
Every under-specified question becomes a signal — and a candidate test case for the eval suite.
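
As a rough illustration of the two ideas above (bracketed defaults and a score built from auditable signals), here is a minimal sketch. It is not Nodal's implementation: the `Default` structure, the `reframe` and `confidence` helpers, and the weights are all hypothetical, chosen only to show how each assumption and each signal can stay visible and traceable.

```python
from dataclasses import dataclass


@dataclass
class Default:
    """One bracketed assumption in the reframed question (hypothetical shape)."""
    value: str
    source: str            # where the default came from, e.g. a dbt model or a doc page
    assumed: bool = False  # True when neither the docs nor the question fixed it


def reframe(template: str, defaults: dict[str, Default]) -> str:
    """Fill a question template, wrapping every default in [brackets]."""
    rendered = {}
    for name, d in defaults.items():
        suffix = ", assumed" if d.assumed else ""
        rendered[name] = f"[{d.value}{suffix}]"
    return template.format(**rendered)


# Illustrative weights in points; they sum to 100 so the score lands in 0..100.
WEIGHTS = {
    "entity_resolution": 30,
    "schema_grounding": 30,
    "doc_coverage": 20,
    "context_freshness": 20,
}


def confidence(signals: dict[str, float]) -> int:
    """Weighted sum of per-signal scores, each expected in the range 0.0 to 1.0."""
    return round(sum(WEIGHTS[name] * signals[name] for name in WEIGHTS))


defaults = {
    "metric": Default("mean inpatient days", "dbt model fct_encounters"),
    "window": Default("trailing 12 months", "not stated in the question", assumed=True),
}
question = reframe("What is the {metric} per facility over {window}?", defaults)
print(question)

# Made-up signal values, for illustration only.
print(confidence({
    "entity_resolution": 1.0,
    "schema_grounding": 1.0,
    "doc_coverage": 0.5,
    "context_freshness": 0.5,
}))
```

Keeping the source next to each default is what lets the user approve an interpretation instead of reading SQL: every bracket can be traced back to a document or flagged as an assumption.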

Claude

You

How does length of stay compare across facilities in the Northeast?

Claude

Your question, disambiguated

What is the [mean inpatient days] per facility for [all facility types] in [the Northeast region] over [trailing 12 months — assumed]?

Defaults pulled from your documentation. Change any [bracket] before I run it.

Where each default came from

"mean inpatient days" — dbt model fct_encounters: discharge_date - admission_date (Confluence confirmed)
"Northeast" — dim_facilities.region = 'Northeast' (found in 12 dashboards)
"all facility types" — no filter specified; you may want acute care only
"trailing 12 months" — assumed; not stated in your question

Confidence score 71/100

Core metric well-documented across sources
Join path validated in dbt lineage
Time window assumed
Facility scope may be broader than intended

Should I run this, or would you like to change any of the defaults?

You

Narrow to acute care only. Run it.

Stage III — Continuous reliability

Regression tests for AI analytics — every commit, every piece.

The same discipline software engineers apply to code, applied to AI analytics. Every dbt commit, doc edit, prompt change, or model swap triggers a re-run. Drift gets attributed to the specific change that caused it. Accuracy and cost-benefit get measured per piece of the system — not assumed.

Re-run on every change — schema migrations, dbt commits, doc edits, prompt changes, model swaps. Failures get pinned to the commit that caused them, with affected questions, SQL diffs, and result deltas.
Ablation tests on each context source — drop a data dictionary, a Notion page, a glossary entry; measure the answer-quality delta against the token-cost delta.
Model trade-off tests — swap Claude for a cheaper model, Codex for Gemini; read off pass rate vs. cost per run.
Cost optimization stops being a guess — every piece of the system is benchmarked against the trust it actually delivers.
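
The ablation and model trade-off tests above both reduce to comparing two runs of the same eval suite. A minimal sketch of that comparison follows, assuming a hypothetical `RunStats` record and made-up numbers; it does not describe how Nodal stores or computes these figures.

```python
from dataclasses import dataclass


@dataclass
class RunStats:
    """Aggregate result of one eval-suite run (hypothetical shape)."""
    passed: int
    total: int
    tokens: int  # total tokens consumed by the run

    @property
    def pass_rate(self) -> float:
        return self.passed / self.total


def ablation_delta(full: RunStats, ablated: RunStats) -> dict[str, float]:
    """Compare a run with all context against a run with one source dropped."""
    return {
        # Positive value: the dropped source was helping answer quality.
        "pass_rate_lost": full.pass_rate - ablated.pass_rate,
        # Positive value: dropping the source saved tokens.
        "tokens_saved": full.tokens - ablated.tokens,
    }


# Made-up numbers, for illustration only.
full = RunStats(passed=45, total=50, tokens=500_000)
without_glossary = RunStats(passed=42, total=50, tokens=440_000)

delta = ablation_delta(full, without_glossary)
print(f"pass rate lost: {delta['pass_rate_lost']:.1%}")
print(f"tokens saved:   {delta['tokens_saved']:,}")
```

The same comparison works for a model swap: treat the run on the current model as `full` and the run on the cheaper model as `ablated`, then read the quality lost against the cost saved.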

Benchmark Run — April 8, 2026

Trigger: dbt model change (commit a3f8c2d)

92 questions evaluated

88 passed

4 drifted

0 failed

Affected

dim_patients → enrollment_status

Drifted questions

"Active Medicare patients by region" — result changed
"Enrollment trend by quarter" — confidence score dropped 12 points
"Payer mix for active patients" — SQL changed
"Patient count by enrollment status" — result changed

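
One way a run like the one above could sort questions into passed, drifted, and failed is to compare each answer against a stored baseline. The sketch below assumes a hypothetical `Answer` record and an arbitrary confidence-drop threshold of 10 points; the actual definitions Nodal uses for "drifted" and "failed" are not stated here.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Answer:
    """What one eval question produced in a run (hypothetical shape)."""
    sql: str
    result: str        # serialized query result
    confidence: int    # 0..100
    error: Optional[str] = None


def classify(baseline: Answer, current: Answer, max_conf_drop: int = 10) -> tuple[str, list[str]]:
    """Label one question as passed, drifted, or failed, with the reasons."""
    if current.error is not None:
        return "failed", [current.error]
    reasons = []
    if current.result != baseline.result:
        reasons.append("result changed")
    if current.sql != baseline.sql:
        reasons.append("SQL changed")
    drop = baseline.confidence - current.confidence
    if drop >= max_conf_drop:
        reasons.append(f"confidence dropped {drop} points")
    return ("drifted" if reasons else "passed"), reasons


def summarize(baseline: dict[str, Answer], current: dict[str, Answer]) -> Counter:
    """Count statuses across the suite; assumes both runs cover the same questions."""
    return Counter(classify(baseline[q], current[q])[0] for q in baseline)


# Made-up answers, for illustration only.
before = {
    "q1": Answer("SELECT 1", "10", 90),
    "q2": Answer("SELECT 2", "20", 90),
    "q3": Answer("SELECT 3", "30", 90),
}
after = {
    "q1": Answer("SELECT 1", "10", 90),                    # unchanged
    "q2": Answer("SELECT 2", "25", 78),                    # result and confidence moved
    "q3": Answer("SELECT 3", "", 0, error="query error"),  # did not run
}

print(summarize(before, after))
print(classify(before["q2"], after["q2"]))
```

Because each run is tied to the commit that triggered it, the reasons list doubles as the attribution record: it states what moved for each question after that specific change.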

Documentation health report

67% of answered questions relied on dbt column descriptions

23% used Confluence documentation — but 40% of those pages hadn't been updated in over a year

15% lower consistency on questions grounded in stale docs

Self-service analytics means adding AI context to the tools you already have.

Data warehouse(s)

Data lineage

Code as context

Business-context layer

Get this running on your stack.