Quality Agent

Overview

The Quality Agent runs automated data quality checks across your data estate using Great Expectations. It profiles your data, generates intelligent quality expectations based on column types and distributions, and executes validations — surfacing issues proactively so problems are caught before they reach your reports and dashboards.

What You Can Ask

“Check the quality of our customer table”
“Are there any null values in the orders dataset?”
“Run a freshness check on all our staging tables”
“What’s the data quality score for our sales data?”
“Find duplicate records in the user profiles table”
“Validate the web analytics dataset”

How It Works

Analyze dataset structure — The agent queries your warehouse for a data sample (up to 5,000 rows) and creates a comprehensive profile — column types, null percentages, unique counts, value distributions, and statistical summaries.
Generate quality expectations — Based on the data profile, an LLM generates a tailored set of quality checks — typically 10–25 high-confidence expectations covering completeness, accuracy, consistency, and more.
You select which checks to run — The proposed expectations are presented for your review. You choose which ones to execute — keeping you in control of how quality is measured.
Run quality validation — Selected expectations are executed against your data using Great Expectations. Results include per-column pass/fail status, detailed statistics, and an overall quality score.

Quality Dimensions

Completeness

Are required fields populated? What percentage of values are missing or null? Checks null rates against configurable thresholds.

Accuracy

Do values match expected patterns, ranges, and business rules? Validates formats (emails, phones), statistical bounds, and value domains.

Consistency

Is the data coherent across columns and tables? Checks referential integrity, cross-column relationships, and value set membership.

Freshness

How recently was the data updated? Is it within expected SLA windows? Monitors timestamps and update frequency.

Supported Quality Checks

The Quality Agent supports over 50 types of data quality expectations, organized by category:

Category	Example Checks
Completeness	Null value detection, non-null proportion thresholds
Uniqueness	Duplicate detection, compound column uniqueness
Value Ranges	Min/max bounds, mean/median ranges, z-score outlier detection
Pattern Matching	Regex validation for emails, phones, and custom formats
Set Membership	Value-in-set checks, distinct value set validation
Schema	Column count, column ordering, column type verification
Cross-Column	Pair comparisons (A > B), multi-column sum validation
Distribution	Statistical bounds on mean, median, standard deviation

Each expectation includes a severity level (High / Medium / Low) and a confidence score so you can prioritize what matters most.

Quality Reports

Every validation produces a detailed quality report:

Dataset Overview — Total rows, columns, and overall missing value percentage
Column Analysis — Per-column breakdown showing type, null rate, unique count, and quality status
Failed Expectations — Specific details on what failed and why, with counts and examples
Overall Quality Score — Percentage of checks passed, stored as a trackable metric

Quality status per column is displayed as:

Pass — All checks for the column passed
Warning — Some checks passed, some failed
Fail — Critical checks failed for the column

Human-in-the-Loop

The Quality Agent includes a human-in-the-loop step before executing validations:

The agent profiles your data and generates recommended expectations
You review the proposed checks — each with severity, confidence, and description
You select which expectations to run
Only approved checks are executed

This prevents unnecessary validations and ensures quality measurement aligns with your business priorities.

Data Connections

The Quality Agent accesses data through the platform’s secure infrastructure:

Redshift Warehouse

Queries your dedicated Redshift cluster for data samples via cross-account IAM roles. Supports fully qualified table names with schema isolation.

OpenMetadata Catalog

Reads asset metadata to identify tables, resolve names, and record quality execution results back to the catalog.

S3 Storage

Quality reports are stored as formatted documents in your organization’s S3 storage for long-term access and auditing.

Great Expectations

Validations execute in an ephemeral environment — no persistent configuration required. Each run is isolated and stateless.

Works With Other Agents

Governance Agent — Quality checks are part of the governance workflow. Quality scores feed into compliance reporting and data lifecycle tracking.
Analyst Agent — Quality issues are flagged before analysis begins so you know your confidence level in the underlying data.
Engineering Agent — Quality checks validate transformation outputs after dbt models run, ensuring transformations produce correct results.
Metadata Agent — Quality scores and execution history are stored as metadata on data assets in your catalog.

The Quality Agent is part of the BrightAgent architecture. See capabilities for the full list of what BrightAgent can do.

Getting Started

BrightAgent

Platform

Overview

What You Can Ask

How It Works

Quality Dimensions

Completeness

Accuracy

Consistency

Freshness

Supported Quality Checks

Quality Reports

Human-in-the-Loop

Data Connections

Redshift Warehouse

OpenMetadata Catalog

S3 Storage

Great Expectations

Works With Other Agents

Getting Started

BrightAgent

Platform

​Overview

​What You Can Ask

​How It Works

​Quality Dimensions

Completeness

Accuracy

Consistency

Freshness

​Supported Quality Checks

​Quality Reports

​Human-in-the-Loop

​Data Connections

Redshift Warehouse

OpenMetadata Catalog

S3 Storage

Great Expectations

​Works With Other Agents

Overview

What You Can Ask

How It Works

Quality Dimensions

Supported Quality Checks

Quality Reports

Human-in-the-Loop

Data Connections

Works With Other Agents