FDA AI/ML Guidelines

The FDA's framework for AI/ML-based Software as a Medical Device (SaMD) requires continuous monitoring and demographic performance tracking.

Overview

Aspect
Details

Scope

AI/ML-based medical devices

Framework

Good Machine Learning Practice (GMLP)

Enforcement

FDA 510(k), De Novo, PMA pathways

Updates

Predetermined Change Control Plan (PCCP)

Key Requirements

Section IV.A: Performance Monitoring

"Manufacturers should monitor AI/ML device performance in the real-world setting"

ATHENA Solution:

  • Real-time trust calibration

  • Continuous accuracy tracking

  • Performance degradation alerts

const engines = await athena.engines.status();
const metrics = await athena.system.metrics();

console.log('System health:', engines.overallStatus);
console.log('Error rate:', metrics.errorRate);

Section IV.B: Demographic Performance

"Manufacturers should evaluate device performance across demographic subgroups"

ATHENA Solution:

  • Subgroup accuracy tracking

  • Demographic disparity detection

  • FDA-format reporting

Section IV.C: Human Factors

"Manufacturers should understand how users interact with AI/ML outputs"

ATHENA Solution:

  • Trust calibration analysis

  • Automation bias detection

  • Override quality tracking

Section V.A: Documentation

"Manufacturers should maintain documentation of AI/ML performance"

ATHENA Solution:

  • Audit trail with full context

  • FDA-format compliance reports

  • Continuous monitoring logs

FDA Report Contents

1. Performance Summary

  • Overall accuracy metrics

  • Sensitivity/specificity

  • Error rate trends

2. Demographic Analysis

  • Performance by age group

  • Performance by sex

  • Performance by ethnicity

  • Performance by clinical indication

3. Human Factors Analysis

  • User calibration scores

  • Automation bias incidents

  • Override patterns

  • Training effectiveness

4. Adverse Events

  • Misdiagnosis incidents

  • User intervention required

  • Near-misses logged

5. Predetermined Changes

  • Model updates applied

  • Performance impact

  • Validation results

Subgroup Requirements

Subgroup
FDA Priority
ATHENA Tracking

Age (pediatric)

High

age_group:0-17

Age (geriatric)

High

age_group:65+

Sex

High

gender

Race/Ethnicity

High

ethnicity

Disease severity

Medium

Custom attribute

Clinical indication

Medium

Custom attribute

Implementation

Step 1: Track Clinical Decisions

Step 2: Monitor Demographics

Step 3: Track User Performance

Step 4: Generate Quarterly Reports

Continuous Monitoring Dashboard

ATHENA Dashboard provides:

Metric
Refresh Rate
Alert Threshold

Overall accuracy

Real-time

<85%

Subgroup accuracy

Hourly

<80%

Automation bias

Real-time

Any detection

Override rate

Hourly

<10% or >50%

Sample FDA Report


Next: California SB-53

Last updated