Healthcare Technology

Primary Health

Healthcare Technology

When Accuracy Is Everything: Primary Health’s AI-Powered Newborn Screening Automation on AWS

Zero

Service interruption — full feature continuity through cutover

1000s

Agent interactions auto-validated for model parity, no manual QA

Live

Heartbeat agent shipped to production before migration completed

Client

Primary Health

Location

San Francisco, California

Industry

Healthcare Technology

Services & Tech

Amazon Textract, AWS Bedrock (Claude Sonnet), AWS Lambda, Amazon API Gateway, Python

Project Overview

Primary Health is a healthcare technology company whose cloud-based platform enables testing, vaccinations, and preventive care programs across the United States. The company’s nurses were manually entering newborn screening (NBS) cards into Electronic Health Record (EHR) systems, a time-consuming, error-prone process that diverted clinical staff from patient care. Avahi delivered a Python-based OCR transcription script combining Amazon Textract with AWS Bedrock for GenAI refinement, automating NBS card data extraction. The solution achieved greater than 95% extraction accuracy, surpassing Primary Health’s prior benchmark of 92–94%, and was delivered as both a REST API endpoint and a CLI tool with a configurable field mapping system for long-term adaptability.

About The Customer

Primary Health is a healthcare technology company headquartered in San Francisco, California. Its cloud-based platform delivers digital health management, data interoperability, and analytics solutions to public health agencies, school districts, community-based organizations, and health systems. The company’s mission is to stop the spread of diseases and reduce illness severity by providing affordable diagnostics and preventive care at scale. Its software streamlines program administration, scheduling, patient communications, and test results reporting, freeing healthcare staff to focus on delivering care.

The Problem

Primary Health’s nurses were manually entering newborn screening (NBS) cards into their EHR systems. This manual process was time-consuming, error-prone, and diverted clinical staff from patient care. The challenge was compounded by extreme form variability: multiple NBS form types existed with no standard format across states and facilities. Field names, layouts, and optional fields varied widely across variants.
Primary Health had previously attempted to automate this process using AWS Bedrock Data Automation combined with Google Document AI, achieving only 92–94% accuracy. This fell short of the standard required for medical documents, where data integrity is paramount. The company’s CTO emphasized that accuracy was the top concern, errors in extraction would erode trust in the system and create an additional burden for clinical staff who would need to manually verify every result.
Further complicating the challenge, the available training data consisted of printed pictures of actual forms rather than originals, introducing blurriness and noise. Real newborn screening cards are medical documents that are difficult to procure for development purposes, forcing any solution to operate within significant data quality constraints.

Why AWS?

Primary Health was already building on AWS and had explored AWS Bedrock Data Automation as part of its initial automation attempts. Amazon Textract provided the OCR extraction capability, and AWS Bedrock offered GenAI refinement for ambiguous fields. AWS Lambda and Amazon API Gateway provided a lightweight deployment model that aligned with the customer’s preference for a simple, script-based solution that their Ruby application could call directly.
Both Amazon Textract and AWS Bedrock provide the ability to opt out of AI training on customer data, an important requirement for processing medical documents. AWS’s breadth of AI and ML services made it the natural platform for combining OCR extraction with GenAI refinement in a single script.

Why Primary Health Chose Avahi

Primary Health’s previous automation attempts had plateaued at 92–94% accuracy using a single-technology approach. The company needed a partner with expertise in AI and machine learning on AWS who could push past the accuracy ceiling that off-the-shelf solutions had not been able to overcome.
Avahi proposed combining Amazon Textract for OCR extraction with AWS Bedrock for GenAI refinement, an approach Primary Health had not previously attempted. Avahi also demonstrated the ability to adapt mid-engagement when the client requested a scope adjustment, delivering a streamlined script-based solution within a compressed timeline without compromising accuracy targets.

Solution

Avahi delivered a Python-based OCR transcription script deployed to AWS Lambda and exposed via Amazon API Gateway as a REST API endpoint, allowing Primary Health’s Ruby application to call it directly. The script also runs locally as a CLI tool for development and testing.
The script uses Amazon Textract as its OCR engine, extracting text and field-level data with per-field confidence scores from NBS forms. Textract proved resilient to the low-quality training data, photographed printouts with blurriness and noise, performing well without preprocessing. The team explored computer vision techniques (deskew, binarization, denoising, contrast enhancement) but validation confirmed Textract produced sufficient quality without them, so the preprocessing pipeline was delivered as an optional standalone component.
For ambiguous or missing fields, such as determining whether “Last Name” refers to the baby or the guardian, or disambiguating specimen type/source fields, the script optionally calls AWS Bedrock (Claude Sonnet) for GenAI refinement. The LLM uses the extracted data and field relationships to resolve ambiguities that pure OCR cannot handle, improving accuracy on handwritten text and non-standard layouts. This refinement is toggled via a flag, so the LLM is only invoked when needed, keeping per-image processing costs low.
To handle the ongoing variability of NBS forms, the script uses a configuration file with fuzzy matching for field name mapping. When a new form variant appears, Primary Health can add field name mappings and modify the output schema by editing the config file, no code changes or vendor support required. The script outputs structured JSON aligned to Primary Health’s schema, including per-field and document-level confidence scores, enabling their backend to apply business logic for downstream EHR integration. It supports JPEG, PNG, and PDF inputs, with automatic compression for large files when using the CLI.
Primary Health originally scoped a broader engagement that included infrastructure, a human review interface, workflow orchestration, and multiple backend APIs. In late November 2025, the customer requested a scope adjustment to accelerate delivery. The project pivoted to script-only delivery, reducing the engagement by 139 hours (42% from the original scope) while preserving all core OCR functionality and accuracy targets. The final demonstration occurred December 22, 2025, and the complete codebase was transferred to Primary Health’s GitHub organization.

Key Deliverables

Python OCR transcription script with Amazon Textract and AWS Bedrock (Claude Sonnet) integration
REST API endpoint via AWS Lambda and Amazon API Gateway for Ruby application integration
CLI tool for local document processing with multi-format support (JPEG, PNG, PDF)
GenAI refinement module for improved accuracy on handwritten text and ambiguous fields
Field-level and document-level confidence scoring
Schema alignment and structured JSON output generation
Config-based field variation system with fuzzy matching
Optional standalone image preprocessing pipeline (deskew, binarization, denoising, contrast enhancement)
API documentation with Ruby code samples
Complete codebase transferred to Primary Health’s GitHub organization

Project Impact

The combined Textract and Bedrock approach achieved greater than 95% extraction accuracy on tested documents, surpassing Primary Health’s prior benchmark of 92–94% from their AWS Bedrock Data Automation and Google Document AI implementation. The solution automates the manual entry of newborn screening cards that nurses were previously inputting into EHR systems, freeing clinical staff to focus on patient care.

The config-based field mapping provides long-term maintainability, enabling Primary Health to independently adapt to new NBS form types as they emerge. The streamlined scope reduced the engagement by 139 hours (42% from the original scope) while delivering the core extraction capability the customer needed. The complete codebase was transferred to Primary Health’s GitHub organization, enabling self-hosting and customization.

Greater than 95% extraction accuracy, up from 92–94% with prior solution
Automated NBS card data entry was previously performed manually by nurses
Multiple NBS form types supported via config-based field mapping
42% scope reduction (139 hours) while preserving all core OCR functionality
Multi-format support: JPEG, PNG, PDF with automatic compression

Your migration, without the risk

Moving off Azure — or planning your own enterprise AI migration?

Avahi takes enterprises from idea to production on AWS in weeks, not months — with validated parity and zero-downtime cutovers. Let’s scope yours.

Keep reading

More healthcare & migration stories

How Thumbprint Furniture Launched a GenAI Furniture Shopping Assistant on AWS Read story

SupportXDR Launches Metarri, a Multi-Agent AI Insights Platform on AWS Read story

From Prompt to Placement: How Avahi Built a Production-Grade GenAI Ad Creative Pipeline for a Leading AdTech Company Read story

How Momentum Financial Services Group Modernized Its Infrastructure and Exited an On-Premises Data Center with AWS Read story

Azure to AWS: How Avahi Migrated GE Healthcare’s Enterprise AI Platform Without Missing a Beat Read story

Expect Moore Consulting Accelerates Client Demos with a GenAI Analytics Platform on AWS Read story

How Nonstop Health Automated Member Support with an AI Voice Agent Built on AWS Read story

92.9% Accurate: Madison Reed’s AI-Powered Hair Color Recommendation Engine, Built on AWS Read story

From Manual to Automated: How Avahi Transformed Corporate Creations’ Document Processing with AWS AI Read story

Avahi Builds Production-Ready AI Course Discovery Agent for EnterOne Using Amazon Bedrock Read story

Automating Real Estate Intelligence: How 3C Technology Solutions Built A GenAI-Powered Document Extraction Pipeline On AWS Read story

When Accuracy Is Everything: Primary Health’s AI-Powered Newborn Screening Automation on AWS Read story

When Accuracy Is Everything: Primary Health’s AI-Powered Newborn Screening Automation on AWS

Client

Primary Health

Location

San Francisco, California

Industry

Healthcare Technology

Services & Tech

Amazon Textract, AWS Bedrock (Claude Sonnet), AWS Lambda, Amazon API Gateway, Python

Project Overview

About The  Customer

The  Problem

Why AWS

Why Primary Health Chose Avahi

Solution

Key Deliverables

Python OCR transcription script with Amazon Textract and AWS Bedrock (Claude Sonnet) integration
REST API endpoint via AWS Lambda and Amazon API Gateway for Ruby application integration
CLI tool for local document processing with multi-format support (JPEG, PNG, PDF)
GenAI refinement module for improved accuracy on handwritten text and ambiguous fields
Field-level and document-level confidence scoring
Schema alignment and structured JSON output generation
Config-based field variation system with fuzzy matching
Optional standalone image preprocessing pipeline (deskew, binarization, denoising, contrast enhancement)
API documentation with Ruby code samples
Complete codebase transferred to Primary Health’s GitHub organization

Project  Impact

Greater than 95% extraction accuracy, up from 92–94% with prior solution
Automated NBS card data entry was previously performed manually by nurses
Multiple NBS form types supported via config-based field mapping
42% scope reduction (139 hours) while preserving all core OCR functionality
Multi-format support: JPEG, PNG, PDF with automatic compression

Ready to Transform Your Business with AI?

Let’s explore your high-impact AI opportunities together in a complimentary session

AI Poc Development Services Built and Funded on AWS

See the full catalog of Al capabilities

Start an Al proof of concept

Explore Our Services

Healthcare Technology

When Accuracy Is Everything: Primary Health’s AI-Powered Newborn Screening Automation on AWS

Client

Location

Industry

Services & Tech

Project Overview

About The Customer

The Problem

Why AWS?

Why Primary Health Chose Avahi

Solution

Key Deliverables

Project Impact

Your migration, without the risk

Moving off Azure — or planning your own enterprise AI migration?

Keep reading

More healthcare & migration stories

When Accuracy Is Everything: Primary Health’s AI-Powered Newborn Screening Automation on AWS

Client

Location

Industry

Services & Tech

Project Overview

About The Customer

The Problem

Why AWS

Why Primary Health Chose Avahi

Solution

Key Deliverables

Project Impact

Ready to Transform Your Business with AI?

About The  Customer

The  Problem

Project  Impact