T
traeai
登录
返回首页
Databricks

From months to minutes: Building real-time clinical data pipelines with natural language

7.5Score
From months to minutes: Building real-time clinical data pipelines with natural language
AI 深度提炼
  • 自然语言处理可大幅优化临床数据管道的构建效率
  • 实时数据处理在医疗领域的应用潜力巨大
  • Databricks平台支持高效的数据集成与分析
#Databricks#自然语言处理#数据管道#医疗
打开原文

From months to minutes: Building real-time clinical data pipelines with natural language | Databricks Blog

Skip to main content

[![Image 1](blob:http://localhost/c3d26385bd032c882a09c45135533626)](http://www.databricks.com/)

[![Image 2](blob:http://localhost/c3d26385bd032c882a09c45135533626)](http://www.databricks.com/)

  • Why Databricks
  • * Discover
  • Customers
  • Partners
  • Product
  • * Databricks Platform
  • Integrations and Data
  • Pricing
  • Open Source
  • Solutions
  • * Databricks for Industries
  • Cross Industry Solutions
  • Migration & Deployment
  • Solution Accelerators
  • Resources
  • * Learning
  • Events
  • Blog and Podcasts
  • Get Help
  • Dive Deep
  • About
  • * Company
  • Careers
  • Press
  • Security and Trust
  • DATA + AI SUMMIT ![Image 3: Data+ai summit promo JUNE 15–18|SAN FRANCISCO Last chance to save 50% — ends April 30. Register](http://www.databricks.com/dataaisummit?itm_source=www&itm_category=home&itm_page=home&itm_location=navigation&itm_component=navigation&itm_offer=dataaisummit)

1. All blogs 2. / Industries

Table of contents

Table of contents

Table of contents

Healthcare & Life SciencesApril 28, 2026

From months to minutes: Building real-time clinical data pipelines with natural language

by Matthew Giglia, Tim Kessler and Assunta Carey-Saylor

Summary

  • From months to minutes — Building EHR data pipelines used to take weeks of specialized integration work. The Databricks + Redox partnership lets teams stand up real-time clinical data pipelines using natural language prompts, no HL7 experts required.
  • Real-time means actually real-time — Redox's Zerobus streams clinical data directly into Databricks Unity Catalog with subsecond latency, eliminating the intermediary storage layers that turn "real-time" into "delayed."
  • Data that acts, not just informs — AI outputs can be written back into the EHR in real time, closing the loop between insight and intervention — turning Databricks from an analytics system into an operational layer at the point of care.

_This post was co-written by Assunta Carey-Saylor (Senior Product Marketing at Redox), Tim Kessler (Field Chief Technology Officer at Redox), and Matt Giglia (Forward Deployed Engineer for Healthcare & Life Sciences at Databricks)_

Most healthcare data teams spend months building and maintaining pipelines to move data from EHR systems into their analytics environment. Even after that work is done, data is often routed through intermediary storage layers before it can be processed, introducing latency that limits real-time use cases.

The result is a system that is complex to maintain and too slow to act on.

Databricks andRedox have teamed up to change this model, removing two core barriers:

  • The complexity of building healthcare data integrations
  • The latency of getting data into the analytics environment

Now, teams can stand up real-time clinical data pipelines using natural language prompts directly within Databricks, simplifying how data is accessed and activated.

_**TL;DR: Databricks and Redox enable healthcare teams to build real-time clinical data pipelines using natural language, stream data into cloud environments with subsecond latency, and write AI outputs back into the EHR to drive action at the point of care.**_

**The Problem: AI Strategy Is Bottlenecked by Integration + Latency**

Healthcare organizations are under pressure to operationalize AI, but execution consistently bottlenecks at clinical data integration.

Before any models or workflows can be deployed, teams must integrate with EHR systems, normalize formats like HL7, CCD, and X12, and build ETL pipelines. This work is time-intensive and requires specialized expertise.

Even after pipelines are in place, data often flows through intermediary storage layers before reaching Databricks, adding latency and operational overhead. What is labeled “real-time” becomes delayed, and engineering teams remain focused on infrastructure instead of delivering insights.

As a result, AI initiatives stall and time-to-insight stretches from weeks into months.

**A New Approach**

Databricks’Zerobus Ingest and the RedoxMCP Server introduce a different approach for building and operating clinical data pipelines in Databricks.

The Redox MCP Server allows teams to build and manage integrations using natural language, reducing the time and complexity involved in building new data pipelines. Zerobus streams clinical data directly into Databricks Unity Catalog managed tables with subsecond latency, removing the need for intermediary storage layers or workflows to load the files for processing.

Together, they change both how pipelines are built and how quickly data becomes usable. Integration is defined through prompts instead of code, and data is available for processing as it is generated. This means teams can connect to EHR systems, stream clinical data, and activate workflows directly within Databricks.

**What This Looks Like in Practice**

Inside the Databricks platform, a user connects to the Redox MCP Server and begins with a prompt. Within a few interactions:

  • The system identifies available environments and datasets
  • Suggests next steps to complete the workflow
  • Executes integration tasks across systems
  • Surfaces validation signals like logs and performance summaries

Behind the scenes, Zerobus handles the direct delivery of streaming data into Databricks, eliminating the need for staging layers or batch ingestion. This allows each step of the workflow to operate on live data as it arrives.

In one example, the system retrieved a recent patient admission and returned both structured data and a plain-language summary. Complex clinical data became immediately usable for downstream analytics and decisioning.

Image 4: image_102

Expand

_Figure 1. The system summarized a recent patient admission in plain English._

**From Months to Minutes: Accelerating Time-to-Insight**

What previously required weeks or months of integration work can now be initiated in minutes all using natural language without leaving Databricks.

By eliminating intermediary data landing steps with Zerobus and simplifying pipeline creation through natural language prompting, organizations can dramatically reduce both latency and development time. This enables teams to validate AI use cases faster, shorten development cycles, and move from experimentation to production more quickly.

Integration is no longer the gating factor, and data latency is no longer the limiting constraint.

**Reducing Dependency on Specialized Expertise**

Healthcare integration has traditionally required deep expertise in HL7, APIs, and data orchestration.

With the Redox MCP Server, much of that complexity is abstracted away. Organizations can reduce their reliance on HL7 experts and integration specialists, enabling broader teams to work with clinical data.

Data scientists and ML engineers can focus on building models and generating insights instead of managing ETL pipelines. Engineering teams can shift from maintaining integrations to enabling new capabilities.

The focus moves from “how do we get the data” to “how do we use the data.”

**Enabling Real-Time Clinical Intelligence**

With Zerobus, clinical data arrives in Databricks with subsecond latency without intermediate storage layers or batch processing. This allows FHIR to be treated like it was always meant to be treated–as transactional and sent to REST API endpoints. For example, a health plan or provider organization can send prior authorization specific bundles through Redox directly to a dedicated REST API endpoint where it needs to be in Databricks instead of landing in a FHIR Store or as a substantiated FHIR JSON file first.

Image 5: image_95

Expand

This allows ML and AI Agent frameworks to process prior authorization requests and responses instantly. By bypassing the need for additional data pulls or the complex ETL required to parse raw FHIR JSON files, these systems can capture and act on the patient journey as it unfolds, not after the fact.

Paired with Redox EHR writeback capabilities, these insights can be operationalized directly within clinical workflows.AI-generated outputs can be written back into the EHR in real time, closing the loop between data, intelligence, and action at the point of care. This shifts Databricks beyond a system of analytics into an application layer for healthcare operations, where data pipelines and AI models do not just inform decisions, but actively drive interventions inside core clinical systems.

This unlocks a new class of real-time use cases:

  • **Track patient state in real time**

Continuously update admissions, transfers, discharges, and clinical events as they happen

  • **Optimize capacity as conditions change**

Use live patient movement and census signals to improve bed management, staffing, and throughput

  • **Trigger interventions at the right moment**

Engage patients, care teams, or systems based on live clinical signals rather than delayed reports

  • **Detect risk as it emerges**

Identify deterioration, gaps in care, or operational bottlenecks while there is still time to respond

  • **Adapt discharge and care pathways dynamically**

Adjust next steps based on current patient status rather than static snapshots

  • **Synchronize clinical and financial workflows**

Capture events as they occur to improve coding, billing, and revenue cycle accuracy

Now, healthcare organizations can operate with greater agility and responsiveness, aligning their data workflows with the pace of patient care delivery.

**Why This Works: A Trusted Foundation AI Can Act On**

This capability depends on a reliable, standardized clinical data foundation. Redox provides the standardized layer for exchanging healthcare data across EHRs and other downstream systems through a single platform and API.

Zerobus ensures that this data arrives in Databricks without delay. The Redox MCP Server makes that data accessible and actionable through natural language by translating intent into execution while enforcing enterprise-grade security, compliance, and operational controls.

Together, they create a real-time execution layer for clinical data and AI workflows.

**Beyond Pipelines: Building the "Redox Agent"**

And we’re not stopping at data ingestion. Because the Redox MCP server sits alongside Databricks Genie Spaces on top of the clinical data, AI teams can now build Redox Agents on Databricks.

These specialized agents are capable of:

  • Reviewing Redox platform logs to troubleshoot or audit data flows in real-time.
  • Answering questions about the clinical data sent by Redox using natural language.
  • Embedding intelligence directly into clinical applications while maintaining the strict governance, security, and traceability that enterprise healthcare requires.

By combining the power of the Redox MCP and Genie, teams can move from simply moving data to creating conversational, intelligent interfaces that sit right inside the provider's existing workflow.

**See It In Action**

In an upcoming Redox and Databricks webinar on April 30, we’ll demo how Zerobus and the MCP Server work together to deliver real-time data pipelines inside Databricks.

You will see how teams can move from intent to execution using natural language, while operating on live data with subsecond latency.

Register here to explore how this approach applies to your organization.

Get the latest posts in your inbox

Subscribe to our blog and get the latest posts delivered to your inbox.

Sign up

*

Work Email

*

Country Country*

By clicking “Subscribe” I understand that I will receive Databricks communications, and I agree to Databricks processing my personal data in accordance with its Privacy Policy.

Subscribe

View all blogs

![Image 6: databricks logo](https://www.databricks.com/)

Why Databricks

Discover

Customers

Partners

Why Databricks

Discover

Customers

Partners

Product

Databricks Platform

Pricing

Open Source

Integrations and Data

Product

Databricks Platform

Pricing

Open Source

Integrations and Data

Solutions

Databricks For Industries

Cross Industry Solutions

Data Migration

Professional Services

Solution Accelerators

Solutions

Databricks For Industries

Cross Industry Solutions

Data Migration

Professional Services

Solution Accelerators

Resources

Documentation

Customer Support

Community

Learning

Events

Blog and Podcasts

Resources

Documentation

Customer Support

Community

Learning

Events

Blog and Podcasts

About

Company

Careers

Press

Security and Trust

About

Company

Careers

Press

Security and Trust

![Image 8: databricks logo](https://www.databricks.com/)

Databricks Inc.

160 Spear Street, 15th Floor

San Francisco, CA 94105

1-866-330-0121

  • [](https://www.linkedin.com/company/databricks)
  • [](https://www.facebook.com/pages/Databricks/560203607379694)
  • [](https://twitter.com/databricks)
  • [](https://www.databricks.com/feed)
  • [](https://www.glassdoor.com/Overview/Working-at-Databricks-EI_IE954734.11,21.htm)
  • [](https://www.youtube.com/@Databricks)
Image 10

See Careers

at Databricks

  • [](https://www.linkedin.com/company/databricks)
  • [](https://www.facebook.com/pages/Databricks/560203607379694)
  • [](https://twitter.com/databricks)
  • [](https://www.databricks.com/feed)
  • [](https://www.glassdoor.com/Overview/Working-at-Databricks-EI_IE954734.11,21.htm)
  • [](https://www.youtube.com/@Databricks)

© Databricks 2026. All rights reserved. Apache, Apache Spark, Spark, the Spark Logo, Apache Iceberg, Iceberg, and the Apache Iceberg logo are trademarks of the Apache Software Foundation.

We Care About Your Privacy

Databricks uses cookies and similar technologies to enhance site navigation, analyze site usage, personalize content and ads, and as further described in our Cookie Notice. To disable non-essential cookies, click “Reject All”. You can also manage your cookie settings by clicking “Manage Preferences.”

Manage Preferences

Reject All Accept All

Image 13: Databricks Company Logo

Privacy Preference Center

Opt-Out Preference Signal Honored

Privacy Preference Center

  • ### Your Privacy
  • ### Strictly Necessary Cookies
  • ### Performance Cookies
  • ### Functional Cookies
  • ### Targeting Cookies
  • ### TOTHR

#### Your Privacy

When you visit any website, it may store or retrieve information on your browser, mostly in the form of cookies. This information might be about you, your preferences or your device and is mostly used to make the site work as you expect it to. The information does not usually directly identify you, but it can give you a more personalized web experience. Because we respect your right to privacy, you can choose not to allow some types of cookies. Click on the different category headings to find out more and change our default settings. However, blocking some types of cookies may impact your experience of the site and the services we are able to offer.

#### Opting out of sales, sharing, and targeted advertising

Depending on your location, you may have the right to opt out of the “sale” or “sharing” of your personal information or the processing of your personal information for purposes of online “targeted advertising.” You can opt out based on cookies and similar identifiers by disabling optional cookies here. To opt out based on other identifiers (such as your email address), submit a request in our Privacy Request Center.

More information

#### Strictly Necessary Cookies

Always Active

These cookies are necessary for the website to function and cannot be switched off in our systems. They assist with essential site functionality such as setting your privacy preferences, logging in or filling in forms. You can set your browser to block or alert you about these cookies, but some parts of the site will no longer work.

#### Performance Cookies

  • [x] Performance Cookies

These cookies allow us to count visits and traffic sources so we can measure and improve the performance of our site. They help us to know which pages are the most and least popular and see how visitors move around the site.

#### Functional Cookies

  • [x] Functional Cookies

These cookies enable the website to provide enhanced functionality and personalization. They may be set by us or by third party providers whose services we have added to our pages. If you do not allow these cookies then some or all of these services may not function properly.

#### Targeting Cookies

  • [x] Targeting Cookies

These cookies may be set through our site by our advertising partners. They may be used by those companies to build a profile of your interests and show you relevant advertisements on other sites. If you do not allow these cookies, you will experience less targeted advertising.

#### TOTHR

  • [x] TOTHR

Cookie List

Consent Leg.Interest

  • [x] checkbox label label
  • [x] checkbox label label
  • [x] checkbox label label

Clear

  • - [x] checkbox label label

Apply Cancel

Confirm My Choices

Allow All

![Image 14: Powered by Onetrust](https://www.onetrust.com/products/cookie-consent/)

!Image 16!Image 17

Image 18
Image 19