5 数据基础和科技栈缺口阻碍您的AI代理

Elastic Blog

Elastic Blog2026年5月26日

5 数据基础和科技栈缺口阻碍您的AI代理

8.5内容质量

TL;DR · AI 摘要

企业技术已经从仅提供行动建议的AI助手转向自主行动的AI代理，这带来了效率提升但同时也给技术领导者带来新的挑战。成功的组织通过构建基础能力来支持自主系统，而不是盲目建设代理。文章列出了五个阻碍AI代理发展的关键差距，并提供了相应的解决方案。

核心要点

成功的企业通过构建基础能力来支持自主系统，而不是盲目建设代理。
数据访问性和质量是AI项目成功的关键，需要建立统一的数据访问层、实时数据管道和自动化数据质量监控。
上下文工程能力对于大型语言模型的成功至关重要，需要实现检索增强生成（RAG）、管理内存策略和优化工具可用性。

结构提纲

按章节快速跳转。

§引言
企业技术已从AI助手转向自主行动的AI代理，带来效率提升但挑战也随之增加。
·成功的关键在于基础能力
成功的组织通过构建基础能力来支持自主系统，而不是盲目建设代理。
·数据访问性和质量
数据访问性和质量是AI项目成功的关键，需要建立统一的数据访问层、实时数据管道和自动化数据质量监控。
·上下文工程能力
上下文工程能力对于大型语言模型的成功至关重要，需要实现检索增强生成（RAG）、管理内存策略和优化工具可用性。
·案例研究：Laptop Refresh Automation
通过解决数据精度问题，Elastic成功实施了笔记本刷新自动化，展示了基础能力的重要性。

思维导图

用一张图看清主题之间的关系。

查看大纲文本（无障碍 / 无 JS 友好）

AI代理发展中的关键差距
- 数据访问性和质量
  - 统一的数据访问层
  - 实时数据管道
  - 自动化数据质量监控
  - 语义搜索能力
- 上下文工程能力
  - 检索增强生成（RAG）
  - 管理内存策略
  - 优化工具可用性
- 遗留系统集成挑战
- AI性能监控不足
- 缺乏治理和组织结构

金句 / Highlights

值得收藏与分享的关键句。

成功的企业通过构建基础能力来支持自主系统，而不是盲目建设代理。
— Gap 2
⬇︎ 下载 PNG 𝕏 分享到 X
数据访问性和质量是AI项目成功的关键，需要建立统一的数据访问层、实时数据管道和自动化数据质量监控。
— Gap 2
⬇︎ 下载 PNG 𝕏 分享到 X
上下文工程能力对于大型语言模型的成功至关重要，需要实现检索增强生成（RAG）、管理内存策略和优化工具可用性。
— Gap 2
⬇︎ 下载 PNG 𝕏 分享到 X

#AI#企业技术#自主系统#数据管理#上下文工程

打开原文

Image 1: getty-images-tWYVqVzsN5w-unsplash.jpeg

We have crossed a critical threshold in enterprise technology. The industry has shifted from AI assistants that merely suggest actions to AI agents that autonomously act on our behalf.

This evolution promises massive efficiency gains, but it also creates a new dilemma for technology leaders. CIOs are caught between board pressure to adopt autonomous systems and the quiet fear that their underlying infrastructure is entirely unprepared.

The organizations that will succeed with agentic AI are not diving headfirst into building agents. They are building the foundational capabilities that any autonomous system requires.

As senior director of enterprise technology and innovation at Elastic, I’ve seen firsthand how the right foundation can transform AI projects from costly experiments into scalable solutions. When we recently enabled laptop refresh automation, for example, we quickly learned that our data needed more precision before the agent could be rolled out wider. So, now we're building an asset management system that will help provide the structured foundation we need to scale. Addressing these gaps ensures your efforts actually drive business value.

Gap 1: Data accessibility and quality

Gap 2: Context engineering capabilities

Gap 3: Legacy system integration challenges

Gap 4: Inadequate AI performance monitoring

Gap 5: Missing governance and organizational structure

Gap 1: Data accessibility and quality

Quality data is the fundamental bedrock of effective AI. Without it, even the most advanced models will produce inaccurate or irrelevant results that destroy user trust.

Data needs to be accurate, clean, and managed under clear governance. You can push all the data into an AI tool, but if the quality is not there, the output will not tell the right story.

Autonomous agents need complete, real-time data to make decisions. When data is scattered across 50 different systems with inconsistent quality standards, your AI will inevitably hallucinate or fail.

Organizations need systems that can ingest, process, and retrieve data in real time while maintaining security and compliance standards. Poor or siloed data will always show up in the results.

To address the data accessibility gap, you must implement the following solutions:

Build a unified data access layer that connects all critical data sources through a single platform.

Deploy real-time data pipelines instead of relying on batch processing.

Establish automated data quality monitoring to catch errors before they reach your models.

Incorporate semantic search capabilities so that agents can find relevant concepts rather than just exact keyword matches.

Gap 2: Context engineering capabilities

Large language models (LLMs) are powerful, but they possess a fundamental limitation. An LLM's performance is not solely a function of its static internal knowledge, which is frozen at its last training date.

Its practical success critically depends on the external information and tools provided to it at the moment of inference. Without a native ability to access your live proprietary data, models will generate plausible but factually incorrect information.

This is where context engineering becomes essential. For models to pull the most relevant context, teams need access to advanced search techniques that match user query intent with relevant context from source systems.

Without accurate context, an agent will fail by hallucinating, selecting the wrong tool, or drifting away from its original objective. Context poisoning occurs when these errors compound across subsequent interactions.

Build a comprehensive context engineering practice by:

Implementing**retrieval augmented generation (RAG)**: This involves the AI retrieving external data "just in time" from a knowledge base, such as internal company documents or public websites. RAG enables the AI to answer questions using information it was not originally trained on, thereby ensuring its responses are both current and accurate.

Managing memory strategically:

Short-term: Use checkpointers for current conversation state.

Long-term: Persist cross-conversation information to appropriate stores.

Implement trimming, summarization, and relevance filtering.

Optimizing tool availability: Minimize tool count while maintaining coverage. Consider RAG-based tool selection to combat "tool confusion" where too many tools reduce accuracy.

Ranking outputs for smarter content surfacing:Tools like Jina Reranker rescore retrieved content by how closely it actually matches a query, replacing rough similarity matches with precise relevance ranking. This ensures a reliable way to control what material gets surfaced and prioritized.

Gap 3: Legacy system integration challenges

Organizations carry years of architectural compromises that make it exceedingly difficult to build enterprise-scale AI. Legacy infrastructure and outdated systems severely limit the capabilities of modern agents.

AI agents need to both retrieve context from and take actions within existing systems. When legacy systems lack standardized interfaces, agents cannot easily interact with your enterprise environment.

Without proper integration architecture, organizations face a difficult dilemma. They must choose between building custom AI from scratch or relying on fragmented SaaS AI features that remain disconnected from relevant enterprise context.

We need a durable enterprise architecture that connects context from all relevant distributed source systems. Modern integration layers are non-negotiable for autonomous operations.

Address legacy integration challenges by:

Building a**durable enterprise architecture**: Ensure you connect context from all relevant distributed source systems and SaaS applications. SaaS applications still serve a purpose by bringing use case-specific knowledge, providing a workflow engine, and delivering a system of record.

Implementing a context retrieval mechanism: This will pull knowledge from SaaS applications within your enterprise architecture to build context-aware AI applications.

Modernizing incrementally: Don't attempt to replace all legacy systems. Instead, as we did at Elastic, adopt tools like LangChain as a library that helps orchestrate AI integrations, which allows us to build a more structured framework that naturally inherits the access controls and context of our native systems.

Establishing AI/machine learning (ML) integrations: We use a platform approach using the Elasticsearch Platform that gives teams quick access to preferred LLMs and AI development frameworks to speed up development.

Gap 4: Inadequate AI performance monitoring

Organizations frequently lack visibility into how their AI systems actually perform in production environments. Without robust performance management, IT teams have no visibility and cannot trust the outputs of their autonomous experiences.

This unprecedented power of AI agents brings an unprecedented level of complexity. We are deploying systems that can draft a brilliant business proposal one minute and hallucinate an entire legal case the next.

Large language models are notoriously opaque and non-deterministic. This introduces severe challenges across reliability, cost, quality, and safety dimensions that traditional monitoring simply cannot handle.

Their latency and resource utilization spike unpredictably based on the sheer length and complexity of the output. With highly variable output lengths, your token count and infrastructure bill can fly off the rails without warning.

Layer observability into your AI architecture with three critical components:

Implement application performance monitoring (APM) to track performance: Ensure you have visibility into the services and infrastructure supporting AI applications, enabling quick isolation of bottlenecks before they impact users.

Deploy LLM observability: Provide critical business insights into AI model performance, context accuracy, and usage patterns including conversation grouping by use case.

Add cloud monitoring: Correlate the performance and cost of cloud infrastructure, enabling quick diagnosis of infrastructure-related bottlenecks.

Gap 5: Missing governance and organizational structure

Beyond the technical hurdles, organizations suffer from cultural and structural problems that inhibit innovation. Legacy processes, shadow IT, and inconsistent governance models all slow down and introduce risk to agentic AI adoption.

AI cannot succeed as an IT-only initiative. When business partners actively participate in defining requirements and maintaining data quality, AI initiatives advance the strategic goals of the organization.

Without proper governance and organizational alignment, security vulnerabilities silently accumulate. This technical debt is the result of years of small security shortcuts and legacy systems left in place for far too long.

When AI is treated as a plug-and-play novelty, it stays a novelty. When it is approached as a strategic capability backed by strong governance, it becomes a massive force multiplier for your workforce.

Implement organizational structures and processes to ensure long-term success:

Designate an AI champion: Successful AI implementations involve identifying a single-threaded leader to drive your organization's vision forward. Pair this with a dedicated group of individuals set on a clear goal and measurable outcomes.

Build a center of excellence: Start with a core team and expand. Our AI outcomes at Elastic accelerated exponentially when we identified a lead for our use cases. We have since expanded into a small center of excellence focused on AI.

Establish data governance frameworks: Implement data contracts that create accountability while improving data quality and accessibility through the data mesh approach.

Create security protocols:Use AI-driven tools to automate threat detection, streamline vulnerability analysis, and even handle some of the routine documentation.

Turn your AI vision into business value with a durable enterprise architecture

Scaling AI initiatives demands that modernization investments, organizational change, and strategic AI deployment must proceed in parallel.

When you address your data accessibility, context engineering, system integration, monitoring, and governance gaps, you transform your infrastructure. You move from a fragile environment into a future-ready foundation.

The shift to agentic AI requires clear business objectives, a pristine data foundation, and robust human oversight. By preparing your architecture today, you ensure your autonomous systems will deliver a clear ROI tomorrow.

Ready to put these lessons to work and transform your infrastructure? Read the 8 steps to build a scalable generative AI app to ensure your next initiative delivers measurable results.

_The release and timing of any features or functionality described in this post remain at Elastic's sole discretion. Any features or functionality not currently available may not be delivered on time or at all._

_In this blog post, we may have used or referred to third party generative AI tools, which are owned and operated by their respective owners. Elastic does not have any control over the third party tools and we have no responsibility or liability for their content, operation or use, nor for any loss or damage that may arise from your use of such tools. Please exercise caution when using AI tools with personal, sensitive or confidential information. Any data you submit may be used for AI training or other purposes. There is no guarantee that information you provide will be kept secure or confidential. You should familiarize yourself with the privacy practices and terms of use of any generative AI tools prior to use._

_Elastic, Elasticsearch, and associated marks are trademarks, logos or registered trademarks of elasticsearch B.V. in the United States and other countries. All other company and product names are trademarks, logos or registered trademarks of their respective owners._