T
traeai
Sign in
返回首页
Towards Data Science

What Is a Data Agent?

2.0Score
What Is a Data Agent?

TL;DR · AI Summary

This article introduces the concept of a Data Agent and its application in modern software development.

Key Takeaways

  • A Data Agent is a tool for automating data operations.
  • It can improve data processing efficiency.
  • It is applicable to various sizes of enterprises.

Outline

Jump quickly between sections.

  1. Briefly introduce the basic concept and role of a Data Agent.

  2. Detailed explanation of how a Data Agent works.

  3. Explain how a Data Agent retrieves data from multiple sources.

  4. Analyze how a Data Agent cleans and transforms data.

  5. Discuss the applications of a Data Agent in different scenarios.

  6. Provide examples of how a Data Agent is applied in large enterprises.

  7. Provide examples of how a Data Agent is applied in small and medium-sized enterprises.

Mindmap

See how the topics connect at a glance.

查看大纲文本(无障碍 / 无 JS 友好)
  • Cookie介绍
    • Cookie分类
      • 必要Cookie
      • 分析Cookie
    • Cookie管理
      • 拒绝所有Cookie
      • 接受所有Cookie

Highlights

Key sentences worth saving and sharing.

#Data Agent#Automation#Data Processing
Open original article

What Is a Data Agent? | Towards Data Science

Image 5: Revisit consent button

We value your privacy

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All," you consent to our use of cookies.

Customize Reject All Accept All

Customize Consent PreferencesImage 6

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ...Show more

Necessary Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

  • Cookie BCTempID
  • Duration 10 minutes
  • Description No description available.
  • Cookie __cf_bm
  • Duration 1 hour
  • Description This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
  • Cookie AWSALBCORS
  • Duration 7 days
  • Description Amazon Web Services set this cookie for load balancing.
  • Cookie _cfuvid
  • Duration session
  • Description Cloudflare sets this cookie to track users across sessions to optimize user experience by maintaining session consistency and providing personalized services
  • Cookie li_gc
  • Duration 6 months
  • Description LinkedIn set this cookie for storing visitor's consent regarding using cookies for non-essential purposes.
  • Cookie __hssrc
  • Duration session
  • Description This cookie is set by Hubspot whenever it changes the session cookie. The __hssrc cookie set to 1 indicates that the user has restarted the browser, and if the cookie does not exist, it is assumed to be a new session.
  • Cookie __hssc
  • Duration 1 hour
  • Description HubSpot sets this cookie to keep track of sessions and to determine if HubSpot should increment the session number and timestamps in the __hstc cookie.
  • Cookie wpEmojiSettingsSupports
  • Duration session
  • Description WordPress sets this cookie when a user interacts with emojis on a WordPress site. It helps determine if the user's browser can display emojis properly.
  • Cookie BCSessionID
  • Duration 1 year 1 month 4 days
  • Description Blueconic sets this cookie as a unique identifier for the BlueConic profile.
  • Cookie _octo
  • Duration 1 year
  • Description No description available.
  • Cookie logged_in
  • Duration 1 year
  • Description No description available.
  • Cookie __Secure-YEC
  • Duration past
  • Description YouTube sets this cookie to store the user's video player preferences using embedded YouTube video
  • Cookie __eoi
  • Duration 6 months
  • Description Description is currently not available.
  • Cookie AWSALBTGCORS
  • Duration 7 days
  • Description No description available.
  • Cookie login-status-p
  • Duration past
  • Description Description is currently not available.
  • Cookie AWSALBTG
  • Duration 7 days
  • Description No description available.
  • Cookie csrf_token
  • Duration session
  • Description No description available.
  • Cookie token_v2
  • Duration 1 day
  • Description Description is currently not available.
  • Cookie D
  • Duration 1 year
  • Description Description is currently not available.
  • Cookie PHPSESSID
  • Duration session
  • Description This cookie is native to PHP applications. The cookie stores and identifies a user's unique session ID to manage user sessions on the website. The cookie is a session cookie and will be deleted when all the browser windows are closed.
  • Cookie VISITOR_PRIVACY_METADATA
  • Duration 6 months
  • Description YouTube sets this cookie to store the user's cookie consent state for the current domain.
  • Cookie cookietest
  • Duration session
  • Description The cookietest cookie is typically used to determine whether the user's browser accepts cookies, essential for website functionality and user experience.
  • Cookie __Host-airtable-session
  • Duration 1 year
  • Description This cookie is used to enable us to integrate the services of Airtable.
  • Cookie __Host-airtable-session.sig
  • Duration 1 year
  • Description This cookie is used to enable us to integrate the services of Airtable.
  • Cookie m
  • Duration 1 year 1 month 4 days
  • Description Stripe sets this cookie for fraud prevention purposes. It identifies the device used to access the website, allowing the website to be formatted accordingly.
  • Cookie BIGipServer*
  • Duration session
  • Description Marketo sets this cookie to collect information about the user's online activity and build a profile about their interests to provide advertisements relevant to the user.
  • Cookie __cfruid
  • Duration session
  • Description Cloudflare sets this cookie to identify trusted web traffic.
  • Cookie _GRECAPTCHA
  • Duration 6 months
  • Description Google Recaptcha service sets this cookie to identify bots to protect the website against malicious spam attacks.
  • Cookie __Secure-YNID
  • Duration 6 months
  • Description Google cookie used to protect user security and prevent fraud, especially during the login process.
  • Cookie cookieyes-consent
  • Duration 1 year
  • Description CookieYes sets this cookie to remember users' consent preferences so that their preferences are respected on subsequent visits to this site. It does not collect or store any personal information about the site visitors.

Functional

  • [x]
markdown
Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

*   Cookie lidc 
*   Duration 1 day 
*   Description LinkedIn sets the lidc cookie to facilitate data center selection. 

*   Cookie brw 
*   Duration 1 year 
*   Description No description available. 

*   Cookie brwConsent 
*   Duration 5 minutes 
*   Description Description is currently not available. 

*   Cookie WMF-Uniq 
*   Duration 1 year 
*   Description Description is currently not available. 

*   Cookie loom_anon_comment 
*   Duration 1 year 
*   Description No description available. 

*   Cookie loom_referral_video 
*   Duration session 
*   Description Description is currently not available. 

*   Cookie VISITOR_INFO1_LIVE 
*   Duration 6 months 
*   Description A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface. 

*   Cookie yt-remote-connected-devices 
*   Duration Never Expires 
*   Description YouTube sets this cookie to store the user's video preferences using embedded YouTube videos. 

*   Cookie ytidb::LAST_RESULT_ENTRY_KEY 
*   Duration Never Expires 
*   Description The cookie ytidb::LAST_RESULT_ENTRY_KEY is used by YouTube to store the last search result entry that was clicked by the user. This information is used to improve the user experience by providing more relevant search results in the future. 

*   Cookie yt-remote-device-id 
*   Duration Never Expires 
*   Description YouTube sets this cookie to store the user's video preferences using embedded YouTube videos. 

*   Cookie yt-remote-session-name 
*   Duration session 
*   Description The yt-remote-session-name cookie is used by YouTube to store the user's video player preferences using embedded YouTube video. 

*   Cookie yt-remote-fast-check-period 
*   Duration session 
*   Description The yt-remote-fast-check-period cookie is used by YouTube to store the user's video player preferences for embedded YouTube videos. 

*   Cookie yt-remote-session-app 
*   Duration session 
*   Description The yt-remote-session-app cookie is used by YouTube to store user preferences and information about the interface of the embedded YouTube video player. 

*   Cookie yt-remote-cast-available 
*   Duration session 
*   Description The yt-remote-cast-available cookie is used to store the user's preferences regarding whether casting is available on their YouTube video player. 

*   Cookie yt-remote-cast-installed 
*   Duration session 
*   Description The yt-remote-cast-installed cookie is used to store the user's video player preferences using embedded YouTube video. 

*   Cookie cp_session 
*   Duration 3 months 
*   Description Codepen sets this cookie for Help systems found in the website. 

*   Cookie loid 
*   Duration 1 year 1 month 4 days 
*   Description This cookie is set by the Reddit. The cookie enables the sharing of content from the website onto the social media platform. 

Analytics

- [x] 

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

*   Cookie __hstc 
*   Duration 6 months 
*   Description Hubspot set this main cookie for tracking visitors. It contains the domain, initial timestamp (first visit), last timestamp (last visit), current timestamp (this visit), and session number (increments for each subsequent session). 

*   Cookie hubspotutk 
*   Duration 6 months 
*   Description HubSpot sets this cookie to keep track of the visitors to the website. This cookie is passed to HubSpot on form submission and used when deduplicating contacts. 

*   Cookie _ga 
*   Duration 1 year 1 month 4 days 
*   Description Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors. 

*   Cookie _ga_* 
*   Duration 1 year 1 month 4 days 
*   Description Google Analytics sets this cookie to store and count page views. 

*   Cookie __Host-psifi.analyticsTrace 
*   Duration 6 hours 
*   Description Description is currently not available. 

*   Cookie __Host-psifi.analyticsTraceV2 
*   Duration 6 hours 
*   Description Description is currently not available. 

*   Cookie _gh_sess 
*   Duration session 
*   Description GitHub sets this cookie for temporary application and framework state between pages like what step the user is on in a multiple step form. 

*   Cookie YSC 
*   Duration session 
*   Description YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages. 

*   Cookie ajs_anonymous_id 
*   Duration 1 year 
*   Description This cookie is set by Segment to count the number of people who visit a certain site by tracking if they have visited before. 

*   Cookie vuid 
*   Duration 1 year 1 month 4 days 
*   Description Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos on the website.  

Performance

- [x] 

Performance cookies are used to understand and analyse the key performance indexes of the website which helps in delivering a better user experience for the visitors.

*   Cookie AWSALB 
*   Duration 7 days 
*   Description AWSALB is an application load balancer cookie set by Amazon Web Services to map the session to the target. 

*   Cookie acq 
*   Duration past 
*   Description Description is currently not available. 

*   Cookie acq.sig 
*   Duration past 
*   Description Description is currently not available. 

*   Cookie ptc 
*   Duration 2 years 
*   Description No description available. 

Advertisement

- [x] 
markdown
Advertisement cookies are used to provide visitors with customised advertisements based on the pages you visited previously and to analyse the effectiveness of the ad campaigns.

*   Cookie muc_ads 
*   Duration 1 year 1 month 4 days 
*   Description Twitter sets this cookie to collect user behaviour and interaction data to optimize the website. 

*   Cookie guest_id_marketing 
*   Duration 1 year 1 month 4 days 
*   Description Twitter sets this cookie to identify and track the website visitor. 

*   Cookie guest_id_ads 
*   Duration 1 year 1 month 4 days 
*   Description Twitter sets this cookie to identify and track the website visitor. 

*   Cookie personalization_id 
*   Duration 1 year 1 month 4 days 
*   Description Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting. 

*   Cookie guest_id 
*   Duration 1 year 1 month 4 days 
*   Description Twitter sets this cookie to identify and track the website visitor. It registers if a user is signed in to the Twitter platform and collects information about ad preferences. 

*   Cookie bcookie 
*   Duration 1 year 
*   Description LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser IDs. 

*   Cookie __Secure-ROLLOUT_TOKEN 
*   Duration 6 months 
*   Description YouTube sets this cookie to manage feature rollout and experimentation. It helps Google control which new features or interface changes are shown to users as part of testing and staged rollouts, ensuring consistent experience for a given user during an experiment. 

*   Cookie yt.innertube::nextId 
*   Duration Never Expires 
*   Description YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen. 

*   Cookie yt.innertube::requests 
*   Duration Never Expires 
*   Description YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen. 

*   Cookie session_tracker 
*   Duration session 
*   Description This cookie is set by the Reddit. This cookie is used to identify trusted web traffic. It also helps in advertising on the website. 

*   Cookie edgebucket 
*   Duration session 
*   Description Reddit sets this cookie to save the information about a logged-in Reddit user, for the purpose of advertisement recommendations and updating the content. 

*   Cookie did 
*   Duration 1 year 
*   Description Arbor sets this cookie to show targeted ads to site visitors. This cookie expires after 2 months or 1 year. 

Uncategorised

Other uncategorised cookies are those that are being analysed and have not been classified into a category as yet.

No cookies to display.

Reject All Save My Preferences Accept All

[Skip to content](https://towardsdatascience.com/what-is-a-data-agent/#wp--skip-link--target)

[![Image 7: Towards Data Science](https://towardsdatascience.com/wp-content/uploads/2025/02/TDS-Vector-Logo.svg)](https://towardsdatascience.com/)

Publish AI, ML & data-science insights to a global community of data professionals.

[Sign in](https://towardsdatascience.com/what-is-a-data-agent/)

[Submit an Article](https://contributor.insightmediagroup.io/)

*   [Latest](https://towardsdatascience.com/latest/)
*   [Editor’s Picks](https://towardsdatascience.com/tag/editors-pick/)
*   [Deep Dives](https://towardsdatascience.com/tag/deep-dives/)
*   [Newsletter](https://towardsdatascience.com/tag/the-variable/)

* * *

*   [Write For TDS](https://towardsdatascience.com/submissions/)

[![Image 8: Towards Data Science](https://towardsdatascience.com/wp-content/uploads/2025/02/TDS-Vector-Logo.svg)](https://towardsdatascience.com/)

Toggle Mobile Navigation

*   [LinkedIn](https://www.linkedin.com/company/towards-data-science/?originalSubdomain=ca)
*   [X](https://x.com/TDataScience)

Toggle Search

Search 

[Agentic AI](https://towardsdatascience.com/category/artificial-intelligence/agentic-ai/)

# What Is a Data Agent?

A simple explanation of what a data agent is and how it works

[Marina Tosic](https://towardsdatascience.com/author/martosi/)

May 26, 2026

 5 min read

 Share 

![Image 9: What is a data agent?](https://towardsdatascience.com/wp-content/uploads/2026/05/image-13.jpeg)

[Photo](https://unsplash.com/photos/two-white-speech-bubbles-sitting-on-top-of-a-brown-surface-sWRPYgjpygQ) by Kelly Sikkema on Unsplash

Working at Microsoft, I have the opportunity to try new AI-powered analytical tools, including Microsoft Fabric's data agent. That's why I want to share what I've learned, explain what a data agent is, and highlight the difference between it and a "standard" AI agent.

So, without further ado, here is my definition of a data agent:

> A data agent is a report you can talk to.

For those of us in analytics, this means two long-held wishes might finally become a reality:

**#1:** Analysts spend way less time building visualisations.

**#2:** Self-service insights come closer to business users.

Let me elaborate on each of these points a bit more.

## Fewer visualisations, not fewer insights

I really enjoy a good report that can tell me “what’s up” with the metrics I am currently interested in. But being trained in analytics, I know how reports can sometimes cast metrics in the wrong light, leading business users to frequently ask analysts for KPIs interpretation, usually 10 minutes before important meetings.

And that’s one of the reasons we often end up in a vicious cycle of having dashboards no one is using, and stakeholders constantly wanting “the number” served ad hoc or via spreadsheets.

On the bright side, visualisations and spreadsheets are not going anywhere, but serving the insights has a new way with a Fabric data agent.
markdown
Instead of wrapping queries in graphs, you can wrap them in prompts and instructions paired with the consumption-ready governed data estate in Fabric, i.e., in a lakehouse, warehouse, Power BI semantic models, KQL database, or even an ontology. This implies the underlying data still needs to be prepared and modeled to answer business questions such as “_What was the revenue this week compared to last week?_

However, from a design perspective, rather than creating a scoped visual report to answer this business question, you now create a scoped data agent to provide this, and other subsets of answers derived from the underlying data model(s).

More precisely, the input-output flow goes as follows:

(1) a stakeholder asks a question, (2) the agent, powered by Azure OpenAI Assistant API, interprets the question and “decides” which of data sources is most likely to have the answer based on source schemas and agent instructions, (3) generates the appropriate query (SQL, DAX, or KQL depending on the source type), (4) validates it, (5) executes it under the stakeholder’s credentials, and (6) returns the result as a text or a table, not (yet) as a visual.

In sum, a stakeholder interaction with insights via the data agent is a Q&A session on top of the curated dataset, and drill-down visuals can be replaced with follow-up questions, such as “_Can you also break the revenue out by segment?_

With that, it is clear how analysts’ work no longer needs to be re-expressed only via dashboards, aka the long-known tangible proof that the work of capturing the business logic within data models was delivered.

Now, let’s talk about…

## Self-service insights, closer to where business users“live”

I mentioned before that reports can sometimes misrepresent metrics, but that’s not the only reason why “[_If you build it, they will come_](https://youtu.be/5Ay5GqJwHF8?si=ogbGpP0j2XnliDzo&t=27)” rarely works for them or analytics in general. The truth is, the knowledge barrier is often too high to understand the underlying semantic models and how to use BI tools to create visuals on top.

Although this points to data literacy, which is a change-management problem, it’s a fact that the targeted business audience, who should be report consumers, often has too much on their plate to bother learning BI tools for self-service analytics.

That’s why it’s important to bring insights closer to where end users “live”, which nowadays points towards AI-powered tools like M365 Copilot.

With the possibility to expose insights via data agents outside of Fabric, analysts can now focus on the analytical logic behind self-service data agents, and end users can access insights in the same AI-powered tools that support their other daily tasks, without the complexity of switching to another platform.

I have to note this is not the only way to integrate Fabric data agents in the workflows, and regardless of whether you’re a developer or a consumer, it’s good to know…

## The difference between data and an AI agent

![Image 10](https://contributor.insightmediagroup.io/wp-content/uploads/2026/05/0Ng4qm30VuQl44HY5.jpg)

Photo by [Dynamic Wang](https://unsplash.com/@dynamicwang?utm_source=medium&utm_medium=referral) on[Unsplash](https://unsplash.com/?utm_source=medium&utm_medium=referral)

We’ve learned so far that the Fabric data agent is an analytical agent focused on read-only, governed data access, capable of translating natural language prompts into complex database queries that unlock insights, even outside the Fabric tenant.

On the other side, [an AI agent](https://microsoft.github.io/ai-agents-for-beginners/01-intro-to-ai-agents/) is defined as a system that allows Large Language Models (LLMs) to _do things,_ not just respond to prompts, on behalf of users or other systems by accessing tools and knowledge.

Meaning, the whole magic is in the AI agent setup, where you can use a Fabric data agent as a specialized tool or knowledge source.

I’ll illustrate this with one simple example.

Imagine an authorized user requests the AI agent to _“Draft an email to the team summarizing last week’s revenue by segment.”_ To get this work done, the AI agent would, among other things, need to prepare revenue insights from the enterprise database. So, in an aim to reduce errors in revenue calculation, the developer would design an agentic workflow to route the input prompt to the Fabric data agent _tool_, which would handle the heavy lifting of determining the schema, writing the query, executing it, and returning the precise figures. Finally, the AI agent would then use those figures to finish its broader workflow and write the email.

What’s the difference between those two, then? It’s that an AI agent _acts_, while the data agent _grounds_.

* * *

> _Thank you for reading._
> 
> 
> _If you found this post valuable, feel free to share it with your network.![Image 11: 👏](https://s.w.org/images/core/emoji/17.0.2/svg/1f44f.svg)_
> 
> 
> _Connect for more stories on_[_Medium_](https://medium.com/@martosi/subscribe)_![Image 12: ✍️](https://s.w.org/images/core/emoji/17.0.2/svg/270d.svg) and_[_LinkedIn_](https://www.linkedin.com/in/martosi/)_![Image 13: 🖇️](https://s.w.org/images/core/emoji/17.0.2/svg/1f587.svg)._

* * *

## Want to learn more about data agents?

If that’s the case, check out the following resources:

[**Fabric data agent creation – Microsoft Fabric** _Learn how to create a Fabric data agent that can answer questions about data._ learn.microsoft.com](https://learn.microsoft.com/en-us/fabric/data-science/concept-data-agent)[](https://learn.microsoft.com/en-us/fabric/data-science/concept-data-agent)

[**Implement Microsoft Fabric Data Agents – Training** _Implement Microsoft Fabric Data Agents (chat with your data)_ learn.microsoft.com](https://learn.microsoft.com/en-us/training/modules/implement-fabric-data-agents/)[](https://learn.microsoft.com/en-us/training/modules/implement-fabric-data-agents/)

* * *

Written By
markdown
Marina Tosic

[See all from Marina Tosic](https://towardsdatascience.com/author/martosi/)

[AI Agents](https://towardsdatascience.com/tag/ai-agents-2/), [Artificial Intelligence](https://towardsdatascience.com/tag/artificial-intelligence/), [Data Science](https://towardsdatascience.com/tag/data-science/), [Data Visualization](https://towardsdatascience.com/tag/data-visualization/), [Microsoft Fabric](https://towardsdatascience.com/tag/microsoft-fabric/)

Share This Article

*   [Share on Facebook](https://www.facebook.com/sharer/sharer.php?u=https%3A%2F%2Ftowardsdatascience.com%2Fwhat-is-a-data-agent%2F&title=What%20Is%20a%20Data%C2%A0Agent%3F)
*   [Share on LinkedIn](https://www.linkedin.com/shareArticle?mini=true&url=https%3A%2F%2Ftowardsdatascience.com%2Fwhat-is-a-data-agent%2F&title=What%20Is%20a%20Data%C2%A0Agent%3F)
*   [Share on X](https://x.com/share?url=https%3A%2F%2Ftowardsdatascience.com%2Fwhat-is-a-data-agent%2F&text=What%20Is%20a%20Data%C2%A0Agent%3F)

Towards Data Science is a community publication. Submit your insights to reach our global audience and earn through the TDS Author Payment Program.

[Write for TDS](https://towardsdatascience.com/questions-96667b06af5/)

## Related Articles

*   ![Image 14](https://towardsdatascience.com/wp-content/uploads/2025/07/stanford-research-scaled-1.jpeg)## [The Stanford Framework That Turns AI into Your PM Superpower](https://towardsdatascience.com/the-stanford-framework-that-turns-ai-into-your-pm-superpower/)

[Machine Learning](https://towardsdatascience.com/category/artificial-intelligence/machine-learning/) A human-centric guide to AI automation for product managers. [Rahul Vir](https://towardsdatascience.com/author/rahulvir/) July 28, 2025  6 min read      
*   ![Image 15](https://towardsdatascience.com/wp-content/uploads/2025/08/Screenshot-2025-08-08-175859.jpg)## [LangGraph + SciPy: Building an AI That Reads Documentation and Makes Decisions](https://towardsdatascience.com/langgraph-scipy-building-an-ai-that-reads-documentation-and-makes-decisions/)

[Agentic AI](https://towardsdatascience.com/category/artificial-intelligence/agentic-ai/) Stop guessing your statistical test. Let this AI do it for you. [Gustavo Santos](https://towardsdatascience.com/author/gustavorsantos/) August 11, 2025  11 min read      
*   ![Image 16](https://towardsdatascience.com/wp-content/uploads/2025/08/a_minimalist_clean_illustration_of_a_futuristic-2.jpeg)## [Smarter Model Tuning: An AI Agent with LangGraph + Streamlit That Boosts ML Performance](https://towardsdatascience.com/smarter-model-tuning-an-ai-agent-with-langgraph-streamlit-that-boosts-ml-performance/)

[Agentic AI](https://towardsdatascience.com/category/artificial-intelligence/agentic-ai/) Automating model tuning in Python with Gemini, LangGraph, and Streamlit for regression and classification improvements [Gustavo Santos](https://towardsdatascience.com/author/gustavorsantos/) August 20, 2025  12 min read      
*   ![Image 17](https://towardsdatascience.com/wp-content/uploads/2025/09/architecture-diagram.png)## [Using LangGraph and MCP Servers to Create My Own Voice Assistant](https://towardsdatascience.com/using-langgraph-and-mcp-servers-to-create-my-own-voice-assistant/)

[Machine Learning](https://towardsdatascience.com/category/artificial-intelligence/machine-learning/) Built over 14 days, all locally run, no API keys, cloud services, or subscription fees. [Benjamin Lee](https://towardsdatascience.com/author/bl3e967/) September 4, 2025  30 min read      
*   ![Image 18](https://towardsdatascience.com/wp-content/uploads/2025/09/Tool_Mask_Illustration.jpg)## [Tool Masking: The Layer MCP Forgot](https://towardsdatascience.com/tool-masking-the-layer-mcp-forgot/)

[Agentic AI](https://towardsdatascience.com/category/artificial-intelligence/agentic-ai/) Tool masking for AI improves AI agents: shape MCP tool surfaces to cut tokens and… [Frank Wittkampf](https://towardsdatascience.com/author/frankw_usa/) September 5, 2025  16 min read      
*   ![Image 19: Building AI Agents](https://towardsdatascience.com/wp-content/uploads/2025/09/assets_task_01k4jhtmgtfn88yjgf9rcq4641_1757264422_img_2.webp)## [How to Build Effective AI Agents to Process Millions of Requests](https://towardsdatascience.com/how-to-build-effective-ai-agents-to-process-millions-of-requests/)

[Agentic AI](https://towardsdatascience.com/category/artificial-intelligence/agentic-ai/) Learn how to build production ready systems using AI agents [Eivind Kjosbakken](https://towardsdatascience.com/author/oieivind/) September 9, 2025  9 min read      
*   ![Image 20: LangChain for EDA - Learn to build a CSV Sanity-Check Agent in Python](https://towardsdatascience.com/wp-content/uploads/2025/09/LangChain-for-EDA-Build-a-CSV-Sanity-Check-Agent-in-Python.png)## [LangChain for EDA: Build a CSV Sanity-Check Agent in Python](https://towardsdatascience.com/langchain-for-eda-build-a-csv-sanity-check-agent-in-python/)

[Agentic AI](https://towardsdatascience.com/category/artificial-intelligence/agentic-ai/) A practical LangChain tutorial for data scientists to inspect CSVs [Sarah Schürch](https://towardsdatascience.com/author/schuerch_sarah/) September 9, 2025  19 min read      

*   [YouTube](https://www.youtube.com/c/TowardsDataScience)
*   [X](https://x.com/TDataScience)
*   [LinkedIn](https://www.linkedin.com/company/towards-data-science/?originalSubdomain=ca)
*   [Threads](https://www.threads.net/@towardsdatascience)
*   [Bluesky](https://bsky.app/profile/towardsdatascience.com)

[![Image 21: Towards Data Science](https://towardsdatascience.com/wp-content/uploads/2025/02/TDS-Vector-Logo.svg)](https://towardsdatascience.com/)

Your home for data science and Al. The world’s leading publication for data science, data analytics, data engineering, machine learning, and artificial intelligence professionals.

 © Insight Media Group, LLC 2026 

 Subscribe to Our Newsletter 

Some sections of this page may rearrange if you adjust the browser window size. Make sure to verify the headings and document structure.

Image 23Image 24

AI may generate inaccurate information. Please verify important content.