Towards Data Science 最近有什么新动态？

traeai 已收录 13 篇与 Towards Data Science 相关的内容。最新一篇是「RAG Is Burning Money — I Built a Cost Control Layer to Fix It」，由 Towards Data Science 发布。

公司

Towards Data Science

别名：TDS

一个专注于数据科学和机器学习的在线平台。

已跟踪 13 条高相关材料

TraeAI 观察

如果只读 3 篇

RAG Is Burning Money — I Built a Cost Control Layer to Fix It

Towards Data Science · 9.2 分

RAG系统在生产中常因上下文过取、无缓存、无模型路由导致成本激增；作者构建成本控制层，通过语义缓存（98.5%命中率）、查询路由（81%请求转向低成本模型）与令牌预算熔断机制，在10,000请求/日下实现85.8%成本削减且质量不变。

From Regex to Vision Models: Which RAG Technique Fits Which Problem

Towards Data Science · 9 分

RAG 技术并非万能，应根据文档结构和问题控制程度选择合适方法：模板化文档用正则表达式，客服对话需 LLM 判断语调，工程图纸必须使用视觉模型。

RAG Is Not Machine Learning, and the ML Toolkit Solves the Wrong Problem

Towards Data Science · 8.7 分

RAG 不是机器学习，使用 ML 工具包解决的是错误问题。文章指出，尽管 RAG 看似类似 ML，但其核心是构建搜索系统而非训练模型，因此超参数调优、嵌入模型微调等 ML 方法无法解决 RAG 的真实故障，反而导致资源浪费和信任下降。

RAG Is Burning Money — I Built a Cost Control Layer to Fix It

Towards Data Science5月30日4995 字 (约 20 分钟)

RAG systems often incur hidden costs due to context over-fetching, lack of caching, and no model routing; the author built a cost control layer using semantic caching (98.5% hit rate), query routing (81% requests shifted to low-cost models), and token-budget circuit breaking, achieving 85.8% cost reduction at 10k requests/day without quality loss.

入选理由：上下文过取使每查询平均多消耗350 tokens，10k请求/日造成$52.5/日浪费（按$0.015/1K tokens计）

FeaturedArticle#RAG#Cost Optimization#Semantic Caching#Model Routing#LLM英文

From Regex to Vision Models: Which RAG Technique Fits Which Problem

Towards Data Science6月2日4997 字 (约 20 分钟)

RAG techniques are not universal; choose based on document structure and query control: use regex for templated docs, LLMs for sarcasm detection in transcripts, and vision models for schematics.

入选理由：模板化文档（如保险单、银行流水）适合用正则表达式提取字段，避免使用高成本的 RAG 流程。

FeaturedArticle#RAG#LLM#Document Intelligence#Vision Models#Enterprise AI英文

RAG Is Not Machine Learning, and the ML Toolkit Solves the Wrong Problem

Towards Data Science6月2日6346 字 (约 26 分钟)

RAG is not machine learning, and the ML toolkit solves the wrong problem. The article argues that despite its resemblance to ML, RAG is fundamentally a search system, not a model, making hyperparameter tuning and embedding fine-tuning ineffective and misleading.

入选理由：RAG 解决的是确定性答案查找问题，而非预测未知结果，因此不能用 ML 方法优化。

FeaturedArticle#RAG#Machine Learning#Enterprise AI#Information Retrieval#LLM英文

Embeddings Aren’t Magic: The Predictable Failure Modes of RAG Retrieval

Towards Data Science6月1日9526 字 (约 39 分钟)

RAG systems rely on embeddings that fail predictably: when queries use different terms than docs (e.g., ‘overtime’ vs ‘non-employee labor’), contain negations, or depend on exact IDs/codes, retrieval fails. The article argues enterprise reliability comes from upstream filtering (expert keywords, doc structure), not rerankers atop weak retrieval.

入选理由：嵌入模型在处理同义词/拼写变体时表现优异（如‘cancel’→‘termination procedures’），但对术语不一致问题无能为力

FeaturedArticle#RAG#Embedding#Retrieval#Enterprise AI#Document Intelligence英文

Proxy-Pointer RAG: Solving Entity and Relationship Sprawl in Large Knowledge Graphs

Towards Data Science5月21日3847 字 (约 16 分钟)

Proxy-Pointer RAG reduces the computational cost of entity and relationship reconciliation in knowledge graphs by over 90% by preserving document structure, enabling millisecond-scale ingestion without full-graph traversal.

入选理由：Proxy-Pointer RAG 使用 Skeleton Tree 和 Breadcrumb Injection 技术，使向量检索能精准定位文档完整结构段，而非碎片化块。

FeaturedArticle#RAG#Knowledge Graph#Proxy-Pointer#Entity Resolution#Vector Retrieval英文

The Exact ML Project I’d Build to Get Hired in 2026

Towards Data Science6月10日1642 字 (约 7 分钟)

构建一个个性化、创新、相关且可实际运行的机器学习项目，是获得2026年数据科学职位的关键。

入选理由：优秀的机器学习项目需具备个性化、创新性、相关性和实际运行性。

FeaturedArticle#机器学习#数据科学#项目构建#招聘英文

10 Common RAG Mistakes We Keep Seeing in Production

Towards Data Science6月10日5639 字 (约 23 分钟)

RAG系统在生产环境中常见错误包括解析失败、忽略文档结构、固定窗口分块等，影响检索精度。

入选理由：解析文档时应保留结构，避免将表格转换为字符串。

FeaturedArticle#RAG#AI#文档解析#企业应用英文

Sequential Fitting: A Different Perspective on the Spectral Bias of Neural Networks

Towards Data Science6月8日3803 字 (约 16 分钟)

Neural networks exhibit a 'spectral bias' when fitting high-frequency functions, fitting low-frequency components first, which leads to inefficient training. This article analyzes this phenomenon from different perspectives and provides explanations.

入选理由：神经网络在拟合高频率函数时需要更多训练轮次，导致效率低下。

FeaturedArticle#Neural Networks#Spectral Bias#Machine Learning#Activation Functions英文

FPN Paper Walkthrough: Leveraging the Internal Pyramid

Towards Data Science6月5日4625 字 (约 19 分钟)

FPN solves small object detection by introducing a Neck structure to fuse multi-scale features. This article details the Backbone-Neck-Head evolution and provides a from-scratch implementation guide connecting FPN with CNN and RPN, essential for understanding modern detection optimization.

入选理由：FPN作为Neck组件位于Backbone与Head之间，通过特征增强机制显著提升小物体检测精度。

FeaturedArticle#FPN#Object Detection#YOLOv3#Feature Pyramid#Computer Vision英文

Small Data, Big Maps: Training Geospatial ML Models When Samples Are Scarce

Towards Data Science6月5日1566 字 (约 7 分钟)

The core bottleneck in geospatial ML is expensive field samples, not compute; solving small-sample issues requires increasing per-sample information density via multi-source feature engineering and prioritizing low-variance models like Random Forest to control overfitting.

入选理由：亚马逊雨林单个森林清查样地成本相当于一台ML训练计算机，实地标签稀缺是核心约束。

FeaturedArticle#Geospatial ML#Small Data#Feature Engineering#Random Forest#Remote Sensing英文

Five Ways to Fine-Tune Chronos-2, the Time Series Foundation Model

Towards Data Science6月5日4139 字 (约 17 分钟)

Chronos-2 TSFM can be fine-tuned via LoRA to address zero-shot gaps, detailing five scenarios including single-building adaptation, portfolio pooling, and covariate injection with strict data splitting.

入选理由：使用LoRA冻结120M参数主模型，仅训练低秩适配器以高效适配私有数据。

FeaturedArticle#Chronos-2#Time Series Foundation Model#LoRA#Fine-tuning#Forecasting英文

Why Gradient Descent Became Stochastic

Towards Data Science5月30日4695 字 (约 19 分钟)

The core reason gradient descent evolved into stochastic gradient descent (SGD) is computational scalability: as dataset size grows, batch gradient descent (BGD) becomes prohibitively expensive, while SGD updates parameters using only one or a few samples per iteration—reducing cost and leveraging noise to escape local minima; the article illustrates this via linear regression, deriving the closed-form solution from MSE and naturally motivating iterative optimization.

入选理由：线性回归中β₀=27315.74、β₁=9020.66的解析解可通过MSE对β₀/β₁求偏导并令其为0推导得出

FeaturedArticle#Gradient Descent#Stochastic Gradient Descent#Linear Regression#Optimization#Machine Learning英文

Can Machine Learning Predict the World Cup?

Towards Data Science6月10日3800 字 (约 16 分钟)

机器学习模型在预测世界杯比赛结果上表现有限，86%的主场胜利预测表明模型存在偏差。

入选理由：使用了包括多元回归、LightGBM等模型进行预测。

FeaturedArticle#机器学习#足球预测#数据科学#R语言英文

跨材料问答 · Towards Data Science

回答基于：Towards Data Science 相关 13 条材料