T
traeai
Sign in

公司

Towards Data Science

别名:TDS

一个专注于数据科学和机器学习的在线平台。

已跟踪 13 条高相关材料

TraeAI 观察

相关材料

已收录 13 条与 Towards Data Science 相关的内容,按评分排序。

RAG Is Burning Money — I Built a Cost Control Layer to Fix It

RAG Is Burning Money — I Built a Cost Control Layer to Fix It

Towards Data Science4995 字 (约 20 分钟)
92

RAG systems often incur hidden costs due to context over-fetching, lack of caching, and no model routing; the author built a cost control layer using semantic caching (98.5% hit rate), query routing (81% requests shifted to low-cost models), and token-budget circuit breaking, achieving 85.8% cost reduction at 10k requests/day without quality loss.

入选理由:上下文过取使每查询平均多消耗350 tokens,10k请求/日造成$52.5/日浪费(按$0.015/1K tokens计)

FeaturedArticle#RAG#Cost Optimization#Semantic Caching#Model Routing#LLM英文
From Regex to Vision Models: Which RAG Technique Fits Which Problem

From Regex to Vision Models: Which RAG Technique Fits Which Problem

Towards Data Science4997 字 (约 20 分钟)
90

RAG techniques are not universal; choose based on document structure and query control: use regex for templated docs, LLMs for sarcasm detection in transcripts, and vision models for schematics.

入选理由:模板化文档(如保险单、银行流水)适合用正则表达式提取字段,避免使用高成本的 RAG 流程。

FeaturedArticle#RAG#LLM#Document Intelligence#Vision Models#Enterprise AI英文
RAG Is Not Machine Learning, and the ML Toolkit Solves the Wrong Problem

RAG Is Not Machine Learning, and the ML Toolkit Solves the Wrong Problem

Towards Data Science6346 字 (约 26 分钟)
87

RAG is not machine learning, and the ML toolkit solves the wrong problem. The article argues that despite its resemblance to ML, RAG is fundamentally a search system, not a model, making hyperparameter tuning and embedding fine-tuning ineffective and misleading.

入选理由:RAG 解决的是确定性答案查找问题,而非预测未知结果,因此不能用 ML 方法优化。

FeaturedArticle#RAG#Machine Learning#Enterprise AI#Information Retrieval#LLM英文
Embeddings Aren’t Magic: The Predictable Failure Modes of RAG Retrieval

Embeddings Aren’t Magic: The Predictable Failure Modes of RAG Retrieval

Towards Data Science9526 字 (约 39 分钟)
87

RAG systems rely on embeddings that fail predictably: when queries use different terms than docs (e.g., ‘overtime’ vs ‘non-employee labor’), contain negations, or depend on exact IDs/codes, retrieval fails. The article argues enterprise reliability comes from upstream filtering (expert keywords, doc structure), not rerankers atop weak retrieval.

入选理由:嵌入模型在处理同义词/拼写变体时表现优异(如‘cancel’→‘termination procedures’),但对术语不一致问题无能为力

FeaturedArticle#RAG#Embedding#Retrieval#Enterprise AI#Document Intelligence英文
Towards Data Science 图标

Proxy-Pointer RAG: Solving Entity and Relationship Sprawl in Large Knowledge Graphs

Towards Data Science3847 字 (约 16 分钟)
87

Proxy-Pointer RAG reduces the computational cost of entity and relationship reconciliation in knowledge graphs by over 90% by preserving document structure, enabling millisecond-scale ingestion without full-graph traversal.

入选理由:Proxy-Pointer RAG 使用 Skeleton Tree 和 Breadcrumb Injection 技术,使向量检索能精准定位文档完整结构段,而非碎片化块。

FeaturedArticle#RAG#Knowledge Graph#Proxy-Pointer#Entity Resolution#Vector Retrieval英文
Towards Data Science 图标

The Exact ML Project I’d Build to Get Hired in 2026

Towards Data Science1642 字 (约 7 分钟)
85

构建一个个性化、创新、相关且可实际运行的机器学习项目,是获得2026年数据科学职位的关键。

入选理由:优秀的机器学习项目需具备个性化、创新性、相关性和实际运行性。

FeaturedArticle#机器学习#数据科学#项目构建#招聘英文
Towards Data Science 图标

10 Common RAG Mistakes We Keep Seeing in Production

Towards Data Science5639 字 (约 23 分钟)
85

RAG系统在生产环境中常见错误包括解析失败、忽略文档结构、固定窗口分块等,影响检索精度。

入选理由:解析文档时应保留结构,避免将表格转换为字符串。

FeaturedArticle#RAG#AI#文档解析#企业应用英文
Towards Data Science 图标

Sequential Fitting: A Different Perspective on the Spectral Bias of Neural Networks

Towards Data Science3803 字 (约 16 分钟)
85

Neural networks exhibit a 'spectral bias' when fitting high-frequency functions, fitting low-frequency components first, which leads to inefficient training. This article analyzes this phenomenon from different perspectives and provides explanations.

入选理由:神经网络在拟合高频率函数时需要更多训练轮次,导致效率低下。

FeaturedArticle#Neural Networks#Spectral Bias#Machine Learning#Activation Functions英文
FPN Paper Walkthrough: Leveraging the Internal Pyramid

FPN Paper Walkthrough: Leveraging the Internal Pyramid

Towards Data Science4625 字 (约 19 分钟)
82

FPN solves small object detection by introducing a Neck structure to fuse multi-scale features. This article details the Backbone-Neck-Head evolution and provides a from-scratch implementation guide connecting FPN with CNN and RPN, essential for understanding modern detection optimization.

入选理由:FPN作为Neck组件位于Backbone与Head之间,通过特征增强机制显著提升小物体检测精度。

FeaturedArticle#FPN#Object Detection#YOLOv3#Feature Pyramid#Computer Vision英文
Small Data, Big Maps: Training Geospatial ML Models When Samples Are Scarce

Small Data, Big Maps: Training Geospatial ML Models When Samples Are Scarce

Towards Data Science1566 字 (约 7 分钟)
82

The core bottleneck in geospatial ML is expensive field samples, not compute; solving small-sample issues requires increasing per-sample information density via multi-source feature engineering and prioritizing low-variance models like Random Forest to control overfitting.

入选理由:亚马逊雨林单个森林清查样地成本相当于一台ML训练计算机,实地标签稀缺是核心约束。

FeaturedArticle#Geospatial ML#Small Data#Feature Engineering#Random Forest#Remote Sensing英文
Five Ways to Fine-Tune Chronos-2, the Time Series Foundation Model

Five Ways to Fine-Tune Chronos-2, the Time Series Foundation Model

Towards Data Science4139 字 (约 17 分钟)
82

Chronos-2 TSFM can be fine-tuned via LoRA to address zero-shot gaps, detailing five scenarios including single-building adaptation, portfolio pooling, and covariate injection with strict data splitting.

入选理由:使用LoRA冻结120M参数主模型,仅训练低秩适配器以高效适配私有数据。

FeaturedArticle#Chronos-2#Time Series Foundation Model#LoRA#Fine-tuning#Forecasting英文
Towards Data Science 图标

Why Gradient Descent Became Stochastic

Towards Data Science4695 字 (约 19 分钟)
78

The core reason gradient descent evolved into stochastic gradient descent (SGD) is computational scalability: as dataset size grows, batch gradient descent (BGD) becomes prohibitively expensive, while SGD updates parameters using only one or a few samples per iteration—reducing cost and leveraging noise to escape local minima; the article illustrates this via linear regression, deriving the closed-form solution from MSE and naturally motivating iterative optimization.

入选理由:线性回归中β₀=27315.74、β₁=9020.66的解析解可通过MSE对β₀/β₁求偏导并令其为0推导得出

FeaturedArticle#Gradient Descent#Stochastic Gradient Descent#Linear Regression#Optimization#Machine Learning英文
Towards Data Science 图标

Can Machine Learning Predict the World Cup?

Towards Data Science3800 字 (约 16 分钟)
70

机器学习模型在预测世界杯比赛结果上表现有限,86%的主场胜利预测表明模型存在偏差。

入选理由:使用了包括多元回归、LightGBM等模型进行预测。

FeaturedArticle#机器学习#足球预测#数据科学#R语言英文

跨材料问答 · Towards Data Science

回答基于:Towards Data Science 相关 13 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.