From Data Analyst to Data Engineer: My 12-Month Self-Study Roadmap
This article shares a 12-month self-study roadmap for transitioning from data analyst to data engineer.
入选理由:作者通过公开学习数据工程,提升自身技能并应对职业发展需求。
产品
别名:Pandas库
Python中用于数据处理和分析的库。
已收录 5 条与 Pandas 相关的内容,按评分排序。
This article shares a 12-month self-study roadmap for transitioning from data analyst to data engineer.
入选理由:作者通过公开学习数据工程,提升自身技能并应对职业发展需求。
Pandas remains the go-to tool for data wrangling due to its powerful features and strong community support.
入选理由:Pandas 在数据清洗和转换方面具有显著优势。
This article introduces five practical Python scripts for handling common tasks in time series data, including resampling, anomaly detection, and trend decomposition.
入选理由:提供了五个 Python 脚本,涵盖时间序列数据处理的常见任务。
Polars outperforms Pandas in handling large datasets, especially in parallel computing and lazy evaluation.
入选理由:Polars 使用 Rust 构建,支持并行计算和懒加载,性能优于 Pandas。
The article introduces how to build modern EDA pipelines using the Pingouin library, validating data normality, multivariate normality, and homoscedasticity through statistical tests.
入选理由:Pingouin 提供了 Shapiro-Wilk 和 Henze-Zirkler 检验来验证数据正态性