Data Engineering
Python Pandas Streamlit AWS Glue ETL Data Engineering Fuzzy Algorithm Machine Learning Artificial Intelligence K-Means XGBoost LangChain LLM
Blog Posts (1)
Article
Building a Scalable ETL Pipeline with AWS Glue (CSV to Parquet + Partitioning)
A hands-on walkthrough of building a serverless ETL pipeline with AWS Glue, PySpark, and Amazon Athena: converting raw CSV files to partitioned Parquet for efficient querying at scale.
08 Apr, 2026
Read