Spark In Depth: Group by with Case statementI recently encountered a complex problem while working on my project. I needed to group the data for a my use case, which involved several…Feb 24, 20242Feb 24, 20242
Mastering Data Engineering: A breakdown of Data Pipeline Stages and ToolsThe market is flooded with numerous data engineering tools due to the exponential increase in data volumes. Starting a career as a Data…Feb 13, 20244Feb 13, 20244
Optimise an Already Optimised Heavy Spark Job with Long Lineage.Upon receiving the initial requirement to write a Spark job , you inquired about the volume of data that the job would be processing. The…Jan 27, 20243Jan 27, 20243
Evaluating KPI Dashboards to Increase ROI for Data Engineering Products — I“Uncover the ROI: Evaluating Data Processing and Storage Costs for KPI Dashboards” -Jan 27, 2024Jan 27, 2024