Open Source Reporting Pipeline

You work as a data engineer at Spotify. The executive team requests a weekly analytics report summarizing user engagement and streaming trends, using data from internal databases and log files. Strict budget constraints prohibit the use of commercial or managed cloud services; you must rely solely on open-source technologies such as PostgreSQL, Apache Airflow, Apache Spark, and Metabase.

How would you architect this reporting pipeline from data ingestion to report delivery?
