Skip to main content

Questions tagged [etl]

Extract, Transform, Load - process in a database

6 questions with no upvoted or accepted answers
2 votes
0 answers
42 views

How to manage scheduled ETL jobs that are time sensitive?

We have some ETL jobs that are scheduled to run every day, and some that are scheduled to run every week via Control-M. These types of jobs tag data with the date the job was run and perform filter ...
Igneous01's user avatar
  • 2,333
2 votes
0 answers
393 views

Data pipeline architecture: airflow triggered by message broker

Let us say we have: a web app with a Postgres DB that produces data over time, another DB optimized for analytics that we would like to populate over time. My goal is to build and monitor an ETL ...
sunless's user avatar
  • 151
1 vote
0 answers
822 views

Parsing a JSON file from S3 using Airflow

I'm new to Airflow and I'm working on a proof of concept. The project is fairly simple... every day some 10,000 JSON files are loaded onto a folder on AWS S3. I have to get each one of them, parse ...
Gabe's user avatar
  • 143
1 vote
0 answers
836 views

Is MapReduce a correct framework for Extract, Transform, Load of data?

EDIT I am working on a project to update a legacy ETL infrastructure that supports a number of clients, each with a slightly different setup. Constraints that cannot be changed: Source data can ...
Noah Goodrich's user avatar
-1 votes
1 answer
279 views

WebApp for ETL with visual mapping - read csv and map it to data model

a few years ago I wrote a python script for reading CSV, handling the headers, filtering data, renaming stuff via RegEx...bascially to do various ETL stuff. This was done using a exhaustive ...
and0r's user avatar
  • 109
-1 votes
1 answer
152 views

How to incrementally update value of features in a machine learning pipeline?

I am working on a machine learning pipeline where we have to compute certain measures on streaming data. Every day, new raw data enters our pipeline. To update our features, we have to run an ETL that ...
spoderman's user avatar