# 1. data pipeline orchestrator ?
- task define, schedule
- monitor, error handling
- coordinate dependency
- execute order of tasks
- data movement(ETL)
- scalar or parallel
# 2. similar product
- oozie, Uber-temporal(go-base), AWS-step
# 3. components
- Web Server : monitor
- Metadata Database
- Scheduler
- Executor
- Worker
- Triggerer
# 4. DAG
- set of tasks, task is unit of execution
- tasks can be written by python, bash, SQL
- operator : Action operator, Transfer operator, Sensor operator
# 5. Architecture
- Single node : Web UI, Queue, Scheduler, Metadata DB, Executor
- Multi node : seperate Single node unit by each feature
'공부 이야기 > 그냥 찾아보는 공부' 카테고리의 다른 글
[python] gradio -> UnicodeDecodeError: 'cp949' codec can't decode byte 0xe2 in position 2072: illegal multibyte sequence 에러 해결 방법 (0) | 2024.03.27 |
---|---|
Webhook이란? (0) | 2024.03.25 |
소수의 성질 1 - 메르센 소수 (0) | 2022.02.25 |
에레토스테네스의 체 (0) | 2022.02.24 |
비트연산자로 몫 구하기, 홀수 구하기 (0) | 2022.02.20 |