Target Flow - stream

Flow architecture

The technical target is to build a new flow based on streaming and DFI tools.

The documentation related to customer growth data ingestion is here

target stream

CDC

Data is exported through change data capture (CDC) using the Debezium component. This component is provided by the streaming team: CDC

The data source is a dedicated outbox table which stores data events about favorite sports.
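
As an illustration of how a Debezium source can be limited to a dedicated outbox table, here is a minimal sketch of a Kafka Connect payload built in Python. It is not the configuration actually deployed. The PostgreSQL connector class, the Debezium 2.x property names, the use of the outbox EventRouter transform, and every name and value (connector name, table, host, user, topic prefix) are assumptions made for the example.

```python
import json


def build_outbox_source_config(name: str, outbox_table: str, topic_prefix: str) -> dict:
    """Return a Kafka Connect payload for a Debezium source reading one outbox table.

    Assumptions (not taken from this page): PostgreSQL source, Debezium 2.x
    property names, and the Debezium outbox EventRouter transform.
    """
    return {
        "name": name,
        "config": {
            "connector.class": "io.debezium.connector.postgresql.PostgresConnector",
            # Placeholder connection settings; real values would come from secrets.
            "database.hostname": "db.example.internal",
            "database.port": "5432",
            "database.user": "cdc_user",
            "database.password": "<from-secret>",
            "database.dbname": "sport",
            "topic.prefix": topic_prefix,
            # Capture only the dedicated outbox table, not the business tables.
            "table.include.list": outbox_table,
            # Route each outbox row to a topic and unwrap its payload.
            "transforms": "outbox",
            "transforms.outbox.type": "io.debezium.transforms.outbox.EventRouter",
        },
    }


if __name__ == "__main__":
    # Hypothetical connector and table names, for illustration only.
    payload = build_outbox_source_config(
        "favorite-sport-outbox", "public.favorite_sport_outbox", "sport"
    )
    print(json.dumps(payload, indent=2))
```

Restricting `table.include.list` to the outbox table keeps the stream decoupled from the internal table layout: consumers only see the events that the application chose to publish.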


Data Avro schema

Data streamed into the Kafka topic is formalised with Avro schemas.
The Sport Avro schemas are available on GitHub.
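
To show what an Avro schema for a favorite sport event could look like, here is a hypothetical sketch written as a Python dictionary. The real Sport schemas live in the GitHub repository and may differ completely: the record name, the namespace and every field below are assumptions chosen for the example. The helper only checks for missing fields and is not a full Avro validator.

```python
import json

# Hypothetical schema: the record name, namespace and all fields are
# illustrative assumptions, not the real Sport schema.
FAVORITE_SPORT_EVENT_SCHEMA = {
    "type": "record",
    "name": "FavoriteSportEvent",
    "namespace": "com.example.sport",
    "fields": [
        {"name": "event_id", "type": "string"},
        {"name": "customer_id", "type": "string"},
        {"name": "sport_id", "type": "string"},
        {
            "name": "event_type",
            "type": {"type": "enum", "name": "EventType", "symbols": ["ADDED", "REMOVED"]},
        },
        # Event time as milliseconds since the Unix epoch.
        {"name": "occurred_at", "type": {"type": "long", "logicalType": "timestamp-millis"}},
        # Optional field: a union with null and a null default.
        {"name": "source", "type": ["null", "string"], "default": None},
    ],
}


def missing_required_fields(record: dict, schema: dict) -> list:
    """List the schema fields that have no default and are absent from the record."""
    return [
        field["name"]
        for field in schema["fields"]
        if "default" not in field and field["name"] not in record
    ]


if __name__ == "__main__":
    # Serialise the schema the way it would be stored in an .avsc file.
    print(json.dumps(FAVORITE_SPORT_EVENT_SCHEMA, indent=2))
```

Giving optional fields a default, as with `source` here, is what allows a field to be added later without breaking readers that still use an older version of the schema.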

Sink to S3

The sink to the S3 bucket is built with an S3 connector.
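
For orientation, here is a minimal sketch of an S3 sink configuration, again built in Python. It assumes the Confluent S3 sink connector with Avro output and daily time-based partitioning. The connector actually used is the one linked above and may be a different implementation with different properties. The connector name, topic, bucket and region are hypothetical.

```python
import json


def build_s3_sink_config(name: str, topic: str, bucket: str, region: str) -> dict:
    """Return a Kafka Connect payload for an S3 sink.

    Assumption (not taken from this page): the Confluent S3 sink connector,
    writing Avro files partitioned by day.
    """
    return {
        "name": name,
        "config": {
            "connector.class": "io.confluent.connect.s3.S3SinkConnector",
            "topics": topic,
            "s3.bucket.name": bucket,
            "s3.region": region,
            "storage.class": "io.confluent.connect.s3.storage.S3Storage",
            # Keep the Avro encoding used on the topic for the stored files.
            "format.class": "io.confluent.connect.s3.format.avro.AvroFormat",
            # Number of records written to a file before it is committed to S3.
            "flush.size": "1000",
            # One directory per day, derived from the record timestamp.
            "partitioner.class": "io.confluent.connect.storage.partitioner.TimeBasedPartitioner",
            "path.format": "'year'=YYYY/'month'=MM/'day'=dd",
            "partition.duration.ms": "86400000",
            "locale": "en-US",
            "timezone": "UTC",
            "timestamp.extractor": "Record",
        },
    }


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    payload = build_s3_sink_config(
        "favorite-sport-s3-sink", "sport.favorite-sport", "example-sport-bucket", "eu-west-1"
    )
    print(json.dumps(payload, indent=2))
```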

Deployment

CDC streaming is managed by a Kafka Connect cluster available in the prod-gke-eu flux repository.
This Kafka Connect cluster is shared by all Sport public streaming flows.
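
Because the cluster hosts several connectors, a quick health check of one connector can be useful after a deployment. The sketch below reads the standard Kafka Connect REST endpoint `GET /connectors/{name}/status` and reports anything that is not in the RUNNING state. The base URL and the connector name are hypothetical, and whether the REST API is reachable from your environment is an assumption. The deployment itself is still driven by the flux repository.

```python
import json
import urllib.request


def connector_status_url(connect_url: str, connector: str) -> str:
    """Build the Kafka Connect REST URL that returns a connector's status."""
    return f"{connect_url.rstrip('/')}/connectors/{connector}/status"


def not_running_parts(status: dict) -> list:
    """Return the connector and task entries whose state is not RUNNING."""
    parts = []
    if status.get("connector", {}).get("state") != "RUNNING":
        parts.append("connector")
    for task in status.get("tasks", []):
        if task.get("state") != "RUNNING":
            parts.append(f"task-{task.get('id')}")
    return parts


def fetch_status(connect_url: str, connector: str) -> dict:
    """Call the Kafka Connect REST API and decode the JSON status document."""
    url = connector_status_url(connect_url, connector)
    with urllib.request.urlopen(url, timeout=10) as response:
        return json.load(response)


if __name__ == "__main__":
    # Hypothetical URL and connector name, for illustration only.
    status = fetch_status("http://kafka-connect.example.internal:8083", "favorite-sport-outbox")
    print(not_running_parts(status) or "all running")
```

Note that a deliberately paused connector is also reported, since its state is PAUSED rather than RUNNING.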