Data Architect Training Program

From data engineering to enterprise data architecture - 30 months of mentorship

8 phases - 30 months - Mentor with 30+ years of experience

Training Path - 8 Phases

1

Data Fundamentals

Advanced SQL, NoSQL and NewSQL databases, Python for data, and transformation pipelines

Months 1-4
Advanced SQL, PostgreSQL, MongoDB, Redis, Cassandra, Neo4j, Python, pandas
2

Modeling & Design

Kimball/Inmon data modeling, Data Vault 2.0, DAMA-DMBOK governance, and architecture patterns

Months 5-8
Star Schema, Kimball, Inmon, Data Vault 2.0, DAMA-DMBOK, SCD, Data Mesh, Data Fabric
3

Modern Data Stack

Cloud data warehouses, dbt, pipeline orchestration, and ingestion tools

Months 9-12
Snowflake, BigQuery, Databricks, dbt, Airflow, Dagster, Fivetran, Parquet
4

Cloud Data Platform Architecture

Lakehouse, TOGAF, data contracts, semantic layer, and FinOps for data

Months 13-16
Lakehouse, Iceberg, Delta Lake, TOGAF, AWS, GCP, Azure, FinOps
5

Advanced Data Engineering

Real-time streaming, data quality, observability, and DataOps CI/CD practices

Months 17-20
Kafka, Flink, Spark Streaming, Great Expectations, Soda, DataOps, CI/CD, lakeFS
6

Specialization & Leadership

Strategic certifications, project portfolio, communication, and data team management

Months 21-24
CDMP, SnowPro, GCP Data Engineer, Portfolio, ADR, Stakeholders
7

AI & Data Architecture

Feature stores, MLOps, vector databases, RAG, LLM infrastructure, and knowledge graphs

Months 25-27
Feature Store, MLflow, Vector DB, RAG, LLM, Neo4j, Ethical AI
8

Chief Data Architect

Enterprise data strategy, budget and ROI, team organization, and data-driven culture

Months 28-30
Data Strategy, Roadmap, Budget & ROI, CoE, Data Literacy, Self-Service

Target Certifications

CDMP - DAMA

Certified Data Management Professional - the benchmark certification for Data Architects

SnowPro Core

Snowflake - Architecture, Virtual Warehouses, Data Sharing

GCP Professional Data Engineer

Google Cloud - BigQuery, Dataflow, Pub/Sub, Dataproc

Databricks Data Engineer

Databricks - Spark, Delta Lake, Unity Catalog, Lakehouse

AWS Data Engineer

Amazon Web Services - S3, Glue, Redshift, Athena, EMR

Confluent Kafka

Apache Kafka - Streaming, Event-Driven Architecture

Essential Resources

Must-Read Books

Designing Data-Intensive Applications, The Data Warehouse Toolkit, Data Mesh, DAMA-DMBOK, Fundamentals of Data Engineering

Course Platforms

Coursera, DataCamp, dbt Learn, Snowflake Learn, Databricks Academy, Confluent Developer

Communities

dbt Community Slack, Data Engineering subreddit, DataTalks.Club, Locally Optimistic

Conferences

dbt Coalesce, Snowflake Summit, Databricks Data+AI Summit, Kafka Summit, Data Council