Foco
Engenharia de dados senior em lakehouse, analytics engineering e entrega em tempo real.
Cobertura cloud
AWS, Azure, GCP, dbt, Spark, Terraform, Kafka
Melhor uso
Melhor para processos internacionais, conversas de consultoria e clientes globais.
Michael Barbosa Santos
Senior Data Engineer | Spark, dbt, Airflow, Terraform | AWS, Azure, GCP | Lakehouse, Streaming, Analytics
- Location: Joao Pessoa, Brazil
- Email: eng.michaelbarbosa@hotmail.com
- LinkedIn: https://www.linkedin.com/in/michael-bs/
- GitHub: https://github.com/michael-eng-ai
Summary
Senior Data Engineer with experience designing and operating cloud data platforms across AWS, Azure, and GCP. Strong background in Spark/PySpark, SQL, dbt, Airflow, Terraform, Kafka, and dimensional modeling, with hands-on delivery in regulated and data-intensive environments. Focused on scalable pipelines, analytics enablement, governance, and production-grade data architecture.
Core Stack
- Spark and PySpark
- SQL
- dbt
- Airflow
- Terraform
- Kafka
- AWS, Azure, GCP
- Data modeling
- Streaming and batch pipelines
- Data quality and governance
Recent Experience
Five Acts | Data Engineer | 2024 - Present
- Designed and implemented AWS-based data platform flows using Spark, Glue, S3, Athena, Airflow, EMR, Lambda, and API Gateway.
- Worked on Snowflake-to-AWS data lake migration scenarios integrating MongoDB, APIs, and Kafka-based sources.
- Migrated analytics processes from Qlik Sense to Databricks on Azure using Python, SQL, and Spark.
Act Digital | Data Engineer | 2023 - 2024
- Supported migration of Azure and Databricks data platform workloads integrated with Data Factory, GCP services, and dbt for banking fraud investigation use cases.
- Built data movement flows from SQL Server and MongoDB into an Oracle data warehouse using Kafka, Python, and PowerCenter.
Vert | Data Engineer | 2022 - 2023
- Built analytics-focused ETL pipelines in PySpark on Cloudera.
- Automated regulatory financial processes on GCP using Dataform, BigQuery, and Dataflow.
Selected Projects
- kafka-debezium-dbt
- aws-databricks-lakehouse
- gcp-dbt-modern-data-stack
- azure-snowflake-pipeline
- streaming-kafka-fastapi
- AI-Data-Analyst-Bot