Latest Content

African super app Yassir delivers on data with BigQuery migration
Mar 06, 2025
Article

• 13 years of experience in Information Technology, with 9+ years specializing in the Big Data ecosystem and a wide range of associated tools and technologies.
• Expertise in designing robust data architecture solutions, ensuring top-tier data integrity and security.
• Expert in building data quality frameworks with diverse tools and technologies, ensuring reliable data for downstream analytics systems.
• Skilled in conceptualizing, designing, developing, and productionizing Apache Spark applications, ETL processes, data warehouses, and data models, and in recommending optimal solutions for applications and analytics systems.
• Strong hands-on experience in Data Governance:
- Defining data quality rules, standards, and policies
- Data cataloging and management
- Handling PII and sensitive data
- Data auditing, management, and reporting to stakeholders and the leadership team
• Proficient in deriving data strategies and roadmaps that directly support and advance business objectives.
• Deep understanding of Spark optimization techniques and extensive experience in writing Spark jobs in Scala and Python.
• Certified AWS Solutions Architect with strong exposure to various AWS data services.
• Proficient in developing and optimizing batch and streaming jobs on the Databricks analytics platform, with proven experience in cluster optimization and cost-efficiency.
• Strong programming knowledge in Scala, Python, and Java 8, with a focus on building high-performance, scalable data solutions.
• Passionate about driving data-driven transformation and delivering innovative, scalable, and efficient data solutions.

Skills:
• Data Ecosystems: Data Management, Data Governance, Data Engineering, Data Analytics, Data Warehousing, Data Lakehouse, Data Modeling, Spark, Streaming, Kafka, Databricks, dbt, Snowflake, SQL, NoSQL (MongoDB, Cassandra)
• AWS: S3, IAM, EMR, Kinesis, Glue, Lambda, Redshift, Athena
• GCP: GCS, Dataproc, Dataform, Dataflow, BigQuery, Looker, Cloud Composer, Dataplex
• Azure: ADLS, ADF, Synapse, Purview, Functions, Cosmos, Key Vault, Azure DevOps

Let’s connect to discuss how I can help transform your data initiatives with cutting-edge engineering practices!
