Data Transformation and Data Modeling
Efficient transformation of data to building data models that enable business intelligence.
Data Engineer, with a specialty in organizing and optimizing information for businesses. From creating efficient databases to ensuring seamless data flow, I transform raw data into valuable insights to drive informed decision-making. Let's turn your data into a strategic asset.
Dedicated to crafting Data Solutions with an emphasis on scalable data architectures. I specialize in implementing robust data pipelines, designing efficient databases, and creating analytics solutions to meet the unique needs of businesses.
Efficient transformation of data to building data models that enable business intelligence.
Robust databases to ensure seamless data flow and accessibility.
Efficient organization and optimization of data from different sources and building scalable data pipelines, ensuring data quality for actionable insights.
Designed and deployed a fully orchestrated ETL pipeline to extract Reddit posts, transform them using AWS Glue, and make the data queryable via Athena and Redshift Spectrum. Utilized Apache Airflow, Docker, Terraform, and AWS (S3, Glue, Athena Workgroup, IAM, VPC, Redshift). This pipeline enabled scalable ingestion and transformation of social media data for downstream analytics.
Developed a robust data platform to streamline data ingestion, transformation, and storage for predictive analytics. This enabled a travel agency’s data science team to forecast travel demand and identify high-potential markets. Utilized Docker, Apache Airflow, Terraform, AWS (VPC, S3, ECR, SSM, Redshift), and dbt.
Built a secure ELT pipeline to process over 1 million global health records on Google Cloud Platform, enabling country-specific data access and analysis of diseases lacking treatment or vaccination. Automated data ingestion from GCS to BigQuery using Apache Airflow, and transformed data into clean, analysis-ready tables
Built an ETL data pipeline that automated the retrieval of upcoming rocket launch images for a space enthusiast, using the Launch Library 2 API. This streamlined access to up-to-date rocket visuals is achieved by storing image URLs in a structured format. Leveraged Apache Airflow for orchestration, Docker for containerization, and the launch API library for data retrieval.
This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system in MySQL into an OLAP system in BigQuery, using dbt as a data transformation tool.
This project involved creating a comprehensive database using PostgreSQL to manage customer information for a bank's marketing campaigns. I used Python to import, clean, and load data, ensuring its quality and reliability, and authored scripts to set up database tables.
In this project, I utilized SQL to analyze The Look's data; an e-commerce clothing store and answered some business questions regarding the performance of the e-commerce marketplace, gain insights, and provided some recommendations to increase revenue.
This case study is all about calculating metrics, growth and helping Data Bank; a FinTech startup analyse their data in a smart way to better forecast and plan for their future developments.
In this project, using Python and its libraries, I built a web scrapper, scrapped and analyzed top 10 Cryptocurencies live data from Cryptowatch.
This case study centers on leveraging Foodie-Fi’s digital data, which follows a subscription-based model, to analyze critical business metrics relating to the customer journey, payment transactions, and overall business performance.
I analyzed data from Adventure Works database and conducted a Sales Performance Analysis, answering some pertinent business questions.
Parch and Posey is an e-commerce paper selling company that sold 3 diffetrent types of papers to companies(accounts) via different channels at diferent point in time in different regions.
In this project, I web-scrapped Amazon best-selling books using Selenium and Beautiful Soup. I wrangled and analyzed the best selling books beginning from 2009 to 2019.
"In God we trust, all others must bring data" - W. Edwards Deming
In this project, I queried Twitter API using Tweepy, wrangled and analyzed the tweet archive of Twitter user @dog_rates, also known as WeRateDogs.
In this project, using Python and its libraries, I wrangled and analyzed more than 100k Medical Appointments in Brazil to derive insights on the reasons why patients showed up or not for their appointments.
Who Grows and who Eats the food we grow in Africa? The project focuses on finding insights, patterns and trends through visualizations on food Production and Supply data in Africa using Seaborn and Matplotlib.
I participated in a hackathon and worked on the case study; Tackling the Health Crisis in Africa and provided recommendations on how to curb this crisis using Power BI.
The CMO of VarnArsdel, a fictitious manufacturing company wanted to keep an eye on the company's performance both internally and externally.
As the Business Intelligence Analyst, I analyzed the sales and market share data of the company and its competitors and built an interactive Excel dashboard answering the relevant business questions.
I will read all emails. Send me any message you want and I'll get back to you.
I need your Name and Email Address, but you won't receive anything other than your reply.