Hi, I'm Nhlakanipho—a Data Engineer crafting scalable ETL solutions for smarter workflows and insights.

Profile picture of Nhlakanipho Ngubo

What I Offer

Certified in Business Analysis, Python, and Data Engineering, I build scalable ETL pipelines that turn raw data into actionable insights. From web scraping and data wrangling to API development and workflow optimization, I craft efficient data solutions that empower businesses to make smarter decisions.

Need expert ETL solutions? You've come to the right place! Let's streamline your data and unlock powerful insights.

Latest Projects

ETL: Top Banks

Bank building Image

Specifications

  • Programming Language:
    • Python logo Python
  • Database:
    • SQLite logoSQLite
  • Development Environment:
    • Jupyter logo Jupyter Notebook
  • Frameworks / Libraries:
    • Pandas logoPandas
    • Numpy iconNumpy
    • BeautifulSoup iconBeautifulSoup
    • Requests IconRequests

Description

Automates financial data extraction, transformation, and storage, ensuring structured insights for efficient analysis.

Key Features

  • Scrapes and ranks banks, converting market capitalization values for consistency.
  • Adds currency conversions (GBP, EUR, INR) for global financial tracking.
  • Stores refined data in a CSV file and SQLite database for seamless querying and decision-making.

Flask: CompTrack API

CompTrack API Image

Specifications

  • Programming Language:
    • Python logo Python
  • Database:
    • SQLite logoSQLite
  • Frameworks / Libraries:
    • Bug On Screen ImageUnittest
    • Flask LogoFlask
    • Flask LogoFlask-SQLAlchemy

Description

Streamlines computer hardware data collection for smarter resource management and analytics integration.

Key Features

  • Organizes and structures computer specifications for easy access.
  • Enhances scalability for seamless data handling in analytics workflows.
  • Optimizes precision in hardware tracking for informed decision-making.

MongoDB: Visitor Admin

Visitor Admin Image

Specifications

  • Programming Language:
    • Python logo Python
  • Database:
    • MongoDB logo MongoDB
  • Containerization:
    • Docker logo Docker
  • Frameworks / Libraries:
    • Bug On Screen ImageUnittest
    • MongoDB LogoPymongo
    • MongoDB LogoBson
    • MongoDB LogoMongomock

Description

Captures visitor credentials, ensuring efficient data collection and retrieval.

Key Features

  • Stores visitor data in MongoDB with structured indexing for quick lookups.
  • Lays the foundation for advanced workflows and seamless data access.

Pandas: Data Wrangling

Data Wrangling Image

Specifications

  • Programming Language:
    • Python logo Python
  • Development Environment:
    • Jupyter logo Jupyter Notebook
  • Frameworks / Libraries:
    • Pandas logoPandas

Description

Transforms messy datasets into clean, structured formats to enable predictive insights.

Key Features

  • Cleans and standardizes personality scores and department data.
  • Identifies high / low-risk learners, supporting performance predictions.
  • Guides proactive decision-making, ensuring effective learner placement.

PostgreSQL: Shop Database

Shop Database Image

Specifications

  • Query Language:
    • SQL logoSQL
  • Database:
    • PostgreSQL logoPostgreSQL
  • Containerization:
    • Docker logo Docker

Description

A scalable relational database designed for optimized querying and analytics workflows.

Key Features

  • Ensures data integrity with structured relations and key constraints.
  • Supports ETL processes for seamless data retrieval and transformation.
  • Applies clean coding practices, ensuring long-term maintainability.

Technologies I Use

Visual Studio Code

GitHub

HTML

CSS

Git

Certificates

Umuzi Academy | National Certificate: Business Analysis Support Practice NQF Level 5

Business Analysis Support Practice Certificate

Bringing together analytical thinking and creative problem-solving to research technical challenges and develop effective solutions within consulting environments. Business needs are translated into clear, actionable specifications while troubleshooting is conducted with precision. Adaptable communication skills and a structured approach support collaborative Data Engineering, optimizing pipelines and integrating solutions for meaningful results.

IBM Certificate: Python for Data Science, AI, and Development

Python for Data Science IBM Certificate

Built a strong foundation in Python with a focus on critical data structures, programming logic, and core libraries used in Data Science workflows. Applied these skills to manipulate and analyze datasets, and to develop basic data-driven applications. Gained hands-on experience with essential tools such as pandas, and numpy, reinforcing my readiness to contribute to data engineering tasks such as data wrangling, transformation, and integration within ETL pipelines.

IBM Certificate: Python Project for Data Engineering | Top Banks

Python project for Data Engineering IBM Certificate

Developed a data pipeline for banking sector analysis, extracting financial data via APIs and web scraping. Transformed datasets across formats, applied structured logging for ETL tracking, and prepared analysis-ready data for repository loading. This project showcases practical Python-based Data Engineering skills, scalability in pipeline design, and initiative in applying industry-relevant techniques.

Additional Skills

Reviewed 111+ Pull Requests, ensuring high coding standards.

Completed 46+ projects, demonstrating expertise in scalable data solutions.

Solved 100+ problems across multiple coding platforms, sharpening problem-solving skills.

Experienced in Agile workflows, leading peer learning through POD sessions, and creating clear documentation for seamless project onboarding.

Contact Me

Let's build powerful, scalable data solutions that drive real impact. Got a project in mind? Let's connect—I'm ready to help!

mpilongubo07@gmail.com: