Data Engineer · Ho Chi Minh City

Nguyen Dang Hong Huy

I build scalable data platforms, enterprise ETL/ELT pipelines, and modern lakehouse systems for high-volume analytics.

  • 4+ Years Experience
  • 95% Latency Reduction
  • 60% Cost Reduction
Pipeline SLA 99.9%
Lakehouse Bronze - Silver - Gold

About

Data platforms designed for scale, reliability, and business reporting.

Data Engineer with 4+ years of experience designing and building scalable data platforms, data warehouses, and large-scale ETL/ELT pipelines. Experienced in processing billions of records with Apache Spark, Airflow, Kafka, and cloud-based data architectures. Skilled in implementing Medallion Data Platforms (Bronze–Silver–Gold), optimizing distributed data pipelines, and building enterprise reporting systems that support high-performance analytics and data-driven decision-making. Passionate about distributed systems, real-time data processing, and modern Data Lakehouse architectures, with a strong focus on performance, scalability, and reliability.

AI & Data Engineering

AI-Ready Lakehouse Architecture

AI-Driven Processing

By dividing the data pipeline into distinct layers, I ensure data is primed for Machine Learning. Raw data lands in the Bronze layer, is cleaned in the Silver layer, and then passes through AI/ML processing for predictions; the results are served as high-performance insights in the Gold layer.
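The layer flow described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not my production code: the transaction fields (`tx_id`, `merchant`, `amount`), the function names, and the sample rows are all hypothetical, and the "AI/ML" step is a simple mean-based outlier flag standing in for a real model. In practice these steps would run as Spark jobs over Delta tables rather than in-memory lists.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Transaction:
    tx_id: str
    merchant: str
    amount: float


# Hypothetical raw rows as they might land in the Bronze layer.
SAMPLE_BRONZE = [
    {"tx_id": "t1", "merchant": " Cafe ", "amount": "10.0"},
    {"tx_id": "t2", "merchant": "cafe", "amount": "12.0"},
    {"tx_id": "t2", "merchant": "cafe", "amount": "12.0"},  # duplicate
    {"tx_id": "t3", "merchant": "shop", "amount": "bad"},   # malformed amount
    {"tx_id": "t4", "merchant": "shop", "amount": 100},
]


def to_silver(bronze_rows):
    """Silver: validate, normalize, and deduplicate raw Bronze rows."""
    silver, seen = [], set()
    for row in bronze_rows:
        tx_id = row.get("tx_id")
        merchant = (row.get("merchant") or "").strip().lower()
        try:
            amount = float(row.get("amount"))
        except (TypeError, ValueError):
            continue  # skip rows whose amount cannot be parsed
        if not tx_id or not merchant or tx_id in seen:
            continue  # skip incomplete rows and duplicates
        seen.add(tx_id)
        silver.append(Transaction(tx_id, merchant, amount))
    return silver


def flag_outliers(silver, factor=2.0):
    """ML stand-in: flag amounts above `factor` times the overall mean."""
    if not silver:
        return []
    threshold = factor * mean([tx.amount for tx in silver])
    return [(tx, tx.amount > threshold) for tx in silver]


def to_gold(scored):
    """Gold: aggregate per merchant for reporting."""
    gold = {}
    for tx, flagged in scored:
        entry = gold.setdefault(
            tx.merchant, {"total": 0.0, "count": 0, "flagged": 0}
        )
        entry["total"] += tx.amount
        entry["count"] += 1
        entry["flagged"] += int(flagged)
    return gold


def run_pipeline(bronze_rows):
    """Bronze -> Silver -> ML scoring -> Gold."""
    return to_gold(flag_outliers(to_silver(bronze_rows)))


if __name__ == "__main__":
    print(run_pipeline(SAMPLE_BRONZE))
```

Keeping each layer as its own function mirrors the point of the layered design: validation, scoring, and aggregation can be tested and rerun independently.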

Creative Builder

Turning messy data problems into clear, useful systems.

I like working at the intersection of engineering discipline and creative problem-solving: mapping business questions, shaping reliable pipelines, and building tools that make data easier for teams to use.

Experience

Recent Work

Jan 2026 - Present

Data Engineer - Galaxy Pay

Ho Chi Minh City

  • Architected a Medallion Data Platform on Amazon S3, Spark, Delta Lake, Airflow, and Snowflake.
  • Built ETL/ELT pipelines for ingestion, validation, transformation, and serving.
  • Developed Data Quality and Observability dashboards with FastAPI and ReactJS.
  • Reduced data latency by 95% and infrastructure cost by 60%.

Mar 2024 - Dec 2025

Data Engineer - HDBank

Ho Chi Minh City

  • Implemented enterprise ETL/ELT pipelines for data warehouse and reporting systems.
  • Built a dynamic report builder for on-demand report creation without redeployment.
  • Developed Excel/PDF generation, storage, cache, and permission management services.
  • Migrated legacy SQL procedures to Spark jobs, improving performance by 30-70%.

Oct 2021 - Oct 2023

Data Engineer & Full-stack Developer - Fujinet Systems

Ho Chi Minh City

  • Built real-time sales analytics using Kafka, Spark Streaming, and Cassandra.
  • Designed batch-processing pipelines on Hadoop/HDFS for large-scale workloads.
  • Created analytics features with Python, AWS services, ReactJS, FastAPI, NodeJS, and PostgreSQL.

Skills

Technical Stack

Architecture

Lakehouse, Data Lake, Data Warehouse, Medallion, Data Modeling, Governance

Big Data

Apache Spark, Kafka, Airflow, Delta Lake, Apache Iceberg, Databricks, Hadoop, HDFS

Storage

S3, MinIO, PostgreSQL, Oracle, MongoDB, Cassandra, MySQL

Engineering

Python, SQL, JavaScript, FastAPI, ReactJS, NodeJS, Docker, Git, CI/CD

Cloud & BI

AWS ECS, AWS EKS, Kinesis, AWS Glue, RDS, Snowflake, PowerBI, Superset

Soft Skills

Teamwork, Problem-solving, Ownership, Leadership, English Communication, Multitasking

Projects

Selected Projects

ETL

Optimizing ETL Processes for Data Warehouse

Designed and optimized ELT pipelines for enterprise DWH using Spark, Airflow, Oracle, and Python.

Spark, Airflow, Oracle, Python

Reduced ETL runtime by 40-60% and compute cost by about 25%.

RT

Real-Time Sales Analytics Pipeline

Built a Kafka to Spark Streaming to Cassandra pipeline with dashboards for real-time sales monitoring.

Kafka, Spark Streaming, Cassandra, VueJS

Achieved 3-5s end-to-end latency for business visibility.

Education

Academic Background

  • Master of Computer Science, HCMUS - 2023-Ongoing - GPA 8.20
  • B.S. Information Systems, HCMUS - 2018-2022 - GPA 8.33

Certifications

Professional Learning

  • AWS - Azure Databricks Platform Architect - Issued Oct 2025
  • Databricks with AI Agent Fundamentals - Issued Oct 2024
  • Google Data Analytics Specialization - Issued Jan 2023

Hobbies

Traveling, observing systems, and sharing what I learn.

01

Travel

I enjoy exploring new cities, local culture, and everyday systems. Travel gives me fresh perspectives on how people move, decide, work, and use technology.

02

Sharing Experience

I like turning practical lessons into short notes, videos, and blog posts so other engineers can avoid common mistakes and learn faster.

03

Creative Work

Outside of data platforms, I care about visual storytelling, personal branding, and making technical ideas easier to understand.

Channels

Where I share data engineering notes.

Contact

Available for data engineering and platform roles.

I am interested in building reliable data infrastructure, real-time processing systems, and analytics platforms that help teams move faster.