Shawn Ng

Shawn Ng

Data Engineer

I'm Shawn Ng. I am a Data Engineer from Singapore.

I start this website to keep track of my data and software projects. I hope that you find something useful here.

My main image

What I do

Here is an overview of my technical skill sets

Data Engineering / Science

  • Python
  • Apache (Spark, Airflow, Hadoop)
  • SQL, NoSQL, Graph Database
  • Web Scraping
  • Data Visualization
  • Machine Learning
  • Natural Language Processing

Cloud Engineering

  • Linux
  • Amazon Web Service
  • Google Cloud Platform
  • Heroku

Backend Engineering

  • Django
  • Ruby on Rails
  • Express.js

Frontend Engineering

  • Bootstrap
  • JavaScript

Latest Blog Posts

Apache Spark VS Pandas VS Koalas

Apache Spark is an open-source unified analytics engine for large-scale data processing. Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language. Koalas is Pandas API on Apache Spark.

Read more →