avatar
Dong Wang
Data Scientist

Machine learning, predictive analytics, dashborading
Handle data covering its full cycles
ETL pipeline, survey design, predictive modelling, geospatial analysis, visualization, auto-reporting

Skills

R

  • Shiny
  • data.table
  • purrr
  • dplyr
  • rMarkdown
  • ggplot2
  • SQL in R

Python

  • pandas
  • scikit-learn
  • matplotlib
  • bokeh
  • statsmodels
  • web scraping survey monkey

A.I.

  • neural netwoks
  • SVM
  • bagging
  • bootstrapping
  • XGBoost
  • classification
  • keras

Stats Prob.

  • gam glm
  • PCA
  • sampling
  • time series
  • hypothesis test
  • mixed model

Web & Database

  • plotly.js
  • leaflet.js
  • npm
  • css: grid, flex
  • MySQL

Present analytic findings with easily digestible pieces for both lay and professional audience

Experience
DOC, New Zealand
Statistician/Data Scientist
2017 -

Sensitivity Analyses on complex system

Web Scraping using Python on Survey Monkey

  • Automate data extraction, using SurveyMonkey API in Python, building data pipelines

Shiny dashborad: Extract, aggregate and visualize data from database

Business report generating

Use R and PowerBI to manage data in Azure SQL Database

NIWA, New Zealand
Predictive Modeller
2012 - 2017
Lincoln University, New Zealand
Modeller
2009 - 2012

Individual based modelling

Education
City University of Hong Kong
PH.D
2002 - 2007

Develop and apply new machine learning models in air pollution study

Interests
icon icon icon icon