All-round data professional with more than 25 years' experience in RDBMS, including SQL Server and MySQL. I am flexible and will fit into your team. Below is my summary:
Highly effective technical leader with over 25 years of experience. Andrew Kim specialises in data integration, data conversion, data engineering, ETL, big data architecture, data analytics, data visualization, data science, analytics platforms, and cloud architecture.
TECHNICAL SKILLS
• Big Data (Hortonworks and Cloudera) – Spark (PySpark, Scala), Kafka, Hive, Impala, NiFi, HDFS, Sqoop, Ranger, YARN, Solr, SAM, Schema Registry, Superset
• Languages – Python, Scala, R, JavaScript
• Data Visualization – Tableau, PowerBI, OBIEE, DOMO
• Splunk & ELK Stack – Elasticsearch, Logstash, Filebeat, Kibana
• AWS – S3, Redshift, DynamoDB, Athena, Kinesis, EMR, Aurora, Glue
• Azure – Data Warehouse, PolyBase, SQL Server, HDInsight, SSIS, SSAS
• ETL – Informatica PowerCenter, Informatica Big Data Management (BDM), SSIS
• Data Science & Engineering – R / RStudio / SparkR (packages: dplyr, ggplot2, stringr, plyr, caret, SparkR, NLP, tibble, TensorFlow, curl); Python / PySpark, MLlib
• Libraries – MLlib, NumPy, SciPy, Pandas, Matplotlib, Seaborn, scikit-learn
• DBMS / OLAP – Oracle, SQL Server, Teradata, MySQL, Postgres, Essbase, SSAS
• Data Modelling – Kimball, Data Vault
• Machine Learning / Statistical Modelling – Linear Regression, Logistic Regression, Classification and Regression Trees, Naive Bayes, K-Nearest Neighbors