Jazz Yao-Tsung Wang

Data Engineering Evangelist / Hybrid Cloud Practitioner

Career Summary

A multi-domain, cross-functional fast learner.
"Data-driven" Evangelist since 2008. Hybrid Cloud Practitioner since 2010. (AWS, GCP, Azure)

Act in different roles, including People Management, Product Management, Project Management, Sales Consult, Solution Architect, Data Architect, Cloud Admin, System Admin, Network Admin, and Developer.

Serve different industries, such as Healthcare IT, AdTech (RTB), Bank, 3rd-party payment, Retail, Gaming, E-commerce, Telecom, Semiconductor, Bioinformatics, Academic Research, etc.

Work Experience

Senior Director, Head of Delivery, TDC

Innova Solutions
2024/05 ~ Now

Director of Engineering

2021/12 ~ 2024/04

Senior Software Engineering Manager

2018/07 ~ 2021/12
2018/07 ~ 2023/07
(3rd party workforce of) Change Healthcare
2023/08 ~ 2024/03
(contractor of) Optum, Unitedhealth Group

Roles:

  • People Management: recruit and manage 9 Scrum teams, 84+ members
  • Stakeholder Management: communicate with differemt roles across 4 organizations.
  • Architect: support data architecture design and troubleshoot production issues.

Achievements:

  • [AI POC] Hospital Price Transparency Chatbot with LangChain and GPT-3.5. 🖥️
  • [Data] Build HL7 C-CDA and FHIRv4 Synthetic Test Data Generator with Synthea.
  • [Culture] Build "Confluence Insight" to observe team development progress.
  • [Scrum] Establish the "Asynchronous Communication Model"x for 5 teams.
  • [Innovation] Submit 3 Invention Disclosure.

Soft skills used:

  • Conflict Resolution
  • Negociateion
  • Facilitator
  • Retrospective
  • Distributed Team

Technologies used:

  • AWS EMR
  • AWS Glue
  • AWS RDS (MySQL/PostgreSQL)
  • AWS ECS
  • AWS Lambda
  • Apigee
  • Apache Spark
  • Scala
  • Python
  • Golang
  • LangChain
  • Chainlit
  • OpenAI GPT-3.5/4

Data Architect

TenMax AdTech Lab Co. Ltd.
2016/04 ~ 2018/06

Roles:

  • Cloud/Net/Sys Admin: manage on-premises data center infrastructure and automation deployment Ansible script to Azure & GCP.
  • Architect: refine data pipeline. Reduce operational cost. Design and implement full-stack monitoring and notification architecture.

Achievements:

  • [Data] migrate data pipeline to Kafka. Reduce 65% of Cassandra Cost.
  • [DevOps] Enable Full Stack Monitoring and Notification with Prometheus.
    Reduce MTTR from hours to 10 minutes.
  • [Law] Write GDPR documents for audience and publisher.

Technologies used:

  • Azure
  • GCP L7 loadbalancer
  • Prometheus
  • Grafana
  • Docker
  • Ansible
  • Java

Assistant Vice President, Product Management

Etu Corporation, SYSTEX Group
2014/02 ~ 2016/03

Roles:

  • Product Owner: define data platform roadmap and specification of Etu Manager 2.5 & 3 software appliance based on Cloudera Manager.
  • Pre-sales & Solution Architect: team with Sales and BD to provide solutions.
  • Architect & Evangelist: promoting ASF Big Data ecosystem in Taiwan.

Achievements:

  • [Product] build and sell Etu Manager to 5 customers.
  • [Profit] covers 50% annual revenue in 2015.
  • [Innovation] receive 2 TW and US patents.

Technologies used:

  • Cloudera Manager
  • Cloudera CDH
  • Apache Hadoop
  • Apache Hive
  • Apache HBase
  • Apache Sqoop
  • Apache Impala
  • PostgreSQL
  • Bootstrap
  • Docker
  • VirtualBox
  • CentOS
  • Puppet

Associate Researcher

National Center for High-Performance Computing
2003/01 ~ 2014/02

Roles:

  • Principal Investigator (PI): manage up to 4 teams, 25 members for government-funded projects.
  • Project Manager: manage projects from university, ITRI, TWNIC, MiTAC (TRTC), etc.
  • System Admin: maintain Hadoop Cluster, underwater ecology observation system (video streaming for coral reef) and Agriculture Grid.
  • Developer: develop hadoop4win, drbl-hadoop, etc.

Achievements:

  • [Embedded] develop the Wireless Service Unit (WSU) of TRTC (Metro Taipei).
  • [Hadoop] build "Hadoop as a service", 4000+ registered users (2009/03 ~ 2014/04).
  • [DRBL] 2008 Award for Outstanding Contributions in Science and Technology.
  • [Innovation] receive 1 TW patent .

Technologies used:

  • PC Cluster
  • PXE Boot
  • Apache Hadoop
  • Xen
  • KVM
  • Grid Computing
  • Google Map API
  • jQuery
  • PHP
  • Access Grid
  • Video Streaming
  • Sensor Network
  • Fast Roaming
  • C/C++
  • NSYS
  • Shell Script

Skills & Tools

Cloud

  • AWS
  • GCP
  • Azure

Data

  • Hadoop
  • Spark
  • Kafka

NoSQL/SQL

  • HBase
  • Cassandra
  • MySQL
  • PostgreSQL

Language

  • Shell Script
  • Java / Scala
  • Python
  • PHP
  • C/C++

Others

  • DevOps
  • Git / SVN / CVS
  • Selenium
  • Linux Kernel Module
  • Video Streaming
  • Data Grid
  • Distributed File System
  • IEEE 1516
  • Sensor Network

Education

  • MSc in Electrical and Control Engineering
    National Chiao-Tung University, Taiwan
    2000/09 ~ 2002/08
  • BSc in Electrical and Control Engineering
    National Chiao-Tung University, Taiwan
    1996/09 ~ 2000/08

Patents

  • TW I550418
    Issued: 2016-06-16
    METHOD, APPARATUS, AND APPLICATION SYSTEM FOR REAL-TIME PROCESSING THE DATA STREAMS
  • TW I530808
    Issued: 2016-03-08
    System and Method for Providing Instant Query
  • TW I307599
    Issued: 2009-03-11
    Design of underwater long-term monitoring device that can delay biological proliferation

Awards

  • 2008 行政院科技貢獻獎
    2008 Award for Outstanding Contributions in Science and Technology.
  • 2007
    DRBL won first place in the 'Public Sector Applications' category at the Free Software Contest in France.

Publications

  • 2016/2013/2011
    Hadoop The Definitive Guide (Traditional Chinese Translation), 4e/3e/2e
    ISBN: 9789864761364 (4e)
    ISBN: 9789862766682 (3e)
    ISBN: 9789862762967 (2e)
  • 2014
    Hadoop Operations, 1e ( Tranditional Chinese Translation )
    ISBN: 9789862769973