Jazz Yao-Tsung Wang

Data Engineering Evangelist / Hybrid Cloud Practitioner

Career Summary

A multi-domain, cross-functional fast learner.
"Data-driven" Evangelist since 2008. Hybrid Cloud Practitioner since 2010. (AWS, GCP, Azure)

Act in different roles, including People Management, Product Management, Project Management, Sales Consult, Solution Architect, Data Architect, Cloud Admin, System Admin, Network Admin, and Developer.

Serve different industries, such as Healthcare IT, AdTech (RTB), Bank, 3rd-party payment, Retail, Gaming, E-commerce, Telecom, Semiconductor, Bioinformatics, Academic Research, etc.

Work Experience

Director of Engineering

Innova Solutions Taiwan
2021 December - Present

Senior Software Engineering Manager

2018 July - 2021 December
Innova Solutions Taiwan
2018 July - 2024 March
(3rd party workforce of) Change Healthcare
2023 August - 2024 March
(contractor of) Optum, Unitedhealth Group

Roles:

  • People Management: recruit and manage up to 7 Scrum teams, 50+ members in Taipei.
  • Stakeholder Management: communicate with SVPs, VPs, directors, managers within 2 organizations.
  • Architect: support data ingestion architecture design and troubleshoot production issues.

Achievements:

  • [AI] Build Hospital Price Transparency Chatbot with LangChain, Chainit, OpenAI API and SQLite.
  • [Data] Build HL7 C-CDA and FHIRv4 Test Data Generator for Interoperability products.
  • [Culture] Build a "Confluence Insight" dashboard to observe team development progress.
  • [Scrum] Establish the "Communication Model" and "Working Model" for 5 teams.

Soft skills used:

  • Conflict Resolution
  • Negociateion
  • Facilitator
  • Retrospective
  • Distributed Team

Technologies used:

  • AWS EMR
  • AWS Glue
  • AWS RDS (MySQL/PostgreSQL)
  • AWS ECS
  • AWS Lambda
  • Apigee
  • Apache Spark
  • Scala
  • Python
  • Golang
  • LangChain

Data Architect

TenMax AdTech Lab Co. Ltd.
2016-04 - 2018-06

Roles:

  • Cloud Admin/NetAdmin: manage on-premises data center infrastructure and automation deployment Ansible script to Azure & GCP.
  • Architect: refine existing data pipeline to reduce operational cost. Design and implement full-stack monitoring and notification architecture.

Achievements:

  • [Data] migrate data pipeline to Kafka. Reduce 65% of Cassandra Cost.
  • [DevOps] Enable Full Stack Monitoring and Notification with Prometheus. Reduce MTTR from hours to 10 minutes.
  • [Law] Write GDPR documents for audience and publisher.

Technologies used:

  • Azure
  • GCP L7 loadbalancer
  • Prometheus
  • Grafana
  • Docker
  • Ansible
  • Java

Assistant Vice President, Product Management

Etu Corporation, SYSTEX Group
2014-02 - 2016-03

Roles:

  • Product Owner: define product roadmap and specification of Etu Manager 2.5 & 3 software appliance based on Cloudera Manager.
  • Pre-sales & Solution Architect: team up with sales and BD to provide solutions to customers.
  • Architect & Evangelist: promoting Apache Hadoop ecosystem technologies and applications in Taiwan.

Achievements:

  • [Product] build and sell Etu Manager to 5 customers.
  • [Product] covers 50% annual revenue in 2015.
  • [Innovation] receive 2 TW and US patents.

Technologies used:

  • Cloudera Manager
  • Cloudera CDH
  • Apache Hadoop
  • Apache Hive
  • Apache HBase
  • Apache Sqoop
  • Apache Impala
  • PostgreSQL
  • Bootstrap
  • Docker
  • VirtualBox
  • CentOS
  • Puppet

Associate Researcher

National Center for High-Performance Computing
2003-01 - 2014-02

Roles:

  • Principal Investigator(PI): manage up to 4 teams, 25 members for government-funded projects.
  • Project Manager: manage projects from university, ITRI, TWNIC, MiTAC (TRTC), etc.
  • System Admin: maintain Hadoop Cluster, underwater ecology observation system (video streaming for coral reef).
  • Developer: develop hadoop4win, drbl-hadoop, etc.

Achievements:

  • [Embedded] develop the Wireless Service Unit (WSU) of TRTC (Metro Taipei).
  • [Hadoop] build "Hadoop as a service", 4000+ registered users (2009-03 ~ 2014-04).
  • [DRBL] 2008 Award for Outstanding Contributions in Science and Technology.
  • [Innovation] receive 1 TW patent .

Technologies used:

  • PC Cluster
  • PXE Boot
  • Apache Hadoop
  • Xen
  • KVM
  • Grid Computing
  • Google Map API
  • jQuery
  • PHP
  • Access Grid
  • Video Streaming
  • Sensor Network
  • Fast Roaming
  • C/C++
  • NSYS
  • Shell Script

Skills & Tools

Cloud

  • AWS
  • GCP
  • Azure

Data

  • Hadoop
  • Spark
  • Kafka

NoSQL/SQL

  • HBase
  • Cassandra
  • MySQL
  • PostgreSQL

Language

  • Shell Script
  • Java / Scala
  • Python
  • PHP
  • C/C++

Others

  • DevOps
  • Git / SVN / CVS
  • Selenium
  • Linux Kernel Module
  • Video Streaming
  • Data Grid
  • Distributed File System
  • IEEE 1516
  • Sensor Network

Education

  • MSc in Electrical and Control Engineering
    National Chiao-Tung University, Taiwan
    2000 - 2002
  • BSc in Electrical and Control Engineering
    National Chiao-Tung University, Taiwan
    1996 - 2000

Paterns

  • TW I550418
    Issued: 2016-06-16
    METHOD, APPARATUS, AND APPLICATION SYSTEM FOR REAL-TIME PROCESSING THE DATA STREAMS
  • TW I530808
    Issued: 2016-03-08
    System and Method for Providing Instant Query
  • TW I307599
    Issued: 2009-03-11
    Design of underwater long-term monitoring device that can delay biological proliferation

Awards

  • 2008 行政院科技貢獻獎
    2008 Award for Outstanding Contributions in Science and Technology.
  • 2007
    DRBL won first place in the 'Public Sector Applications' category at the Free Software Contest in France.

Publications

  • 2016/2013/2011
    Hadoop The Definitive Guide (Traditional Chinese Translation), 4e/3e/2e
    ISBN: 9789864761364
  • 2014
    Hadoop Operations, 1e ( Tranditional Chinese Translation )
    ISBN: 9789862769973