Jazz Yao-Tsung Wang
Data Engineering Evangelist / Hybrid Cloud Practitioner
Career Summary
A multi-domain, cross-functional fast learner.
"Data-driven" Evangelist since 2008. Hybrid Cloud Practitioner since 2010. (AWS, GCP, Azure)
Act in different roles, including People Management, Product Management, Project Management, Sales Consult, Solution Architect, Data Architect, Cloud Admin, System Admin, Network Admin, and Developer.
Serve different industries, such as Healthcare IT, AdTech (RTB), Bank, 3rd-party payment, Retail, Gaming, E-commerce, Telecom, Semiconductor, Bioinformatics, Academic Research, etc.
Work Experience
Senior Director, Head of Delivery, TDC
Innova Solutions
2024/05 ~ Now
Director of Engineering
2021/12 ~ 2024/04
Senior Software Engineering Manager
2018/07 ~ 2021/12
2018/07 ~ 2023/07
(3rd party workforce of) Change Healthcare
2023/08 ~ 2024/03
(contractor of) Optum, Unitedhealth Group
Roles:
- People Management: recruit and manage 9 Scrum teams, 84+ members
- Stakeholder Management: communicate with differemt roles across 4 organizations.
- Architect: support data architecture design and troubleshoot production issues.
Achievements:
- [AI POC] Hospital Price Transparency Chatbot with LangChain and GPT-3.5. 🖥️
- [Data] Build HL7 C-CDA and FHIRv4 Synthetic Test Data Generator with Synthea.
- [Culture] Build "Confluence Insight" to observe team development progress.
- [Scrum] Establish the "Asynchronous Communication Model"x for 5 teams.
- [Innovation] Submit 3 Invention Disclosure.
Soft skills used:
- Conflict Resolution
- Negociateion
- Facilitator
- Retrospective
- Distributed Team
Technologies used:
- AWS EMR
- AWS Glue
- AWS RDS (MySQL/PostgreSQL)
- AWS ECS
- AWS Lambda
- Apigee
- Apache Spark
- Scala
- Python
- Golang
- LangChain
- Chainlit
- OpenAI GPT-3.5/4
Data Architect
TenMax AdTech Lab Co. Ltd.
2016/04 ~ 2018/06
Roles:
- Cloud/Net/Sys Admin: manage on-premises data center infrastructure and automation deployment Ansible script to Azure & GCP.
- Architect: refine data pipeline. Reduce operational cost. Design and implement full-stack monitoring and notification architecture.
Achievements:
- [Data] migrate data pipeline to Kafka. Reduce 65% of Cassandra Cost.
- [DevOps] Enable Full Stack Monitoring and Notification with Prometheus.
Reduce MTTR from hours to 10 minutes. - [Law] Write GDPR documents for audience and publisher.
Technologies used:
- Azure
- GCP L7 loadbalancer
- Prometheus
- Grafana
- Docker
- Ansible
- Java
Assistant Vice President, Product Management
Etu Corporation, SYSTEX Group
2014/02 ~ 2016/03
Roles:
- Product Owner: define data platform roadmap and specification of Etu Manager 2.5 & 3 software appliance based on Cloudera Manager.
- Pre-sales & Solution Architect: team with Sales and BD to provide solutions.
- Architect & Evangelist: promoting ASF Big Data ecosystem in Taiwan.
Achievements:
- [Product] build and sell Etu Manager to 5 customers.
- [Profit] covers 50% annual revenue in 2015.
- [Innovation] receive 2 TW and US patents.
Technologies used:
- Cloudera Manager
- Cloudera CDH
- Apache Hadoop
- Apache Hive
- Apache HBase
- Apache Sqoop
- Apache Impala
- PostgreSQL
- Bootstrap
- Docker
- VirtualBox
- CentOS
- Puppet
Associate Researcher
National Center for
High-Performance Computing
2003/01 ~ 2014/02
Roles:
- Principal Investigator (PI): manage up to 4 teams, 25 members for government-funded projects.
- Project Manager: manage projects from university, ITRI, TWNIC, MiTAC (TRTC), etc.
- System Admin: maintain Hadoop Cluster, underwater ecology observation system (video streaming for coral reef) and Agriculture Grid.
- Developer: develop
hadoop4win
,drbl-hadoop
, etc.
Achievements:
- [Embedded] develop the
Wireless Service Unit (WSU)
of TRTC (Metro Taipei). - [Hadoop] build "Hadoop as a service", 4000+ registered users (2009/03 ~ 2014/04).
- [DRBL] 2008 Award for Outstanding Contributions in Science and Technology.
- [Innovation] receive 1 TW patent .
Technologies used:
- PC Cluster
- PXE Boot
- Apache Hadoop
- Xen
- KVM
- Grid Computing
- Google Map API
- jQuery
- PHP
- Access Grid
- Video Streaming
- Sensor Network
- Fast Roaming
- C/C++
- NSYS
- Shell Script
Skills & Tools
Cloud
-
AWS
-
GCP
-
Azure
Data
-
Hadoop
-
Spark
-
Kafka
NoSQL/SQL
-
HBase
-
Cassandra
-
MySQL
-
PostgreSQL
Language
-
Shell Script
-
Java / Scala
-
Python
-
PHP
-
C/C++
Others
- DevOps
- Git / SVN / CVS
- Selenium
- Linux Kernel Module
- Video Streaming
- Data Grid
- Distributed File System
- IEEE 1516
- Sensor Network
Education
-
MSc in Electrical and Control EngineeringNational Chiao-Tung University, Taiwan2000/09 ~ 2002/08
-
BSc in Electrical and Control EngineeringNational Chiao-Tung University, Taiwan1996/09 ~ 2000/08
Patents
-
TW I550418Issued: 2016-06-16METHOD, APPARATUS, AND APPLICATION SYSTEM FOR REAL-TIME PROCESSING THE DATA STREAMS
-
TW I530808Issued: 2016-03-08System and Method for Providing Instant Query
-
TW I307599Issued: 2009-03-11Design of underwater long-term monitoring device that can delay biological proliferation
Awards
-
2008 行政院科技貢獻獎2008 Award for Outstanding Contributions in Science and Technology.
-
2007DRBL won first place in the 'Public Sector Applications' category at the Free Software Contest in France.
Publications
-
2016/2013/2011Hadoop The Definitive Guide (Traditional Chinese Translation), 4e/3e/2eISBN: 9789864761364 (4e)ISBN: 9789862766682 (3e)ISBN: 9789862762967 (2e)
-
2014Hadoop Operations, 1e ( Tranditional Chinese Translation )ISBN: 9789862769973