Resume - Tony Zhang
911tonyzhang@gmail.com
Skills
Programming languages: Python, Java, Scala
Storage: S3/GCS, Redshift/Snowflake/BigQuery, BigTable/Dynamo, RDS/CloudSQL
Compute: Spark, EMR, K8s, Lambda
Airflow expert; contributor to the Airflow Jenkins and BigQuery operators
Topics of interest: readability, functional programming, Airflow, serverless, DataOps
Work
Senior Staff Platform Engineer @ Foursquare, Dec 21-
Built Foursquare's data stack on EMR Serverless, saving millions in compute cost per year.
Drive Delta format adoption at Foursquare, along with optimization techniques such as Bloom filters and liquid clustering.
Drive adoption of Streamlit within the data org.
Built Foursquare's data governance stack, which handles petabytes of data in S3 with proper data skipping.
Rebuilt a data product for offline conversion in 3 months that contributes 10MM of revenue for Foursquare. Led cross-functional collaboration from MVP to launch.
Maintain AWS and Databricks vendor relations; drive efficiency through long-term savings plans.
Manage a team of 10+ data engineers, dozens of Spark jobs, hundreds of DAGs, and petabyte-scale data pipelines for adtech applications.
Foster a strong engineering culture around delivery, productivity (CI/CD), best practices (readability, test-driven development), and support (pair programming, design review).
Design cross-cloud architecture and manage complex pipeline migrations from the Hadoop stack.
Lead infrastructure design and tooling, including EMR, BigQuery, Airflow/Composer.
Education
M.S. - Engineering, University of Virginia, Aug 2018
B.S. - Engineering, University of Virginia, May 2016