Experience

Senior Data Engineer, Warner Bros.: 2017 — Present

As Senior Data Engineer: 2020 — Present

  • Served as engineering lead on a team of several data engineers.
  • Owned data quality and job monitoring across nearly 100 pipelines.
  • Assisted with compliance issues and cost-savings measures.

As Big Data Engineer: 2017 — 2020

  • Built Spark-based reusable ETL framework to dynamically load input files to various outputs.
  • Evangelized, implemented and supported Airflow for job scheduling across the department.
  • Migrated ETL workflows from permanent HDP cluster to ephemeral EMR clusters.
  • Created API integrations for various data sources in Python.

Technologies Used

  • Snowflake
  • Redshift
  • Python
  • Apache Airflow
  • AWS: S3, EMR, Lambda, Redshift

Data Engineer, Leaf Group: 2017

  • Created Python API integrations for third-party data sources.
  • Designed and maintained aggregate tables to speed up reporting queries.
  • Managed ETL jobs that powered reports sent to upper management.

Technologies Used

  • Redshift
  • Python

Data Engineer, Loot Crate: 2015 — 2017

  • Created Python ETL workflows for ingesting data into Redshift data warehouse.
  • Built unified Python library for ETL tasks.
  • Worked with data architect to implement data lake on Google Cloud Platform.
  • Provided documentation and data modeling insights to data analysts.

Technologies Used

  • BigQuery
  • Redshift
  • Python
  • Apache Airflow
  • Apache NiFi
  • GCP: GCS, Dataproc, BigQuery
  • AWS: S3, Lambda

Developer, Creators: 2012 — 2015

  • Managed and expanded large PHP-based website.
  • Developed a comprehensive content management system for all of the company's web properties.
  • Designed and implemented client-facing delivery methods.
  • Designed and implemented a range of new business websites.
  • Managed a mid-sized computer network, composed mostly of iMacs and Windows PCs.
  • Maintained company's web servers, including dedicated and virtual machines.

Technologies Used

  • PHP
  • MySQL
  • Python
  • AWS: S3, EC2, RDS

Computer Skills

Tools

  • Airflow
  • NiFi
  • Tableau
  • Redash

Database Engines

  • Snowflake
  • PostgreSQL/Redshift
  • BigQuery
  • MySQL

Big Data

  • Spark (PySpark)
  • Data Lakes: S3, GCS
  • Streaming: Segment, Kinesis
  • Databricks

Operations

  • Git: GitHub, GitLab, Bitbucket
  • CI/CD: Jenkins, GitLab Pipelines
  • Ansible
  • Datadog

Amazon Web Services

  • Database: RDS, DynamoDB, Redshift
  • Execution: EC2, EMR, Lambda
  • S3
  • IAM

Web Development

  • Backend: PHP, Python (Flask)
  • Frontend: HTML, JS, CSS
  • REST APIs
  • Hugo and Jekyll

Education

Bachelor of Science, Computer Science, Cum Laude

California State University Long Beach
Spring, 2012
Overall GPA: 3.55
Major GPA: 3.74