Summary
Principal Data Engineer with 14+ years of experience designing and operating data platforms for both customer-facing products and large-scale analytics. Experienced in leading greenfield systems, driving architectural decisions, and working cross-functionally to deliver reliable, scalable data solutions in ambiguous environments.
Key Systems & Impact
Unify Data Platform (Living Security)
- Designed multi-tenant architecture for secure, scalable data processing
- Built serverless pipelines in AWS handling variable workloads and scaling constraints
- Selected and implemented core technologies (Postgres, Airflow, dbt)
- Reduced AWS costs by 5–10% through targeted optimizations
- Owned release process, including QA coordination and stakeholder communication
Distributed ETL Framework (WarnerMedia)
- Built Spark-based framework adopted across the team and used in 90% of pipelines
- Improved reliability and reduced time to build new pipelines
Experience
Principal Data Engineer, Living Security: 2021 — Present
- Tech lead for data and backend systems on Unify, a customer-facing cybersecurity product and primary revenue driver
- Designed and built data platform from scratch, enabling multi-tenant analytics and processing
- Collaborated with product managers, data scientists, and domain experts to define system requirements and data models
- Built and operated AWS serverless pipelines, addressing scaling limitations and cost challenges
- Solved multi-tenant data isolation challenges for secure customer data handling
- Improved performance of Postgres-backed systems powering user-facing features
- Identified and executed cost optimization strategies resulting in ~5–10% AWS cost reductions
- Owned release lifecycle: QA coordination, production readiness decisions, and release communication
- Led and mentored a team of 3 engineers through design guidance and code reviews
Technologies Used
- Orchestration: Apache Airflow
- Databases and query engines: AWS Athena, Postgres, DynamoDB
- Execution: AWS Lambda, Step Functions, Elastic Container Service
- Tools: dbt, SQLAlchemy, AWS CDK
Senior Data Engineer, Warner Bros.: 2017 — 2021
- Developed large-scale data warehouse supporting ~100 pipelines and tens of GBs of daily processing
- Built flexible Spark ETL framework used by ~10 engineers across majority of pipelines
- Drove adoption of Airflow across multiple teams, establishing best practices and reusable components
- Owned data quality and monitoring systems including dashboards, alerting, and incident response
- Reduced time to resolution for pipeline failures and improved trust in analytical data
- Collaborated with analysts and data scientists to align data systems with business needs
- Mentored junior engineers through hands-on support and design discussions
Technologies Used
- Snowflake
- Redshift
- Python
- Apache Airflow
- AWS: S3, EMR, Lambda, Redshift
Data Engineer, Leaf Group: 2017
- Created Python API integrations for third-party data sources
- Designed and maintained aggregate tables to speed up reporting queries
- Managed ETL jobs that powered reports sent to upper management
Technologies Used
Data Engineer, Loot Crate: 2015 — 2017
- Created Python ETL workflows for ingesting data into Redshift data warehouse
- Built unified Python library for ETL tasks
- Worked with data architect to implement data lake on Google Cloud Platform
- Provided documentation and data modeling insights to data analysts
Technologies Used
- BigQuery
- Redshift
- Python
- Apache Airflow
- Apache NiFi
- GCP: GCS, DataProc, BigQuery
- AWS: S3, Lambda
Developer, Creators: 2012 — 2015
- Managed and expanded large PHP-based website
- Developed a comprehensive content management system for all of the company's web properties
- Designed and implemented client-facing delivery methods
- Designed and implemented a range of new business websites
- Managed a mid-sized computer network consisting mostly of iMacs and Windows PCs
- Maintained company's web servers, including dedicated and virtual machines
Technologies Used
- PHP
- MySQL
- Python
- AWS: S3, EC2, RDS
Computer Skills
Tools
- Airflow
- NiFi
- Tableau
- Redash
Database Engines
- Snowflake
- PostgreSQL/Redshift
- BigQuery
- MySQL
Big Data
- Spark (PySpark)
- Data Lakes: S3, GCS
- Streaming: Segment, Kinesis
- Databricks
Operations
- Git: GitHub, GitLab, Bitbucket
- CI/CD: Jenkins, GitLab, GitHub Actions
- Ansible
- Datadog
Amazon Web Services
- Database: RDS, DynamoDB, Redshift, Athena
- Execution: EC2, EMR, Lambda, Step Functions
- Containers: ECS, ECR, Fargate
Web Development
- Backend: PHP, Python (Flask)
- Frontend: HTML, JS, CSS
- REST APIs
- Hugo and Jekyll
Education
Bachelor of Science, Computer Science, Cum Laude
California State University Long Beach