Daniel Pascal Lamblin's Resume

Etsy Senior Software Engineer Experimentation Platform

Brooklyn

2022–Present

Extended self-service no-code custom experiment metrics for teams to track funnel metrics in experiment variants
Added API endpoints in PHP to support custom metric and event management from the command line and Python notebooks
Consolidated jobs with similar sources for a 45% run-time improvement & increased argument granularity to allow backfills without customizing jobs
Added data quality validations to new and old experimentation pipeline tables with Great Expectations
Developed self-repair pipeline steps for misnamed events & extended monitoring with on-call rotations

Coupang Senior Backend Software Engineer Clickstream Web Logging

Seattle

2020–2022

Developed and delivered Customer Experience Analytics Platform tool for self-service funnel, journey and trend analysis of web log data using Vue.js 2, Spark, and Scala
Mentored an intern's project to port and extend the UI on Vue.js 3 & hired the intern
Assisted Retail Delivery team's mission-critical multi-AZ expansion projects

Coupang Senior Data Engineer Data Platform Tools and Infra

Seattle

2019–2020

Migrated ~4000 Airflow ETL jobs from Airflow 1.8 to 1.10+, with custom tooling
Optimized teams' resource usage of EMR and Spark jobs
Split and upgraded Airflow into deployments by priority

Coupang Big Data Engineer Big Data Platform

Seoul

2017–2019

Onboarded teams onto Airflow and maintained EMR clusters for their jobs
Developed Data Platform Portal tool with cluster management and data discovery features
Scaled out Airflow to ~6000 DAGs with multiple deployments
Served in the on-call rotation for Airflow, Presto, Hive, Hue, Zeppelin, Spark, HDFS, ZooKeeper, etc.

Coupang Data Engineer Data Platform

Seattle

2016–2017

Migrated on-prem Netezza and HDFS Hive to EMR Hive on S3
Moved ETL from Oozie and Talend into Airflow
Monitored data readiness with on-call rotations

Insight Data Science Data Engineering Fellow

New York City

2015–2016

Built real-time and batch processing on the NYC MTA's GTFS stream with Kafka, Spark, HDFS, HBase, and S3
Generated user data at large scale to test the goal of notifying users of train delays
Ran the cluster on AWS EC2; project information and presentation at dlamblin.github.io/mta-delay-monitoring

Paragon Cloud Security Software Engineer

New York City

2015–2015

Paragon was interested in cloud storage of security video, and we got over 100 cameras storing footage on Azure.
The Axis cameras had embedded Linux and a cross-compilation chain for edge video-processing apps, to which we attempted to add scene/object recognition.

Spectre App Founding Software Engineer

New York City

2014–2015

Rapidly prototyped a photojournalism sourcing app for location-specific calls for photos, with photographer licensing & attribution

Google Software Engineer

New York City

2010–2014

Ported the legacy Studio product, a rich advertisement authoring and QA web app, to a Google Web Toolkit front end with a Stubby RPC backend and Megastore datastore
Developed a dashboard to track component usage data of ads by comparing HTML5 vs. Flash authoring, common formats, layouts, and generated impressions. Utilized cross-team APIs and internal versions of GFS, Cloud Dataflow, and Drill
Implemented critical preview features for monitoring ad unit interactions and compliance
Migrated user records and assets to support new multi-account users and unified asset library view
Reduced reprocessing and conserved storage of assets by fingerprinting uploads, both on individual files and within archives

Sigato Group Software Engineer

Remote

2009–2010

Engineered provider search functionality for New West Health with R-tree-based search, maps, and directions, as a Drupal module using PHP, MySQL, and Perl
Integrated single sign-on to converge features from New West Health and partners

Travelocity Senior Web Application Developer

New York City

2006–2009

Established features for IgoUgo.com using the ASP.NET 2.0 framework, C#, PrototypeJS
Introduced a Lucene-based content index, exposed as a WSDL service in Java and Spring, to offload database search, with autocomplete suggestions for key geo-entities
Boosted traffic tenfold through optimization of page structure, URLs, and image file names

Hairzone Inc Head Web Developer

Moonachie

2004–2006

Built and supported the multiple websites of four company brands with ASP

Richmond Research Inc Programmer

New York City

2003–2004

Automated reporting and developed web products using Perl, PHP, Visual Basic and ActionScript for clients including Wall Street Journal and Priceline

Mile NY Website Programmer

Fort Lee

2002–2003

Established MileNY's e-commerce site using Perl, CGI, and a MySQL database

EMC Corp Associate Software Engineer

Milford, MA

2000–2001

Extended EMC Data Manager Volume & Tape Library Manager's media duplication processes, targeting petabyte-capable systems like Sony PetaSite with multiple robots and drives
Improved and maintained EDM as a multi-process C-based system with Sun RPC, threading, and IPC signals & pipes; resolved a deadlock by refactoring the mutex hierarchy
