Applying for Data Scientist, Product Analytics at Etsy

Analytics that
explains the why
behind behavior.

I design experiments, surface behavioral signals, and build the analytical systems that help product teams understand what is actually happening and what to do next.

By the numbers
Enterprise clients supported 50+
A/B experiments run 30+
Campaign ROI improvement 22%
Churn reduction 18%
Years in analytics 3+
Case Studies
Work that shaped
product decisions.
Vue.ai — Client & Product Analytics
30+ A/B Experiments Across Pricing, Promotions, and Personalization
Experimentation

Enterprise retail clients including Adidas, Zara, Dune London, and Crocs needed to know which recommendation strategies, pricing configurations, and campaign setups were actually moving revenue, not just engagement metrics. There was no structured experimentation program to answer that reliably.

22% Campaign ROI lift
18% Churn reduction
60% Engagement lift
Built and ran the A/B testing program end to end from hypothesis design and metric selection through significance testing and stakeholder readout. Findings went directly to commercial leadership at Adidas and Zara and changed how those teams allocated campaign spend across recommendation channels.
A/B TestingPythonSQLStatistical AnalysisCausal Inference
Vue.ai — Data Quality & Root Cause Analysis
Catching a Silent Pipeline Failure Before It Reached Clients
Anomaly Detection

A major retail client's recommendation click-through rate dropped noticeably over two weeks. The pipeline showed green. No errors flagged. The data looked structurally clean but the commercial signal was wrong and nobody upstream had noticed.

<15m Time to resolution
2 Future issues caught early
0 Client-facing errors
An upstream schema change had reclassified user behavior events under a new event type our ingestion layer did not recognize. The pipeline was silently discarding the behavioral signal the recommendation model depended on. I traced the root cause, fixed the ingestion logic, backfilled the missing data, and built a statistical validation layer monitoring event type distributions against a 7-day rolling baseline.
SQLPythonData QualityStatistical MonitoringRoot Cause Analysis
Vue.ai — Analytics Engineering
SQL Metrics Layer Standardizing KPIs Across 50+ Enterprise Accounts
Data Governance

Commercial teams at Adidas, Zara, Dune London, and Crocs were pulling numbers from their own dashboards and arriving at different answers to the same questions. There was no single source of truth for engagement, conversion, or pipeline KPIs across the portfolio.

20+ KPIs standardized
50+ Accounts on one layer
0 Reporting discrepancies
Designed and maintained a SQL metrics layer that standardized KPI definitions and query logic across all client accounts. Once teams pulled from the same layer, reporting inconsistencies disappeared and commercial conversations became faster, cleaner, and credible to senior leadership.
SQLdbtSnowflakeDimensional ModelingKPI Design
WPTI — Analytics Engineering
Behavioral Dashboards That Improved Program Retention by 25%
Product Analytics

Program leaders were receiving delayed, manually exported reports. By the time data reached them, the window to intervene with at-risk participants had already closed. The analysis arrived after the problem had become permanent.

25% Retention improvement
40% Faster reporting
120h Saved monthly
Built Tableau dashboards tracking cohort behavior and engagement trends across 20K+ participant records, replacing a manual export process with a structured ELT pipeline into Redshift. Leaders could now see early risk signals in real time and act before participants dropped out rather than after.
TableauSQLdbtAirflowAmazon Redshift
Projects
Built to solve
real problems.
Personal Project — Data Engineering
Real-Time Crypto Analytics Pipeline on AWS
Streaming Architecture

Built a production-grade streaming pipeline ingesting live Coinbase market data, processing it in real time using Kafka and Spark, and loading analytics-ready tables into AWS for downstream reporting and trend analysis. Demonstrates end-to-end ownership of a high-volume data system from ingestion through analytical output.

Pipeline Architecture
Coinbase APILive data
KafkaEvent stream
SparkProcessing
AWS S3Data lake
AirflowOrchestration
AnalyticsReporting
Apache KafkaApache SparkAWS S3EC2AirflowPython
Personal Project — Applied AI
Governed AI Analytics Agent
LLM + Analytics

Non-technical stakeholders needed a way to query live data in plain English without opening a ticket or learning SQL. The challenge was making it safe: no write access, no schema hallucinations, no unvalidated results reaching the user.

100% Read-only enforced
0 Schema hallucinations
Live Production queries
The agent translates natural language into validated SQL, enforces SELECT-only policies, grounds every query against an allowed schema, and shows transparent query evidence before returning results. Built with OpenAI API, Guardrails, and custom orchestration logic. Directly relevant to Etsy's requirement for experience with ML and GenAI-assisted workflows.
PythonOpenAI APISQLGuardrailsLLM Orchestration
Technical Stack
Tools I work
with every day.
Query & Analysis
SQLPythonRPySparkPandasscikit-learn
BI & Visualization
TableauLooker StudioMetabasePower BI
Data Infrastructure
BigQuerySnowflakeRedshiftdbtAirflowKafkaSpark
AI & Automation
OpenAI APILangChainGuardrailsPrompt EngineeringAWS

Ready to talk about
the Etsy Ads team.

Excited about building analytics that helps creative entrepreneurs grow their businesses.