Hi, my name is Sean

I’m a data scientist working on improving people’s health and driving behavior change. I love asking interesting questions and using math to find impactful solutions. Most of my days are spent wrangling, exploring, and modeling data in Python. The tools I use range from deep learning to causal inference.

Experience




Apple

Senior Data Scientist

Dec 2019 - Current

  • Health initiatives R&D, working with Apple Watch, phone, claims data, and medical surveys
  • Drove product improvements using observational causal analysis techniques and experimentation
  • Modeled behavior change and the impact of programs on health / wellbeing



Microsoft

ML Engineer (Intern)

May 2019 - Aug 2019

  • Extended the Azure SDK to incorporate open-source MLflow projects into pipelines
  • Developed solutions for the ML development lifecycle (development, production, monitoring)
  • Built a HoloLens 2 NLP virtual assistant at a hackathon



Disney

Data Scientist

Jan 2018 - May 2018

  • Led research and development of a recommender system for the Disney World app's in-park experience
  • Integrated big data and real-time processing tools such as Hadoop, Spark, Kafka, and NiFi
  • Created live dashboards to monitor and visualize large data pipelines and algorithms



Loveland Innovations

ML Engineer

Feb 2017 - Jan 2018

  • Used drone imaging to construct 3D models of buildings
  • Built production algorithms to segment 3D models, detect damage, and identify roof features
  • Applied and scaled machine learning research to industry

Education




Brigham Young University

MS Computer Science, BS Applied Math

Dec 2017 - Nov 2019

  • National Merit Scholar
  • 2015 & 2017 Distinguished Undergraduate in Mathematics
  • Data science course instructor in South Africa and Portugal
  • MIT Lincoln Labs

Research




Researched machine learning techniques in healthcare for survival analysis, disease prediction, and cost analysis. Applied new methods for training gradient-boosted trees and recurrent neural networks to large healthcare datasets. Also developed high-dimensional disease embeddings using techniques from natural language processing.

Skills

Python

Spark

Deep Learning

ML

Stats

Big Data

Data Engineering

Bayesian Inference

SQL

Computer Vision

Data Viz

Jupyter

Linux

JS

Web Dev