Color Genomics, 2015 – present
Engineering lead; Head of engineering
Grew team to >30 software engineers and data scientists. Led bioinformatics and product teams to implement new genetic tests and health areas, physician ordering, insurance billing, dynamic interactive reports, external facing analytics suite, and more. Led infrastructure and ops teams to deliver CI/CD, development toolchains, and >99.9% availability.
Led data platform and data science teams. Built internal analytics stack and multi-billion-row data warehouse that does GWAS studies and polygenic scores in minutes. Created Color Data (blog post), a large interactive public research dataset.
Implemented much of the bioinformatics pipeline. Sped up development cycle from 12-24h to ~2m.
Quip, 2013 – 2015
Google, 2003 – 2013
Senior Staff Engineer
Co-founded Google App Engine, which eventually expanded into Google Cloud. Led engineering for the datastore, a large scale, globally multi-homed NoSQL database that stores >1T rows, >1EB of data, and serves >3Mqps.
Designed, built, and led company-wide ML infrastructure for personalization that served 500kqps at peak.
Co-founded Google’s transaction processing and billing platform and led its storage engineering team.
Stanford University, 1998 – 2003
MS Computer Science
BS Computer Science, Mathematics minor
Selected publications and talks
- Preprint: Mark J. Berger, et al. Color Data v2: a user-friendly, open-access database with hereditary cancer and hereditary cardiovascular conditions datasets.
- Barrett, Neben, et al. A scalable, aggregated genotypic-phenotypic database for human disease variation. Database: The Journal of Biological Databases and Curation, Volume 2019.
- Jamieson, Jack. Bridging the Open Web and APIs: Alternative Social Media Alongside the Corporate Web (more). 10th International Conference on Social Media & Society, July 19-21 2019. Academic study of my open source project Bridgy. Paper, poster.
- Wilkerson, Barrett, et al. Distributed Transactions for Google App Engine: Optimistic Distributed Transactions built upon Local Multi-Version Concurrency Control. Computing Research Repository, June 2011, arXiv:1106.3325, blog post.
- Transactions Across Datacenters. Special Lecture Series in Computer Science, University of San Francisco, Feb. 12 2009; Google I/O, May 27 2009. Citations, slides, video, review, book excerpt, related blog post.
- Under the Covers of the Google App Engine Datastore. Google I/O, 2008. Citations, slides, video.
- 9811552 (USPTO): Detecting and bucketing sparse indicators for communication generation
- 9785792 (USPTO): Systems and methods for processing requests for genetic data based on client permission data
- 9811439 (USPTO): Functional testing of code modifications for read processing systems
- 9817866 (USPTO): Lossless compression of client read data
- 9813467 (USPTO): Real-time alignment and processing of incomplete stream of data
- 9811391 (USPTO): Load balancing and conflict processing in workflow with task dependencies
- 9773031 (USPTO): Duplication and deletion detection using transformation processing of depth vectors
- 9678794, 9811438 (USPTO): Techniques for processing queries relating to task-completion times or cross-data-structure interactions
- 9584882, 9774508 (USPTO): Communication generation using sparse indicators and sensor data
- 8005950 (USPTO): Application server scalability through runtime restrictions enforcement in a distributed application execution system
- 7877482 (USPTO): Efficient application hosting in a distributed application execution system
- Eagle Scout, Boy Scout Troop 120
- Black Belt, 1st dan, Hapkido Karate, Jang Moo Won
- Classical piano under Geraldine Keeling, multiple competitions and concerts