
John Allspaw is the godfather of resilience engineering in software, dating back to his introducing the term and practice of “blameless postmortems” in a 2012 Etsy blog post. Allspaw is a prolific speaker, but there has never been a full timeline for the decades-long “incident” of his public speaking career… until now. I am likely missing some talks, please send me any additions or errors. I’ve skipped videos under fifteen minutes in length and those behind paywalls.
- 2009-06-23 10+ Deploys Per Day: Dev and Ops Cooperation at Flickr (w/ Paul Hammond) from Velocity 2009
- 2011-06-28 Building Resilience in Web Development and Operations from USI 2011
- 2011-11-09 Anticipation: What Could Possibly Go Wrong? from Velocity EU 2011
- 2011-11-16 Outages, Post Mortems, and Human Error 101 from Etsy Tech Talk
- 2012-04-23 Interview from GOTO Chicago 2013
- 2012-06-26 Stronger and Faster (w/ Steve Souders) from Velocity 2012
- 2012-09-25 Interview with Jez Humble
- 2013-05-14 Owning Attention: Alert Design Considerations from Etsy Tech Talk
- 2013-11-13 AMA from Velocity EU 2013
- 2013-11-22 Fireside Chat with Andrew Clay Shafer
- 2014-01-29 An Evening with John Allspaw on Development and Deployment at Etsy from Data Council
- 2014-06-24 Interview from Velocity 2014
- 2014-06-24 PostMortem Facilitation: Theory and Practice of “New View” Debriefings Parts One, Two, Three, Four from Velocity 2014
- 2015-05-28 Seeing the Invisible: Discovering Operations Expertise from Velocity 2015
- 2016-05-25 Common Ground and Coordination in Joint Activity from Papers We Love
- 2017-11-15 How Your Systems Keep Running Day After Day: Resilience Engineering as DevOps from DOES 2017
- 2018-03-20 Poised To Adapt: Continuous Delivery’s Relationship To Resilience Engineering from PipelineConf 2018
- 2018-04-24 Taking Human Performance Seriously in Software from DevOpsDays Seattle 2018
- 2018-08-16 In the Center of the Cyclone: Finding Sources of Resilience from Redeploy 2018
- 2018-09-12 Interview from PagerDuty Summit 2018
- 2018-09-12 Incidents as we Imagine Them Versus How They Actually Are from PagerDuty Summit 2018
- 2018-10-15 Problem Detection from Papers We Love
- 2019-02-11 Video AMA from PagerDuty 2019
- 2019-06-03 Taking Human Performance Seriously In Software from Monitorama PDX 2019
- 2019-07-08 Resilience Engineering: The What and How from DevOpsDays DC 2019
Bonus: podcasts
- 2016-02-13 PAPod 57 – System Reliability – John Allspaw from PreAccident Investigation Podcast
- 2017-03-07 John Allspaw on System Failures: Preventing, Responding, and Learning From Failure from SE-Radio
- 2018-09-05 096: Resilience Engineering with John Allspaw from Greater Than Code