2022 Agenda Day 1

Nov 16, 2022
start (UTC) start (ET) IBM & Family
14:25 9:25 Day 1 – welcome
14:30 9:30 Ingo Averdunk, IBM SRE Profession Lead (SRE-240)
15:00 10:00 John Allspaw, Founder of Adaptive Capacity Lab and David Leigh, IBM DE (SRE-250)
15:30 10:30 Varun Bijlani, Global Managing Partner – Hybrid Cloud Transformation Services, IBM Consulting (SRE-260)
15:45 10:45 Coffee (15 mins)
Track 1 – Becoming Track 2 – Managing Track 3 – Implementing
16:00 11:00 SRE-090 SRE-141 SRE-159
Making of the SRE Omelette Embracing SRE as Development Manager Reliability Engineering for RISE with SAP
16:30 11:30 Q&A Q&A Q&A
16:45 11:45 SRE-123 SRE-152 SRE-166
What do SREs do when they are not firefighting? What are the right measure for a SRE Organization? Meeting the Auditor Requirements (full length)
17:15 12:15 Q&A Q&A Q&A
17:30 12:30 Lunch (45 mins)
18:15 13:15 Lightning Talks (Block 1)

SRE-083 – SREs – Learning from the failures/incidents (Be the Tony Stark of the pack)
SRE-148 – Strategies to reduce on-call fatigue
SRE-130 – How not to write a project profile for SRE certification
SRE-055 – An iterative and repeatable approach to getting started with SLI /SLO
SRE-082 – A Slackbot for silencing alerts
SRE-089 – Managing Monitoring Systems with GitOps and CI/CD Pipelines

18:45 13:45
Coffee (15 mins)
19:00 14:00 SRE-016 SRE-145 SRE-001
How to get started as a post-incident review facilitator The managed Openshift SRE approach Boundary security at scale
19:30 14:30 Q&A Q&A Q&A
19:45 14:45 SRE-160 SRE-110 SRE-054
SRE expansion – start by a differentiated learning model The Joy and Challenges Of Owning a Service with One of the Deepest Stacks on IBM Cloud Modernizing Service Management
20:15 15:15 Q&A Q&A Q&A
20:30 15:30 Coffee (30 mins)
21:00 16:00 SRE-109 SRE-013 SRE-101
Can I Trust Anomalies Detected from Unlabelled Data? SRE management

(PANEL)

Velos: Combining Passive and Active Monitoring For SLO Failure Diagnosis
21:30 16:30 Q&A Q&A
21:45 16:45 SRE-168 SRE-144 SRE-140
Becoming a Site Reliability Engineer

(PANEL)

Associate retention during The Great Resignation IBM Cloud Databases – Improving alert rates and system stability
22:15 17:15 Q&A Q&A
22:30 17:30 Lightning Talks (Block 2)

SRE-166 – Meeting the Auditor Requirements
SRE-115 – Astounded – how an IPMI firmware update automation caused an outage across Production regions
SRE-046 – FMEA as a tool to improve MTTD and MTTR KPI’s
SRE-007 – Monitoring Amazon EKS with Instana
SRE-074 – How we dealt with 1000+ alerts/day

22:55 17:55 Day 1 close
23:00 18:00 Virtual Beer Bash
00:00 19:00