In complex service-oriented architectures, failure can happen in individual servers and containers, then cascade through your system. Good engineering takes into account possible failures. But how do you test whether a solution actually mitigates failures without risking the ire of your customers? That’s where chaos engineering comes in, injecting failures and uncertainty into complex systems so your team can see where your architecture breaks.
On this sponsored episode, our fourth in the series with Intuit, Ben and Ryan chat with Deepthi Panthula, Senior Product Manager, and Shan Anwar, Principal Software Engineer, both of Intuit about how use self-serve chaos engineering tools to control the blast radius of failures, how game day tests and drills keep their systems resilient, and how their investment in open-source software powers their program.
Episode notes:
Sometimes old practices work in new environments. The Intuit team uses Failure Mode Effect Analysis, (FMEA), a procedure developed by the US military in 1949, to ensure that their developers understand possible points of failure before code makes it to production.
The team uses Litmus Chaos to inject failures into their Kubernetes-based system and power their chaos engineering efforts. It’s open source and maintained by Intuit and others.
If you’ve been following this series, you’d know that Intuit is a big fan of open-source software. Special shout out to Argo Workflow, which makes their compute-intensive Kubernetes jobs work much smoother.
Connect on LinkedIn with Deepthi Panthula and Zeeshan (Shan) Anwar.
If you want to see what Stack Overflow users are saying about chaos engineering, check out
Chaos engineering best practice
, asked by
User NingLee
two years ago.
An open-source development paradigm
Would you board a plane safety-tested by GenAI?
How to train your dream machine
OverflowAI and the holy grail of search
Spreading the gospel of Python
Between hyper-focus and burnout: Developing with ADHD
Reshaping the future of API platforms
The reverse mullett model of software engineering
Net neutrality is in; TikTok and noncompetes are out
Supporting the world’s most-used database engine through 2050
Is GenAI the next dot-com bubble?
Why configuration is so complicated
If everyone is building AI, why aren't more projects in production?
How do you evaluate an LLM? Try an LLM.
Diverting more backdoor disasters
Climbing the GenAI decision tree
Want to be a great software engineer? Don’t be a jerk.
What a year building AI has taught Stack Overflow
Are long context windows the end of RAG?
Will antitrust suits benefit developers?
Create your
podcast in
minutes
It is Free
Insight Story: Tech Trends Unpacked
Zero-Shot
Fast Forward by Tomorrow Unlocked: Tech past, tech future
The Unbelivable Truth - Series 1 - 26 including specials and pilot
Acquired