Chaos Engineering for Resilient Systems Virtual Internship
In this advanced virtual internship, students will learn to build and test resilient, fault-tolerant systems by introducing controlled failures and monitoring the impact on application behavior. Through hands-on projects, students will gain expertise in chaos engineering principles, techniques, and tools to create highly available and scalable applications. They will learn to identify and mitigate potential failure points, improve system reliability, and ensure seamless user experiences even during unexpected outages or incidents.
Track Overview
Tasks & Milestones
Chaos Engineering Principles and Practices
AdvancedIn this task, students will explore the key principles and best practices of chaos engineering, including the Chaos Engineering Axioms and the Principles of Chaos Engineering.
Chaos Engineering Tool Evaluation and Selection
AdvancedIn this task, students will research and evaluate different chaos engineering tools, and select the most appropriate ones for their use case.
Designing and Executing Chaos Experiments
AdvancedIn this task, students will learn to design and execute chaos experiments to test the resilience of their systems.
Chaos Engineering in CI/CD with Jenkins
AdvancedIn this task, students will learn to integrate chaos engineering experiments into a Jenkins-based CI/CD pipeline.
Chaos Engineering in Production with Gremlin
AdvancedIn this task, students will learn to use Gremlin, a chaos engineering tool, to run experiments in a production environment.
Prerequisites
- • Proficient in at least one programming language (e.g., Python, Go, Java)
- • Experience with containerization and container orchestration (Docker, Kubernetes)
- • Familiarity with infrastructure as code (Terraform, Ansible)
Certificate
Certificate of Completion
Earn a certificate upon successful completion