Do you know about the Netflix Simian Army?

When we talk about microservices, cloud infrastructure, and building infrastructure that is resilient, Netflix is one of the first companies that comes to mind. Netflix constantly works on making its infrastructure more resilient and keeping it available, because it has to serve a lot of traffic. In this article we will talk about the Netflix Simian Army.

[Image: Netflix Simian Army. Taken from https://devops.com/wp-content/uploads/2014/03/simian-army.jpg]

 

Here I will be talking about the Netflix Simian Army, a set of tools that makes Netflix's infrastructure more secure and less prone to unintended errors such as hardware failure. This article covers three members of the Simian Army: Chaos Monkey, Janitor Monkey and Conformity Monkey.


Let's talk about what each of them does.

Chaos Monkey

Chaos Monkey is a service which identifies groups of systems and randomly terminates one of the systems in a group. The service operates at a controlled time (does not run on weekends and holidays) and interval (only operates during business hours). In most cases we have designed our applications to continue working when a peer goes offline, but in those special cases we want to make sure there are people around to resolve and learn from any problems. With this in mind Chaos Monkey only runs in business hours with the intent that engineers will be alert and able to respond.
These words come from the Netflix GitHub definition.

So in short, Chaos Monkey deliberately causes chaos to test how resilient the infrastructure is.
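
To make the idea concrete, here is a minimal sketch in Python of the behaviour described above: pick one random instance per group, but only on non-holiday weekdays during business hours. This is not the Simian Army code or its API. The names is_allowed and pick_victims, the 09:00 to 14:59 window, and the instance ids are all assumptions made for illustration, and the sketch only selects instances, it does not terminate anything.

```python
import random
from datetime import datetime

# Hypothetical values: the real service reads its schedule from configuration.
BUSINESS_HOURS = range(9, 15)  # assumed window, 09:00 to 14:59


def is_allowed(now, holidays=frozenset()):
    """Return True only on non-holiday weekdays inside business hours."""
    is_weekday = now.weekday() < 5  # Monday=0 ... Friday=4
    return is_weekday and now.date() not in holidays and now.hour in BUSINESS_HOURS


def pick_victims(groups, now, rng, holidays=frozenset()):
    """Pick one random instance per group, or nothing outside the allowed window."""
    if not is_allowed(now, holidays):
        return {}
    return {name: rng.choice(members) for name, members in groups.items() if members}


# Made-up groups and instance ids.
groups = {"api": ["i-a1", "i-a2", "i-a3"], "web": ["i-w1", "i-w2"]}
monday_10am = datetime(2024, 1, 8, 10, 0)
saturday_10am = datetime(2024, 1, 6, 10, 0)

victims = pick_victims(groups, monday_10am, random.Random(0))
weekend_victims = pick_victims(groups, saturday_10am, random.Random(0))
print(victims)          # one instance id per group
print(weekend_victims)  # empty: nothing is picked on a weekend
```

Passing the random generator and the current time in as arguments keeps the selection easy to test, which matters for a tool whose whole job is to break things on purpose.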

 

Janitor Monkey

Janitor Monkey determines whether a resource should be a cleanup candidate by applying a set of rules on it. If any of the rules determines that the resource is a cleanup candidate, Janitor Monkey marks the resource and schedules a time to clean it up. The design of Janitor Monkey also makes it simple to customize the set of rules or to add new ones.
In short, it is a kind of garbage collector for your AWS resources.
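
The rule-based marking described above can be sketched roughly as follows in Python. This is only an illustration, not Janitor Monkey's actual implementation: the Resource fields, the two rules (unattached and stale), the 30-day threshold and the 3-day grace period are all assumptions.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Optional


@dataclass
class Resource:
    # Hypothetical fields chosen for this example.
    resource_id: str
    attached: bool
    last_used: date
    cleanup_on: Optional[date] = None  # set once the resource is marked


def unattached(resource, today):
    """Rule: a resource that is attached to nothing is a cleanup candidate."""
    return not resource.attached


def stale(resource, today):
    """Rule: a resource unused for more than 30 days is a cleanup candidate."""
    return (today - resource.last_used).days > 30


def mark_candidates(resources, rules, today, grace_days=3):
    """Mark every resource that matches any rule and schedule its cleanup date."""
    marked = []
    for resource in resources:
        if any(rule(resource, today) for rule in rules):
            resource.cleanup_on = today + timedelta(days=grace_days)
            marked.append(resource)
    return marked


today = date(2024, 1, 31)
resources = [
    Resource("vol-1", attached=True, last_used=date(2024, 1, 30)),
    Resource("vol-2", attached=False, last_used=date(2024, 1, 30)),
    Resource("vol-3", attached=True, last_used=date(2023, 12, 1)),
]
marked = mark_candidates(resources, [unattached, stale], today)
for resource in marked:
    print(resource.resource_id, "scheduled for cleanup on", resource.cleanup_on)
```

Because each rule is just a function, adding a new rule means writing one more function and appending it to the list, which mirrors the point about the rule set being simple to customize.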

 

Conformity Monkey

Conformity Monkey determines whether an instance is nonconforming by applying a set of rules on it. If any of the rules determines that the instance is not conforming, the monkey sends an email notification to the owner of the instance.

It keeps the infrastructure consistent and secure by making every instance follow a defined set of rules.
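
A rough Python sketch of that check is shown below. It is an illustration under stated assumptions, not Conformity Monkey's real code: the Instance fields, the two rule names and the message format are made up, and instead of sending email the sketch only builds the notifications that would be sent.

```python
from dataclasses import dataclass, field


@dataclass
class Instance:
    # Hypothetical fields chosen for this example.
    instance_id: str
    owner_email: str
    security_groups: list = field(default_factory=list)
    in_load_balancer: bool = False


# Each rule returns True when the instance conforms. Both rules are assumptions.
RULES = {
    "has_security_group": lambda inst: len(inst.security_groups) > 0,
    "in_load_balancer": lambda inst: inst.in_load_balancer,
}


def failed_rules(instance, rules):
    """Return the names of all rules the instance does not satisfy."""
    return [name for name, conforms in rules.items() if not conforms(instance)]


def build_notifications(instances, rules):
    """Return (owner_email, message) pairs for every nonconforming instance."""
    notifications = []
    for inst in instances:
        failures = failed_rules(inst, rules)
        if failures:
            message = f"{inst.instance_id} is nonconforming: {', '.join(failures)}"
            notifications.append((inst.owner_email, message))
    return notifications


instances = [
    Instance("i-1", "alice@example.com", ["web"], in_load_balancer=True),
    Instance("i-2", "bob@example.com"),
]
notifications = build_notifications(instances, RULES)
for owner, message in notifications:
    print(owner, "->", message)
```

In a real setup the list of pairs would be handed to a mail service. Keeping the check separate from the delivery makes the rules easy to test on their own.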

 

So this is the Netflix Simian Army. This is how Netflix makes its infrastructure more secure, resilient and clean, and also saves money.
Keep following the Netflix blog for more of their infrastructure setup and solutions.

 


Gaurav Yadav

Gaurav is a cloud infrastructure engineer, full stack web developer and blogger. A sportsperson at heart, he loves football. He loves working on scale and is always keen to learn new tech. He is experienced with CI/CD, distributed cloud infrastructure, build systems and a lot of SRE work.
