DevOps Classroomnotes 12/Sep/2023

Monitoring

An System (Hospital Information System)

  • This is the system used to maintain a Multi-Speciality Hospital with different branches
  • Architecture
  • Lets us assume our job in system is figure out issues and respond in the cases of failures.
  • To solve this we have to use two approaches
    • Proactive
    • Reactive
  • Metrics:
    • MTTF (Mean Time To Failure): Average Time which states time taken by your system to fail. This should be high
    • MTTR (Mean Time To Recover): Average Time which states time taken by your team to recover from failure. This should be less.

Expectation

  • We need to have a monitoring system so that our objective MTTF is HIGH and MTTR is low can be acheived.

Principles

  • Single Point of Failure (SPOF): An component or server which alone is responsible for doing something. This is generally solved by redundancy or replication
  • Fault Tolerance: Ability fo system to deal with faults is called as Fault Tolerance.

Ways of Monitoring

  • System Monitoring:
    • This at a very simple level is to check if the application/server is up or down (Heart Beat)
Published
Categorized as Uncategorized Tagged

By continuous learner

devops & cloud enthusiastic learner

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Please turn AdBlock off
Customized Social Media Icons from Acurax Digital Marketing Agency

Discover more from Direct DevOps from Quality Thought

Subscribe now to keep reading and get access to the full archive.

Continue reading

Exit mobile version
%%footer%%