Event-based self-healing monitoring / Кирилл Сотников (Cognician)

Page 1

Self-healed* (*Russian: "Самопочин", roughly "self-repair")

Page 2

• https://github.com/cognician

Page 3

Standard pipeline

Page 4
Page 5

The beginning

Page 6

Normal state

Page 7

Failover

Page 8

• Reliable

• Flexible

• Simple

• Does not use existing resources

• Supports many languages

• Cheap (relatively)

• Event-based

AWS Lambda
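
The slide names AWS Lambda as the event-based building block but does not show any code. As a minimal sketch only, and not the speaker's implementation, the handler below assumes that CloudWatch alarms are delivered to the function through SNS and that alarm names start with a prefix identifying the scenario. The prefixes `disk-full` and `service-down`, the helper functions, and the returned strings are all hypothetical placeholders.

```python
import json


# Hypothetical remediation actions; real ones would call AWS APIs or run commands.
def free_disk_space(alarm):
    return f"cleanup requested for {alarm['AlarmName']}"


def restart_service(alarm):
    return f"restart requested for {alarm['AlarmName']}"


# Assumed convention: the alarm name prefix selects the remediation.
REMEDIATIONS = {
    "disk-full": free_disk_space,
    "service-down": restart_service,
}


def handler(event, context):
    """Lambda entry point: react to SNS-delivered CloudWatch alarm notifications."""
    results = []
    for record in event.get("Records", []):
        # SNS wraps the CloudWatch alarm notification as a JSON string.
        alarm = json.loads(record["Sns"]["Message"])
        if alarm.get("NewStateValue") != "ALARM":
            continue  # ignore OK / INSUFFICIENT_DATA transitions
        for prefix, action in REMEDIATIONS.items():
            if alarm["AlarmName"].startswith(prefix):
                results.append(action(alarm))
                break
        else:
            # No automated fix is known, so this alert still needs a human.
            results.append(f"no remediation for {alarm['AlarmName']}; escalate")
    return results
```

Routing only the unmatched alarms to people is one way such a pipeline could reduce the number of alerts that reach an on-call engineer.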

Page 9

Solution

Page 10
Page 11

Alert Fatigue

Page 12

Improved pipeline

Page 13

Scenario: out of disk space

Page 14

General scheme

Page 15

Solution 1

Page 16

Solution 2

Page 17

Scenario: service unavailable

Page 18

General scheme

Page 19

Solution 1

Page 20

Solution 2

Page 21

Result

Page 22

Scenario: increased load

Page 23

General scheme

Page 24

Solution 1

Page 25

Solution 2

Page 26

transmogrifier

Page 27

Scenario: an exception was caught

Page 28

General scheme

Page 29

Solution 1

Page 30

Solution 2

Page 31

Scenario: region failure

Page 32

All is lost

Page 33

Solution 1

Page 34

Solution 2

Page 35

Working with social media

Page 36

[email protected]

• https://github.com/lowl4tency

• hangops.slack.com #hangops_ru

• https://twitter.com/yellow_madops

Thank you ^_^