Work with incidents, improving incident response and technical debt value. Backend United 4 Mitap Materials: Okroshka

Hello! This is a post-report from the Backend United meeting, our series of thematic meetings for server-side developers. This time we talked a lot about working with incidents, discussed how to build our system to improve the incident response and were convinced of the value of the technical debt.







Come under the cat if you are interested in these topics. Inside you will find meeting materials: video recordings of presentations, speaker presentations, guest reviews of the meeting and links to the photo report.













Reports



Simple tools to improve incident response: Tutu experience. Andrey Borzov (Tutu.ru)



Andrey told how Tutu made life easier during incidents with the help of simple technical solutions. They got a customizable system for teams, which makes diagnostics important for them closer, alerts from different systems are more useful, and their routing is easier.









โ†’ Presentation







Listener reviews



  • โ€œIt's interesting to hear about the application of similar technologies used in our company.โ€
  • โ€œThe team is well done. But now sawing your bikes is not very productive for a company whose bike is not a business product. Because of this, I do not consider this solution as a model for implementation, but many of the problems that have been voiced are taken into account. Thank you Useful. "




Work with Production Explosions: detection, loss estimation, incident management. Dmitry Khimion (Avito)



Dmitry spoke about how the practice of incident management is arranged in Avito, and what research and automation we use in our work.









โ†’ Presentation







Listener reviews



  • "A lot of notes, class."
  • โ€œInteresting and structured.โ€




AutoLSR - Automated data collection for significant incidents. Vladimir Kolobaev (Avito)



We collected all the secret knowledge, failure scenarios of various systems and services and transferred all this into code for the purpose of automated detection and initial analysis of significant incidents. About this - the report of Vladimir.









โ†’ Presentation







Listener reviews



  • โ€œAn interesting and useful report.โ€
  • โ€œEasy and interesting report.โ€




We broke it now, but we will fix it later. Tech debt and its value. Boris Kaiser (Ozon)



Boris said that they and the team are doing to control everything that breaks down and is quickly repaired, how they help the development not to forget about these promises, and how they provide the business with complete and understandable information about what happened, how it was repaired and what will be done. so that the situation does not happen again.









โ†’ Presentation







Listener reviews



  • "Opsgini :) I love to learn about new technologies."
  • โ€œIn my opinion, this is the best report. Competently, interesting and practical. He endured a lot for himself. โ€




A few pictures



Click to see pictures.





















































































We posted all the photos on Facebook and VK . See how it was and celebrate yourself and friends if you were at the meeting.







References



A playlist with all the videos from the mitap can be found on our YouTube channel .







We publish all new events for developers, first of all, on our Tympad . Subscribe in order not to miss.







See you soon!








All Articles