SRE Case Study: URL Distribution Issue Caused by an Application

Topics: Cloud, Coding Practices, Distributed Systems, Frontend Engineering, Java, Web Technology

One of the frequently asked questions from new site reliability engineers is: Where to begin when troubleshooting a problem in a cloud environment? I always tell them: You should begin with understanding the problem. Let me demonstrate the reasons and methods with a real troubleshooting case.