A coding error caused Rogers outage that left millions without service
D. Yuan et al. (2014). Simple testing can prevent most critical failures: An analysis of production failures in distributed data-intensive systems. Proceedings of the 11th Symposium on Operating Systems Design and Implementation (OSDI). 249-265. https://www.eecg.utoronto.ca/~yuan/papers/failure_analysis_o...
Here is the link to the (heavily redacted) report from Rogers to the CRTC: https://crtc.gc.ca/public/otf/2022/c12_202203868/4215445.doc...