You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+47-2Lines changed: 47 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -67,6 +67,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
67
67
### Blog Posts
68
68
69
69
*[Why Are the Top Internet Companies Choosing SRE over Traditional O&M?](https://www.alibabacloud.com/blog/why-are-the-top-internet-companies-choosing-sre-over-traditional-o%26m_596099)
70
+
*[Architecture and Practices of Bilibili's Real-time Platform](https://www.alibabacloud.com/blog/architecture-and-practices-of-bilibilis-real-time-platform_596676)
70
71
71
72
</details>
72
73
@@ -125,6 +126,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
125
126
126
127
*[Anomaly Detection on Golden Signals](https://www.usenix.org/conference/srecon19asia/presentation/chen-yu)
127
128
*[NetRadar: Monitoring the Datacenter Network](https://www.usenix.org/conference/srecon19asia/presentation/chen-yun)
129
+
*[Let the Chaos Begin—SRE Chaos Engineering Meets Cybersecurity](https://www.youtube.com/watch?v=x3c0PPkSf14)
128
130
129
131
</details>
130
132
@@ -464,6 +466,18 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
*[Kubernetes - A Practical Introduction for Application Developers](https://www.godaddy.com/engineering/2018/05/02/kubernetes-introduction-for-developers/)
477
+
*[An Intuitive Node.js Client for the Kubernetes API](https://www.godaddy.com/engineering/2018/04/10/an-intuitive-nodejs-client-for-the-kubernetes-api/)
478
+
479
+
</details>
480
+
467
481
<details>
468
482
<summary>Gojek</summary>
469
483
@@ -517,6 +531,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
517
531
*[SRE Classroom - How to Design a Distributed System in 3 Hours](https://www.usenix.org/conference/srecon19americas/presentation/thomas)
518
532
*[Using PRDs and User Journeys to Design User-Friendly Tools](https://www.usenix.org/conference/srecon19americas/presentation/stockman)
519
533
*[How Google SRE and Developers Work Together](https://www.youtube.com/watch?v=DOQqOrHs3VY)
534
+
*[SREcon21 - Experiments for SRE](https://www.youtube.com/watch?v=yjusNjAFxFg)
520
535
521
536
</details>
522
537
@@ -653,7 +668,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
*[SRE Teams #8: Loggi](https://sreteams.substack.com/p/loggi)
659
674
@@ -672,6 +687,25 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
672
687
673
688
</details>
674
689
690
+
<details>
691
+
<summary>Mattermost</summary>
692
+
693
+
### Blog Posts
694
+
695
+
*[Monitoring Cloud Environments at Scale with Prometheus and Thanos](https://mattermost.com/blog/monitoring-cloud-environments-at-scale-with-prometheus-and-thanos/)
696
+
*[How We Use Sloth to do SLO Monitoring and Alerting with Prometheus](https://mattermost.com/blog/sloth-for-slo-monitoring-and-alerting-with-prometheus/)
697
+
698
+
</details>
699
+
700
+
<details>
701
+
<summary>Meituan (美团)</summary>
702
+
703
+
### Blog Posts
704
+
705
+
*[The development and practice of SRE in the cloud (云端的SRE发展与实践)](https://tech.meituan.com/2017/08/03/meituanyun-sre.html)
706
+
707
+
</details>
708
+
675
709
<details>
676
710
<summary>Mercari</summary>
677
711
@@ -1099,6 +1133,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1099
1133
*[Fulfilling the promise of CI/CD](https://stackoverflow.blog/2021/01/19/fulfilling-the-promise-of-ci-cd/)
1100
1134
*[A deeper dive into our May 2019 security incident](https://stackoverflow.blog/2021/01/25/a-deeper-dive-into-our-may-2019-security-incident/)
1101
1135
*[Guest Post - Failing over without falling over](https://stackoverflow.blog/2020/10/23/adrian-cockcroft-aws-failover-chaos-engineering-fault-tolerance-distaster-recovery/)
1136
+
*[How We Built Our Blog](https://stackoverflow.blog/2015/07/02/how-we-built-our-blog/)
1102
1137
1103
1138
### Videos
1104
1139
@@ -1302,6 +1337,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1302
1337
1303
1338
*[Tracing SRE’s journey in Zalando - Part I](https://engineering.zalando.com/posts/2021/09/sre-journey-part1.html)
1304
1339
*[Tracing SRE’s journey in Zalando - Part II](https://engineering.zalando.com/posts/2021/09/sre-journey-part2.html)
1340
+
*[Tracing SRE’s journey in Zalando - Part III](https://engineering.zalando.com/posts/2021/10/sre-journey-part3.html)
1305
1341
1306
1342
</details>
1307
1343
@@ -1314,6 +1350,15 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1314
1350
1315
1351
</details>
1316
1352
1353
+
<details>
1354
+
<summary>Zomato</summary>
1355
+
1356
+
### Blog Posts
1357
+
1358
+
*[Huddle Diaries – DevOps and Data Platform](https://www.zomato.com/blog/huddle-diaries-devops-and-data-platform)
1359
+
1360
+
</details>
1361
+
1317
1362
## SRECon Mix Playlist
1318
1363
1319
1364
### Videos
@@ -1357,7 +1402,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1357
1402
*[The Site Reliability Workbook from Google](https://www.oreilly.com/library/view/the-site-reliability/9781492029496/) | [Read free online version hosted by Google](https://sre.google/workbook/table-of-contents/)
1358
1403
*[Training Site Reliability Engineers](https://www.oreilly.com/library/view/training-site-reliability/9781492076018/) | [Read free online version hosted by Google](https://static.googleusercontent.com/media/sre.google/en//static/pdf/training-sre.pdf)
1359
1404
*[97 Things Every SRE Should Know](https://www.oreilly.com/library/view/97-things-every/9781492081487/) | [Complimentary Copy from Nginx](https://www.nginx.com/resources/library/97-things-every-sre-should-know/)
1360
-
*[SLO Adoption and Usage in Site Reliability Engineering](https://www.oreilly.com/library/view/slo-adoption-and/9781492075370/)
1405
+
*[SLO Adoption and Usage in Site Reliability Engineering](https://www.oreilly.com/library/view/slo-adoption-and/9781492075370/) | [Read free online version hosted by Google](https://sre.google/static/pdf/slo-adoption-and-usage-in-sre.pdf)
1361
1406
*[Practical Site Reliability Engineering](https://www.oreilly.com/library/view/practical-site-reliability/9781788839563/)
1362
1407
*[Implementing Service Level Objectives](https://www.oreilly.com/library/view/implementing-service-level/9781492076803/)
0 commit comments