You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+47-2Lines changed: 47 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -67,6 +67,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
67
67
### Blog Posts
68
68
69
69
*[Why Are the Top Internet Companies Choosing SRE over Traditional O&M?](https://www.alibabacloud.com/blog/why-are-the-top-internet-companies-choosing-sre-over-traditional-o%26m_596099)
70
+
*[Architecture and Practices of Bilibili's Real-time Platform](https://www.alibabacloud.com/blog/architecture-and-practices-of-bilibilis-real-time-platform_596676)
70
71
71
72
</details>
72
73
@@ -125,6 +126,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
125
126
126
127
*[Anomaly Detection on Golden Signals](https://www.usenix.org/conference/srecon19asia/presentation/chen-yu)
127
128
*[NetRadar: Monitoring the Datacenter Network](https://www.usenix.org/conference/srecon19asia/presentation/chen-yun)
129
+
*[Let the Chaos Begin—SRE Chaos Engineering Meets Cybersecurity](https://www.youtube.com/watch?v=x3c0PPkSf14)
128
130
129
131
</details>
130
132
@@ -473,6 +475,18 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
*[Kubernetes - A Practical Introduction for Application Developers](https://www.godaddy.com/engineering/2018/05/02/kubernetes-introduction-for-developers/)
486
+
*[An Intuitive Node.js Client for the Kubernetes API](https://www.godaddy.com/engineering/2018/04/10/an-intuitive-nodejs-client-for-the-kubernetes-api/)
487
+
488
+
</details>
489
+
476
490
<details>
477
491
<summary>Gojek</summary>
478
492
@@ -526,6 +540,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
526
540
*[SRE Classroom - How to Design a Distributed System in 3 Hours](https://www.usenix.org/conference/srecon19americas/presentation/thomas)
527
541
*[Using PRDs and User Journeys to Design User-Friendly Tools](https://www.usenix.org/conference/srecon19americas/presentation/stockman)
528
542
*[How Google SRE and Developers Work Together](https://www.youtube.com/watch?v=DOQqOrHs3VY)
543
+
*[SREcon21 - Experiments for SRE](https://www.youtube.com/watch?v=yjusNjAFxFg)
529
544
530
545
</details>
531
546
@@ -660,7 +675,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
*[SRE Teams #8: Loggi](https://sreteams.substack.com/p/loggi)
666
681
@@ -679,6 +694,25 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
679
694
680
695
</details>
681
696
697
+
<details>
698
+
<summary>Mattermost</summary>
699
+
700
+
### Blog Posts
701
+
702
+
*[Monitoring Cloud Environments at Scale with Prometheus and Thanos](https://mattermost.com/blog/monitoring-cloud-environments-at-scale-with-prometheus-and-thanos/)
703
+
*[How We Use Sloth to do SLO Monitoring and Alerting with Prometheus](https://mattermost.com/blog/sloth-for-slo-monitoring-and-alerting-with-prometheus/)
704
+
705
+
</details>
706
+
707
+
<details>
708
+
<summary>Meituan (美团)</summary>
709
+
710
+
### Blog Posts
711
+
712
+
*[The development and practice of SRE in the cloud (云端的SRE发展与实践)](https://tech.meituan.com/2017/08/03/meituanyun-sre.html)
713
+
714
+
</details>
715
+
682
716
<details>
683
717
<summary>Mercari</summary>
684
718
@@ -1089,6 +1123,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1089
1123
*[Fulfilling the promise of CI/CD](https://stackoverflow.blog/2021/01/19/fulfilling-the-promise-of-ci-cd/)
1090
1124
*[A deeper dive into our May 2019 security incident](https://stackoverflow.blog/2021/01/25/a-deeper-dive-into-our-may-2019-security-incident/)
1091
1125
*[Guest Post - Failing over without falling over](https://stackoverflow.blog/2020/10/23/adrian-cockcroft-aws-failover-chaos-engineering-fault-tolerance-distaster-recovery/)
1126
+
*[How We Built Our Blog](https://stackoverflow.blog/2015/07/02/how-we-built-our-blog/)
1092
1127
1093
1128
### Videos
1094
1129
@@ -1292,6 +1327,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1292
1327
1293
1328
*[Tracing SRE’s journey in Zalando - Part I](https://engineering.zalando.com/posts/2021/09/sre-journey-part1.html)
1294
1329
*[Tracing SRE’s journey in Zalando - Part II](https://engineering.zalando.com/posts/2021/09/sre-journey-part2.html)
1330
+
*[Tracing SRE’s journey in Zalando - Part III](https://engineering.zalando.com/posts/2021/10/sre-journey-part3.html)
1295
1331
1296
1332
</details>
1297
1333
@@ -1304,6 +1340,15 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1304
1340
1305
1341
</details>
1306
1342
1343
+
<details>
1344
+
<summary>Zomato</summary>
1345
+
1346
+
### Blog Posts
1347
+
1348
+
*[Huddle Diaries – DevOps and Data Platform](https://www.zomato.com/blog/huddle-diaries-devops-and-data-platform)
1349
+
1350
+
</details>
1351
+
1307
1352
## SRECon Mix Playlist
1308
1353
1309
1354
### Videos
@@ -1347,7 +1392,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
1347
1392
*[The Site Reliability Workbook from Google](https://www.oreilly.com/library/view/the-site-reliability/9781492029496/) | [Read free online version hosted by Google](https://sre.google/workbook/table-of-contents/)
1348
1393
*[Training Site Reliability Engineers](https://www.oreilly.com/library/view/training-site-reliability/9781492076018/) | [Read free online version hosted by Google](https://static.googleusercontent.com/media/sre.google/en//static/pdf/training-sre.pdf)
1349
1394
*[97 Things Every SRE Should Know](https://www.oreilly.com/library/view/97-things-every/9781492081487/) | [Complimentary Copy from Nginx](https://www.nginx.com/resources/library/97-things-every-sre-should-know/)
1350
-
*[SLO Adoption and Usage in Site Reliability Engineering](https://www.oreilly.com/library/view/slo-adoption-and/9781492075370/)
1395
+
*[SLO Adoption and Usage in Site Reliability Engineering](https://www.oreilly.com/library/view/slo-adoption-and/9781492075370/) | [Read free online version hosted by Google](https://sre.google/static/pdf/slo-adoption-and-usage-in-sre.pdf)
1351
1396
*[Practical Site Reliability Engineering](https://www.oreilly.com/library/view/practical-site-reliability/9781788839563/)
1352
1397
*[Implementing Service Level Objectives](https://www.oreilly.com/library/view/implementing-service-level/9781492076803/)
0 commit comments