Skip to content

Commit 550ca76

Browse files
authored
Merge pull request #105227 from dnblankedelman/working-azure-sre
Addition of SRE hub to Docs
2 parents fc7d3ca + 09545c8 commit 550ca76

File tree

9 files changed

+454
-0
lines changed

9 files changed

+454
-0
lines changed

articles/site-reliability-engineering/index.yml

Lines changed: 358 additions & 0 deletions
Large diffs are not rendered by default.
Lines changed: 1 addition & 0 deletions
Loading
Lines changed: 1 addition & 0 deletions
Loading
Lines changed: 1 addition & 0 deletions
Loading
Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
---
2+
title: SRE book list | Microsoft Docs
3+
description: Increase your knowledge of site reliability engineering with these books
4+
author: dnblankedelman
5+
manager: efreeman
6+
ms.service: site-reliability-engineering
7+
ms.topic: article
8+
ms.date: 02/18/2020
9+
ms.author: dnb
10+
11+
---
12+
# Books about site reliability engineering
13+
14+
For more detailed information about site reliability engineering (SRE), the best source is a trio of books that have been published on the subject
15+
16+
- [_Site Reliability Engineering: How Google Runs Production Systems_](https://shop.oreilly.com/product/0636920041528.do) (known as "The SRE Book")
17+
- [_The Site Reliability Workbook: Practical Ways to Implement SRE_](https://shop.oreilly.com/product/0636920132448.do) (known as "The SRE Workbook")
18+
- [_Seeking SRE: Conversations About Running Production Systems at Scale_](https://shop.oreilly.com/product/0636920063964.do)
19+
20+
Each of those books provides an important set of information:
21+
22+
- The SRE Book - provides a detailed explanation of how Google implemented SRE over the years.
23+
24+
- The SRE Workbook - a companion to The SRE Book that provides a more detailed explanation of not just the “what” of SRE at Google and a few other places, but the “how” and “why”.
25+
26+
- Seeking SRE - provides a more expansive view of the SRE world beyond its origin including information on how it has been implemented in other environments.
27+
28+
Because these books describe the experience, environments and culture of organizations that may or may not resemble the one you are in, it is important to read these books with a critical eye. As you read, try to determine which practices would or would not succeed in your organization. Take some time to identify the information that you are certain can provide some positive value. Think about which parts of your organization's culture and values can support SRE work as described and which might make it more challenging. A careful, iterative embracing of SRE will almost always yield better results than a wholesale duplication of something you read in these books.
Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
---
2+
title: SRE event list | Microsoft Docs
3+
description: Participate in the global SRE community by attending these events.
4+
author: dnblankedelman
5+
manager: efreeman
6+
ms.service: site-reliability-engineering
7+
ms.topic: article
8+
ms.date: 02/18/2020
9+
ms.author: dnb
10+
11+
---
12+
# Participate in the global site reliability engineering community
13+
14+
As important as reading up on SRE is, it can often be more important to talk about it with your peers. Discussing your challenges, successes and failures around SRE can be crucial to gaining a nuanced understanding of the subject.
15+
16+
There are a number of meetups and conferences that feature SRE content. Perhaps the most directly relevant is the globally distributed [SREcon conferences](https://www.usenix.org/srecoon) put on by USENIX.
17+
18+
More and more SRE content is making its way to conferences like [LISA](https://www.usenix.org/conferences/byname/5), [All Day DevOps](https://www.alldaydevops.com), [DevOps Enterprise Summit](https://events.itrevolution.com) and local DevOps conferences like [DevOps Days](https://www.devopsdays.org). Seek out this content and others interested in the subject wherever you can find it and them.
Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
---
2+
title: SRE link collections and digests | Microsoft Docs
3+
description: Expand your knowledge of site reliability engineering with these resources
4+
author: dnblankedelman
5+
manager: efreeman
6+
ms.service: site-reliability-engineering
7+
ms.topic: article
8+
ms.date: 02/18/2020
9+
ms.author: dnb
10+
11+
---
12+
# Sources for site reliability engineering information
13+
14+
There are many, many sources for site reliability engineering (SRE) information on the web.
15+
16+
## Curated link lists
17+
18+
Here are some good curated lists of links for SRE and SRE-related topics:
19+
20+
* [Awesome Site Reliability Engineering](https://github.com/dastergon/awesome-sre)
21+
* [Awesome Site Reliability Engineering Tools](https://github.com/squadcastHQ/awesome-sre-tools)
22+
* [The Patterns of Scalable, Reliable, and Performant Large-Scale Systems](http://awesome-scalability.com)
23+
* [awesome-distributed-systems](https://github.com/theanalyst/awesome-distributed-systems)
24+
* [Awesome Consensus](https://github.com/dgryski/awesome-consensus)
25+
* [awesome-monitoring](https://github.com/crazy-canux/awesome-monitoring)
26+
* [Awesome Observability](https://github.com/adriannovegil/awesome-observability)
27+
28+
## Weekly digests of interest to site reliability engineers
29+
30+
* [SRE Weekly](https://sreweekly.com) - "SRE Weekly is a newsletter devoted to everything related to keeping a site or service available as consistently as possible."
31+
* [Devops Weekly](https://www.devopsweekly.com) - "A weekly slice of devops news brought to you by Gareth Rushgrove."
32+
* [This Week in DevOps](https://thisweekindevops.com) - "DevOps news without the hype."
33+
* [KubeWeekly](https://kubeweekly.io) - "The weekly newsletters for all things Kubernetes."
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
- name: Site reliability engineering documentation
2+
href: ../index.yml
3+
- name: Resources
4+
expanded: true
5+
items:
6+
- name: SRE books
7+
href: books.md
8+
- name: SRE events
9+
href: events.md
10+
- name: SRE link collections and digests
11+
href: links.md

bread/toc.yml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -728,6 +728,9 @@
728728
- name: Site Recovery
729729
tocHref: /azure/site-recovery/
730730
topicHref: /azure/site-recovery/
731+
- name: Site Reliability Engineering
732+
tocHref: /azure/site-reliability-engineering/
733+
topicHref: /azure/site-reliability-engineering/
731734
- name: Synapse Analytics
732735
tocHref: /azure/sql-data-warehouse/
733736
topicHref: /azure/sql-data-warehouse/

0 commit comments

Comments
 (0)