Language:
English
繁體中文
Help
圖資館首頁
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Mastering site reliability engineeri...
~
Hoeppner, Florian.
Mastering site reliability engineering in enterprisea complete guide to resilient systems & chaos engineering /
Record Type:
Electronic resources : Monograph/item
Title/Author:
Mastering site reliability engineering in enterpriseby Florian Hoeppner, Francesco Sbaraglia.
Reminder of title:
a complete guide to resilient systems & chaos engineering /
Author:
Hoeppner, Florian.
other author:
Sbaraglia, Francesco.
Published:
Berkeley, CA :Apress :2025.
Description:
xxii, 311 p. :ill., digital ;24 cm.
Contained By:
Springer Nature eBook
Subject:
Business enterprisesComputer networks.
Online resource:
https://doi.org/10.1007/979-8-8688-1448-8
ISBN:
9798868814488$q(electronic bk.)
Mastering site reliability engineering in enterprisea complete guide to resilient systems & chaos engineering /
Hoeppner, Florian.
Mastering site reliability engineering in enterprise
a complete guide to resilient systems & chaos engineering /[electronic resource] :by Florian Hoeppner, Francesco Sbaraglia. - Berkeley, CA :Apress :2025. - xxii, 311 p. :ill., digital ;24 cm.
Transform enterprise IT by adopting site reliability engineering (SRE) practices that reduce downtime, build resilience, and drive business value. This book is a comprehensive guide designed to help site reliability engineers, DevOps teams, and platform engineers identify, address, and mitigate system weaknesses before they become significant critical failures. Authors Francesco Sbaraglia and Florian Hoeppner highlight the paradigm shift from IT as a cost center to a core business function, emphasizing the central role of developers and the need for speed and reliability. They detail the challenges of transitioning to SRE, including overcoming cultural resistance and legacy infrastructure limitations, while bringing to the forefront the importance of building resilience in systems and processes. Specific SRE capabilities like chaos engineering, observability, and toil management are explored, along with strategies for successful implementation, including building a Center of Excellence, selecting the right tools, and fostering a culture of collaboration and continuous improvement. Looking ahead, the book examines emerging trends like Agentic AI SRE Agents, the use of generative AI (GenAI) in SRE and the future evolution of chaos engineering. You'll learn how to embed SRE practices into your existing enterprise tech operating model and unlock tangible business outcomes: reduced downtime, increased resilience, and measurable gains in stability. Additionally, discover how GenAI can support SRE teams in planning, executing, and optimizing reliability experiments and automating toil reduction and continuous improvement efforts. By the end of this book, you'll know how to apply core SRE practices to strengthen reliability: establishing a chaos engineering practice led by SREs, running reliability-focused "game days," improving observability, troubleshooting failure scenarios, and fortifying the digital resilience of your systems and teams.
ISBN: 9798868814488$q(electronic bk.)
Standard No.: 10.1007/979-8-8688-1448-8doiSubjects--Topical Terms:
184482
Business enterprises
--Computer networks.
LC Class. No.: HD30.37
Dewey Class. No.: 658.054678
Mastering site reliability engineering in enterprisea complete guide to resilient systems & chaos engineering /
LDR
:03024nmm a2200313 a 4500
001
690597
003
DE-He213
005
20251016174911.0
006
m d
007
cr nn 008maaau
008
260409s2025 cau s 0 eng d
020
$a
9798868814488$q(electronic bk.)
020
$a
9798868814471$q(paper)
024
7
$a
10.1007/979-8-8688-1448-8
$2
doi
035
$a
979-8-8688-1448-8
040
$a
GP
$c
GP
041
0
$a
eng
050
4
$a
HD30.37
072
7
$a
UT
$2
bicssc
072
7
$a
COM043000
$2
bisacsh
072
7
$a
UT
$2
thema
082
0 4
$a
658.054678
$2
23
090
$a
HD30.37
$b
.H694 2025
100
1
$a
Hoeppner, Florian.
$3
1006151
245
1 0
$a
Mastering site reliability engineering in enterprise
$h
[electronic resource] :
$b
a complete guide to resilient systems & chaos engineering /
$c
by Florian Hoeppner, Francesco Sbaraglia.
260
$a
Berkeley, CA :
$b
Apress :
$b
Imprint: Apress,
$c
2025.
300
$a
xxii, 311 p. :
$b
ill., digital ;
$c
24 cm.
520
$a
Transform enterprise IT by adopting site reliability engineering (SRE) practices that reduce downtime, build resilience, and drive business value. This book is a comprehensive guide designed to help site reliability engineers, DevOps teams, and platform engineers identify, address, and mitigate system weaknesses before they become significant critical failures. Authors Francesco Sbaraglia and Florian Hoeppner highlight the paradigm shift from IT as a cost center to a core business function, emphasizing the central role of developers and the need for speed and reliability. They detail the challenges of transitioning to SRE, including overcoming cultural resistance and legacy infrastructure limitations, while bringing to the forefront the importance of building resilience in systems and processes. Specific SRE capabilities like chaos engineering, observability, and toil management are explored, along with strategies for successful implementation, including building a Center of Excellence, selecting the right tools, and fostering a culture of collaboration and continuous improvement. Looking ahead, the book examines emerging trends like Agentic AI SRE Agents, the use of generative AI (GenAI) in SRE and the future evolution of chaos engineering. You'll learn how to embed SRE practices into your existing enterprise tech operating model and unlock tangible business outcomes: reduced downtime, increased resilience, and measurable gains in stability. Additionally, discover how GenAI can support SRE teams in planning, executing, and optimizing reliability experiments and automating toil reduction and continuous improvement efforts. By the end of this book, you'll know how to apply core SRE practices to strengthen reliability: establishing a chaos engineering practice led by SREs, running reliability-focused "game days," improving observability, troubleshooting failure scenarios, and fortifying the digital resilience of your systems and teams.
650
0
$a
Business enterprises
$x
Computer networks.
$3
184482
650
0
$a
Management information systems.
$3
199355
650
1 4
$a
Computer Networks.
$3
919579
650
2 4
$a
Communications Engineering, Networks.
$3
273745
700
1
$a
Sbaraglia, Francesco.
$3
1006152
710
2
$a
SpringerLink (Online service)
$3
273601
773
0
$t
Springer Nature eBook
856
4 0
$u
https://doi.org/10.1007/979-8-8688-1448-8
950
$a
Professional and Applied Computing (SpringerNature-12059)
based on 0 review(s)
Multimedia
Multimedia file
https://doi.org/10.1007/979-8-8688-1448-8
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login