Welcome!

Linux Containers Authors: Yeshim Deniz, Liz McMillan, Zakia Bouachraoui, Elizabeth White, Pat Romanski

Related Topics: @DevOpsSummit, Microsoft Cloud, Linux Containers, Containers Expo Blog, Agile Computing

@DevOpsSummit: Blog Feed Post

Five Reasons to Ditch Email Alerts By @PagerDuty | @DevOpsSummit [#DevOps]

Looking to improve email alerts? Look again. Here are 5 reasons why you should ditch email alerts if you’re still using them

Five Reasons to Ditch Email Alerts

Want to improve your email alerts? Think again

Monitoring systems can help you better manage your uptime, but even though you may spend a lot of time configuring checks and thresholds to identify problems early, your alerts are only as good as your incident response processes. One of the biggest challenges we’ve seen when talking with customers is getting bogged down in email alerts. Despite the increasing disarray of our inboxes, many monitoring systems and IT Operations teams still rely on email for alerting, even though most agree it’s messy and too easy to miss. Looking to improve email alerts? Look again. Here are 5 reasons why you should ditch email alerts if you’re still using them:

1. Email alerts are too easy to miss

“Hey did you see this latest cat video my friend emailed to me?”

Even if you’re staring at your email inbox constantly, it’s not hard to imagine a critical alert getting buried by other alerts or work-related emails. For this reason, top Operations teams typically use at least two notification channels where one is a phone call or SMS message. Having an audible sound with the alert definitely helps it get noticed.

2. You can’t assign an email to someone

“Um, is someone on this?”

Time is critical during a severe incident and you don’t want your team wondering about who’s on point for addressing it. If your alerts are getting emailed to multiple people, there’s no way to know for sure who on the team should respond first. Has someone else already seen the email and are they already working on it? Am I really the best person to respond, or should I wait for someone with more experience to take it? Top Operations teams with a strong culture of response make sure each incident is automatically assigned to the person responsible for fixing it. Incident management tools and ticketing systems can enforce this workflow by automatically assigning an incident to the engineer on-call and by tracking assignee status for each open incident.

In PagerDuty, we use your on-call schedules to determine who’s on point right now, and assign the incident accordingly.

3. You can’t aggregate or bundle emails

“Will it ever stop?”

Alert storms suck. When stuff really goes wrong, all of your monitoring systems will be sending alerts, multiple times per minute. Those alerts can quickly flood your inbox making it virtually unusable. PagerDuty will aggregate alerts for a single incident and will bundle alerts for multiple incidents (after the first notification for each) so repeated alerts will notify you only once. Dashboards are helpful here too so you can get a quick picture of how many incidents are open and where they’re coming from.

4. Email doesn’t offer visibility for the team

“What’s the latest status?”

It’s hard to tell from email who’s working on an incident, how long it has been open, and the latest status. This information is useful not only to your team, but also to your management and other business stakeholders. It’s annoying to be pinged constantly by people wanting an update on the issue when you’re trying to fix it. By taking your incidents into a system like PagerDuty, you can get all of this information in a single dashboard view that’s accessible to management as well as everyone on your team. We can’t promise that the CEO and CTO still won’t ask, but at least there’s a place you can direct them to where they can get the information for themselves.

5. You can’t create metrics with email alerts

“How are we doing?”

Top Operations teams track metrics to continually measure, evaluate, and improve their performance. We’ve blogged before about what metrics you should track and all of them would be incredibly difficult to measure from emails. Tracking when an incident is opened, how long it takes for the first person to notice & respond, and ultimately how long it takes your team to resolve it are critical for proactively managing your uptime. With this data, you can create dashboards on team performance and weekly reports to facilitate conversations within your team and company.

Want to learn more about incident resolution best practices and how IT stacks up today? Email alerts may be only one challenge you’re facing, but you’re not alone. Learn more about the key facets of an intelligent incident resolution strategy and common challenges in a commissioned study conducted by Forrester Consulting on behalf of PagerDuty. Download the study to read more.

The post 5 Reasons to Ditch Email Alerts appeared first on PagerDuty.

Read the original blog entry...

More Stories By PagerDuty Blog

PagerDuty’s operations performance platform helps companies increase reliability. By connecting people, systems and data in a single view, PagerDuty delivers visibility and actionable intelligence across global operations for effective incident resolution management. PagerDuty has over 100 platform partners, and is trusted by Fortune 500 companies and startups alike, including Microsoft, National Instruments, Electronic Arts, Adobe, Rackspace, Etsy, Square and Github.

IoT & Smart Cities Stories
Dion Hinchcliffe is an internationally recognized digital expert, bestselling book author, frequent keynote speaker, analyst, futurist, and transformation expert based in Washington, DC. He is currently Chief Strategy Officer at the industry-leading digital strategy and online community solutions firm, 7Summits.
Digital Transformation is much more than a buzzword. The radical shift to digital mechanisms for almost every process is evident across all industries and verticals. This is often especially true in financial services, where the legacy environment is many times unable to keep up with the rapidly shifting demands of the consumer. The constant pressure to provide complete, omnichannel delivery of customer-facing solutions to meet both regulatory and customer demands is putting enormous pressure on...
IoT is rapidly becoming mainstream as more and more investments are made into the platforms and technology. As this movement continues to expand and gain momentum it creates a massive wall of noise that can be difficult to sift through. Unfortunately, this inevitably makes IoT less approachable for people to get started with and can hamper efforts to integrate this key technology into your own portfolio. There are so many connected products already in place today with many hundreds more on the h...
The standardization of container runtimes and images has sparked the creation of an almost overwhelming number of new open source projects that build on and otherwise work with these specifications. Of course, there's Kubernetes, which orchestrates and manages collections of containers. It was one of the first and best-known examples of projects that make containers truly useful for production use. However, more recently, the container ecosystem has truly exploded. A service mesh like Istio addr...
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
Charles Araujo is an industry analyst, internationally recognized authority on the Digital Enterprise and author of The Quantum Age of IT: Why Everything You Know About IT is About to Change. As Principal Analyst with Intellyx, he writes, speaks and advises organizations on how to navigate through this time of disruption. He is also the founder of The Institute for Digital Transformation and a sought after keynote speaker. He has been a regular contributor to both InformationWeek and CIO Insight...
Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life settlement products to hedge funds and investment banks. After, he co-founded a revenue cycle management company where he learned about Bitcoin and eventually Ethereal. Andrew's role at ConsenSys Enterprise is a mul...
To Really Work for Enterprises, MultiCloud Adoption Requires Far Better and Inclusive Cloud Monitoring and Cost Management … But How? Overwhelmingly, even as enterprises have adopted cloud computing and are expanding to multi-cloud computing, IT leaders remain concerned about how to monitor, manage and control costs across hybrid and multi-cloud deployments. It’s clear that traditional IT monitoring and management approaches, designed after all for on-premises data centers, are falling short in ...
In his general session at 19th Cloud Expo, Manish Dixit, VP of Product and Engineering at Dice, discussed how Dice leverages data insights and tools to help both tech professionals and recruiters better understand how skills relate to each other and which skills are in high demand using interactive visualizations and salary indicator tools to maximize earning potential. Manish Dixit is VP of Product and Engineering at Dice. As the leader of the Product, Engineering and Data Sciences team at D...
Dynatrace is an application performance management software company with products for the information technology departments and digital business owners of medium and large businesses. Building the Future of Monitoring with Artificial Intelligence. Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more busine...