
Understanding incidents
This was a significant cross-team effort — helping users understand an incident, identify its root cause, and remediate it, all using new AI technology and a redesigned UI. It meant balancing the needs, schedules, and perspectives of team members across multiple overlapping teams. ​​
​
Presented at IBM's THINK conference and submitted for a Red Dot award
​

01
Understanding impact
When the development team adopted an Agentic AI LLM model, the UI needed to be reworked. In short — the AI uses a large language model to analyze data, and the agent uses that data to determine the root cause of an incident.
​
The design challenge was about more than the interface. Users needed to understand and trust the results. That meant making AI transparent and genuinely helpful — not giving wrong answers, or answers users already knew. And because incidents impact the business in real time, speed matters.
​
Accessibility was also a key consideration, particularly for the interactive topology visualization — ensuring the experience worked for all users, including those with visual impairments.
02
UX basics and future
I started with a simple UX exercise — heuristically reviewing all the existing flows and grouping related items and actions together to bring some order to a complex experience.
​
From there, I worked with the content designer to add explainability, and with the visual designer to create an interactive topology so users could understand the cause of an incident and its root cause at a glance.
​
Finally, I coordinated closely with developers to implement the designs in the product with real-world examples — ensuring the design intent was preserved all the way through.


03
Overlapping team ideas
Around this time, the project grew in complexity. Three things were happening at once — connecting data between two teams to surface recommended actions for different root causes, incorporating AI more deeply through an assistant that could generate summaries and include new agents, and stepping back to rethink what an incident actually means and whether the broader strategy needed to evolve.
​
It was a lot to hold at once — and it meant staying closely aligned across multiple teams with different priorities and timelines.
04
Aligning...everyone
The central question was deceptively simple: "What is an incident?" The idea was floating — not gaining traction, but not being dismissed either.
​
After reviewing competitors and gathering internal feedback, I designed a workshop to bring the many teams together around a shared answer. A separate session with technical leaders worked through the details.
