Application Observability Architect
City: Toronto, Ontario, Canada
Category: Technology | Analytics | Research
Industry: Financial/Banking
Employer: RBC
Job Description
WHAT IS THE OPPORTUNITY?
The Application Observability Architect plays a strategic leadership role in defining the architectures that ensure the organization’s systems and platforms are observable, resilient, and highly available. This position is accountable for establishing observability and resiliency targets across the organization’s applications, developing technical prototypes, and creating reference architectures that support scalable, fault-tolerant systems. The architect will work closely with engineering, operations, DevOps, security, and infrastructure teams to ensure that the systems architecture meets the organization’s performance, reliability, and visibility goals while supporting innovation and agility.
WHAT WILL YOU DO?
- Define the enterprise-wide observability and resiliency architectures, ensuring alignment with business objectives, service level agreements (SLAs), and key performance indicators (KPIs).
- Establish operational excellence targets and oversight for how applications and infrastructure emit logs, metrics, and other telemetry.
- Develop a roadmap for implementing observability frameworks and tools across the enterprise, including logging, monitoring, distributed tracing, and event management systems.
- Establish resiliency standards, including fault-tolerance, disaster recovery, redundancy, and high-availability targets, ensuring systems can withstand failures and recover with minimal impact.
- Collaborate with DevOps, Operations, SRE, and Engineering teams to define resiliency and observability targets for critical systems, focusing on metrics such as uptime, mean time to recovery (MTTR), latency, error rates, and system health.
- Create performance benchmarks and reliability objectives that measure the effectiveness of observability and resiliency practices across cloud, hybrid, and on-premises infrastructure.
- Set clear Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for resiliency and observability, ensuring alignment with business-critical systems and services.
- Continuously evaluate and incorporate new technologies and tools that improve observability, reliability, and performance across distributed systems, microservices, and cloud-native applications.
WHAT DO YOU NEED TO SUCCEED?
Must have:
- 8+ years of experience in enterprise architecture, with a focus on observability, system reliability, or resiliency.
- Proven experience designing and implementing resilient and observable architectures in complex, large-scale environments, including cloud-native, hybrid, and on-premises infrastructure.
- Hands-on experience with observability tools such as Prometheus, Grafana, Elastic Stack, OpenTelemetry, or equivalent, as well as monitoring, logging, and distributed tracing systems.
- Demonstrated success in leading efforts to improve system resilience through architectures that support high availability, fault tolerance, and disaster recovery.
Nice-to-have:
- Strong understanding of resiliency patterns such as circuit breakers, retries, fallbacks, and graceful degradation.
- Deep understanding of observability technologies (Splunk, Datadog, Dynatrace, etc.) and emerging industry standards (OpenTelemetry).
- Experience with distributed systems architecture, cloud platforms (AWS, Azure, GCP), and microservices-based architectures.
- Familiarity with DevOps practices and CI/CD pipelines for automating deployment, monitoring, and testing.
WHAT’S IN IT FOR YOU?
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.
- A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable
- Leaders who support your development through coaching and managing opportunities
- Ability to make a difference and lasting impact
- Work in a dynamic, collaborative, progressive, and high-performing team
- A world-class training program in financial services
- Flexible work/life balance options
- Opportunities to do challenging work
Job Skills
Applications Architecture, Application Security, Cloud Computing, Critical Thinking, Data Architecture, Decision Making, Detail-Oriented, Enterprise Application Delivery, Industry Knowledge, Multi-Level Communication
Additional Job Details
Address:
City:
Country:
Work hours/week:
Employment Type:
Platform:
Job Type:
Pay Type:
Posted Date:
Application Deadline:
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above
Inclusion and Equal Opportunity Employment
At RBC, we embrace diversity and inclusion for innovation and growth. We are committed to building inclusive teams and an equitable workplace for our employees to bring their true selves to work. We are taking actions to tackle issues of inequity and systemic bias to support our diverse talent, clients and communities.
We also strive to provide an accessible candidate experience for our prospective employees with different abilities. Please let us know if you need any accommodations during the recruitment process.
Join our Talent Community
Stay in the know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips, and recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.