REQ-768 Application Reliability Senior Specialist (Open)
City : Toronto
Category : Full time
Industry : Financial Services
Employer : Interac
Application Senior Reliability Specialist
You are a hands-on developer responsible for maintaining and supporting Interac’s highly distributed, high-performance payment system. You will diagnose and resolve application issues to ensure optimal performance and availability of all our IT applications and provide root cause analysis with recommendations for improvements. You will also focus on Site (Application) Reliability Engineering activities, including proactive monitoring, responding to alerts and automation. You will be required to be available on call and will work with senior members of the team to gather monitoring requirements from stakeholders and deliver solutions utilizing the enterprise monitoring toolset, paving the way to the SRE-based next-generation Interac platform.
You’re great at…
Understanding large-scale Java applications, database architectures, application monitoring and fault management.
Troubleshooting applications by leveraging APM tools like AppDynamics/Dynatrace.
Having an SRE mindset toward ensuring Application Availability
Identifying the application monitoring needs or performance issues and instrumenting them appropriately in AppDynamics and Splunk
Designing and instrumenting AppDynamics monitoring and tuning (health rules, alerts) for various applications.
Identifying areas of automation for building self-remediation needs.
Creating performance analysis reports & dashboards for application teams.
Proposing and implementing solutions to improve application availability and reliability.
API & Microservices technologies and containers
Who are you?
You have a University Degree in Computer Science Engineering or an equivalent combination of education and experience.
Must be eligible to work for Interac Corp. in Canada in a Full Time Capacity.
You are experienced in Core Java Object Oriented programming and understand essential Enterprise Integration Patterns.
You have experience in application support and maintenance of Java/JEE applications.
You have good knowledge of REST APIs.
Experience in using CI/CD platform for deployment.
You have debugging expertise in the Java tool stack.
Must have a minimum of 3 years experience in Application Support with a focus on improving application reliability by enhancing application monitoring, designing, and instrumenting monitoring dashboards and tuning alerts, preferably with AppDynamics, Splunk and ELK.
Overall, you have 8+ years of software development (Java)/ maintenance experience, preferably with experience in the payment system or banking domain.
Excellent understanding of ITIL service management processes.
Skilled in IT problem diagnosis and resolution.
You have experience in supporting high-throughput, low-latency systems.
You have experience in maintaining high-performance, service-oriented architectures.
You are experienced in scripting tools like Power Shell, BASH, Python, and Ansible.
You must have RDBMS expertise - Oracle and DB2.
You have a solid understanding of different types of open-source packages, preferably Spring, Apache and data transformation (jaxb2, JSON, XML).
You have strong communication skills - verbal and written (technical documentation).
You have participated in the overall delivery of software components as part of an agile development process.
Strong communication and interpersonal skills, with an ability to communicate effectively and professionally.
Willingness to learn new technologies and maintain industry knowledge.
You have a demonstrated ability to achieve successful outcomes when handling difficult situations & customers.
You have a demonstrated ability to manage multiple priorities & follow through on projects to completion.
You can stay organized and deal with information from different sources simultaneously.
You stay positive and thrive under pressure.
Open to new ideas and change initiatives, with an ability to modify the current approach in the face of new demands.
SRE (Site Reliability Engineering) expertise is nice to have.
How we work
We know that exceptional people have great ideas and are passionate about their work. Our culture encourages excellence and actively rewards contributions with:
Connection: You’re surrounded by talented people every day who are driven by their passion of a common goal.
Core Values: They define us. Living them helps us be the best at what we do.
Compensation & Benefits: Pay is driven by individual and corporate performance and we provide a multitude of benefits and perks.
Education: To ensure you are the best at what you do we invest in you