Principal, Resiliency Engineer - R125888

at Northern Trust Company in Chicago, Illinois, United States

Job Description

About Northern Trust:

Northern Trust, a Fortune 500 company, is a globally recognized, award-winning financial institution that has been in continuous operation since 1889.

Northern Trust is proud to provide innovative financial services and guidance to the world's most successful individuals, families, and institutions by remaining true to our enduring principles of service, expertise, and integrity. With more than 130 years of financial experience and over 22,000 partners, we serve the world's most sophisticated clients using leading technology and exceptional service.

NT is seeking an experienced Principal Engineer with a strong focus on resiliency and automation to scale our resiliency posture across technologies. The Principal Engineer position, reporting into the IT Resiliency Office, is a strategic role geared towards liaising with Business Units, Product Teams, Technology Leaders, and Programs to both promote and ensure overall system resiliency. This role will develop technology blueprints and Infrastructure as Code to be leveraged by our core Teams. This position will play a critical role in integrating a configuration management tool into ServiceNow. You will influence the NT culture by driving resiliency practices that lead to uninterrupted services.

This role will be responsible for a number of key functions that both support and drive improvements to Northern Trust's IT resiliency.

What you will do:

Cloud and On-Premises Infrastructure Expertise: Engineer and review resilient solutions in both cloud-based and on-premises environments

Chaos Engineering Infrastructure Initiatives: Lead chaos engineering efforts to proactively identify and mitigate potential system weaknesses

Resiliency Architecture Reviews: Represent the IT Resiliency Office during the Architectural Review Board

IaC and CI/CD Practices: Develop pipelines to deploy IaC for our core technologies

Chef/Ansible/Puppet: Develop cookbooks/playbooks/manifests in order to drive automation and manage drift among the core infrastructure technologies

Enterprise-wide Collaboration and stakeholder management: Collaborate with various teams across the organization to align and prioritize resiliency and recovery efforts

Incident Response and Recovery: Integrate with the post-mortem process following major incidents to identify areas of opportunity for enhancing resiliency

Development: Evangelize standards and practices among the Technology organization to enrich our resiliency posture

Reporting and Documentation: Develop standardized regular reporting on resilience activities, risks, and improvements to the Leadership team

You possess:

Qualifications:

  • Bachelor's degree or equivalent experience
  • 10+ years in systems engineering with a focus on resiliency, demonstrating leadership in the creation and maintenance of robust, large-scale systems
  • 5+ years as a Team Lead or hands-on Technical Manager who can engage and deliver projects to completion
  • 5+ years leading drift management with Chef, Ansible, or Puppet
  • 10+ years as a Sr Linux Engineer
  • Demonstrated ability to design and implement systems that ensure high availability, support massive transaction volumes, and facilitate seamless disaster recovery processes
  • Infrastructure and engineering experience, including functional and technical requirements gathering and solution development
  • Strong dedication to customer needs, with excellent...

    Equal Opportunity Employer - minorities/females/veterans/individuals with disabilities/sexual orientation/gender identity

Job Posting: 11934667

Posted On: May 29, 2024

Updated On: Jun 28, 2024
