Menu

Senior Site Reliability Engineer

at Microsoft Corporation in Chicago, Illinois, United States

Job Description

Microsoft is a company where innovators come to collaborate, envision what can be and take their careers to levels they cannot achieve anywhere else. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking a cloud-enabled world.

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our ideas are better, our products are better, and our customers are better served.

There has never been a more exciting time to be working in healthcare at Microsoft. Our Health & Life Sciences Solutions (HLS) organization is an interdisciplinary team of product managers, designers, engineers, and clinicians who are designing, developing and deploying next-generation healthcare solutions powered by the Microsoft Cloud for healthcare organizations around the world.

We are looking to hire a Senior Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be an integral member of a team within HLS Solutions that is working to empower clinicians to achieve more with groundbreaking healthcare-oriented copilots and provide a secure, scalable, reliable solution. The candidate will be excited about waking up every morning to apply their skills in automation, Continuous Integration/Continuous Deployment (CI/CD), and Infrastructure as code (IAC) to develop and deploy new technologies and experiences centered around driving positive healthcare outcomes.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

Responsibilities:

We are looking for people with experiences working with all service aspects of high throughput and multi-tenant services, ability to design components?carefully, properly handle errors, write clean and well-factored code with good tests and good maintainability.??

Responsibilities include:

+ Demonstrates expertise in distributed systems design, interactions between cloud technology layers and components, common dependencies at scale, and the code that defines infrastructures. Can identify and recommend configurations optimal of cloud technology solutions and modify the code base that defines systems or cloud technologies to improve the reliability and operability of supported products with minimal guidance from other engineers.

+ Develops an understanding of the code, features, and operations of specific products at scale as required to contribute to incremental improvements in product availability, reliability, efficiency, observability, and/or performance; participates in on-boarding, code/design reviews, and regular meetings with the engineering teams that develop and/or manage those products.

+ Researches and maintains an awareness in industry trends, advances in distributed systems and cloud technologies, new tools, and/or processes for maintaining and improving product availability, reliability, efficiency, observability, and/or performance. Contributes to the implementation of new solutions within their team by identifying ways they can be applied to solve persistent problems.

+ Leverages technical expertise in large scale distributed systems and specific products, as well as objective insights drawn from analyses of production telemetry data to suggest changes or add-ons to product features or code to improve the availability, reliability, efficiency, observability, and performance of product components or features supported by their team.

+ Independently develops code or scripts that automate the performance of repetitive and easily scalable operations processes (e.g., monitoring, alerting, deploying products and updates) across components and features of products operating at scale.

+ Independently uses existing tools and/or models to troubleshoot problems or flaws affecting the availability, reliability, performance, and/or efficiency of components and features; proposes solutions that will resolve and prevent recurring issues and brings them to the attention of their Site Reliability Engineering (SRE) and/or product engineering teams.

+ Embody our culture (https://careers.microsoft.com/v2/global/en/culture) and values. (https://www.microsoft.com/en-us/about/corporate-values)

Qualifications

Minimum Qualifications:

+ 6+ years technical experience in software engineering, network engineering, or systems administration

+ OR Bachelor’s Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration

+ OR Master’s Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.

+ 2+ years of experience with Azure Cloud.

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to, the following specialized security screenings:

+ Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

+ 7+ years technical experience in software engineering, network engineering, or systems administration

+ OR Bachelor’s Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration

+ OR Master’s Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration

+ OR Doctorate Degree in Computer Science, Information Technology, or related field.

Site Reliability Engineering IC4 – The typical base pay range for this role across the U.S. is USD $117,200 – $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 – $250,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: US corporate pay information | Microsoft Careers (https://careers.microsoft.com/v2/global/en/us-corporate-pay.html)

Microsoft will accept applications for the role until August 12, 2024.

\#Health&LifeScience #HLS

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html) .

To view full details and how to apply, please login or create a Job Seeker account
How to Apply Copy Link

Job Posting: JC263443622

Posted On: Jul 31, 2024

Updated On: Aug 03, 2024

Please Wait ...