Principal Systems Engineer
Microsoft
Principal Systems Engineer
Mountain View, California, United States
Save
Overview
Microsoft Silicon Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Skype, OneDrive and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide and we are looking for passionate, high-energy engineers to help achieve that mission.
We are seeking a High Performance Computing (HPC) professional to join Microsoft’s Silicon Development Compute Solutions (SDCS) team, embedded within the broader silicon engineering organization. As a Principal Systems Engineer, you will play a critical role in architecting and managing scalable, Linux-based compute and storage infrastructure and services that supports silicon design workloads.
This global role involves close collaboration with CAD Operations, Engineering, and cross-functional teams to ensure high availability, performance, and efficiency of HPC services. You will help shape the future of compute solutions that power Microsoft’s silicon innovation, working at the intersection of infrastructure engineering and silicon development.
If you are passionate about HPC, infrastructure at scale, and enabling cutting-edge silicon design, this is a unique opportunity to make a significant impact.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond
#azurehwjobs #SCHIE #MILE
Qualifications
Required/minimum qualifications:
- Understanding of networking, systems administration, automation, and monitoring
- Written and verbal communications
- Understanding of Electronic Design Automation (EDA) concepts
- Experience with cloud platforms like Azure, Google, AWS
Silicon Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until June 27, 2025.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer
#azurehwjobs #SCHIE #MILE
Responsibilities
- Shared end-to-end ownership of Linux Engineering Display (Exceed Turbo X, VNC, VDI, etc.) and Authentication (RHDS, RHIDM, Free IPA), including the implementation and maintenance of these systems to ensure an excellent customer experience while driving innovation and operational excellence.
- Advanced administration of Linux servers, including installation, configuration, and troubleshooting of various distributions (e.g., Red Hat, Rocky8), ensuring smooth and efficient operation of both on-premise and Azure Cloud infrastructure.
- Collaborate with storage lead to design, implement and maintain Azure ANF capacity while contributing to operations for On-prem storge solutions (Pure, Isilon)
- Implement robust monitoring and alerting systems to maintain peak performance and reliability, monitor system performance, and implement enhancements to improve efficiency.
- Collaborate with Engineering and CAD teams to tackle and solve complex, challenging problems, driving innovation and optimizing system performance.
- Develop and maintain comprehensive documentation for configurations, processes, and procedures related to support technologies.
- Provide technical support and mentorship to junior team members, fostering their development and ensuring effective knowledge transfer.
- Other