Member of Technical Staff, High Performance Computing Engineer
Microsoft
Mountain View, California, United States
Overview
At Microsoft AI, we are pushing the boundaries of technology.
We are creating unique, beautiful and powerful products that will change lives. A small, friendly, fast-moving team, we support each other to do the best work of our lives, always looking to break new ground, fast. We are proud of what we build, how we build it and that our products will define the AI era. We run lean, obsess about users, and always make our decisions based on the evidence. We ship regularly, so your work will have real and immediate impact.
We are seeking experienced High Performance Computing Engineers to join our team and contribute to the evolution of our personal AI, Copilot. This role offers a rare opportunity to work on some of the largest-scale supercomputers in the world. The right candidate will bring a wealth of positive energy, empathy, and kindness, coupled with a track record of effectiveness. You'll be proactive, relishing the challenge of crafting top-tier consumer experiences and products swiftly and efficiently. Our team is at the forefront of developing APIs that enhance our ability to fine-tune and deploy Copilot's core functionalities, in partnership with our Product Management, Design, and AI Research teams.
Our newly formed organization, Microsoft AI, is dedicated to advancing Copilot and other consumer AI products and research. The team is responsible for Copilot, Bing, Edge, and generative AI research. Come be a part of the team shaping the future of personal computing.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
By applying to this Mountain View, CA OR Redmond, WA position, you are required to be local to the San Francisco OR Seattle area and in office 3 days a week.
Qualifications
Required/Minimum Qualifications:
- Bachelor’s degree in computer science or a related technical discipline AND 4+ years of technical engineering experience building web services, coding in languages including, but not limited to, Python, C#, C++, Rust, or Java, OR equivalent experience.
- 4+ years of experience working with high-scale training clusters (e.g., working with frameworks and tools such as NVIDIA InfiniBand clusters, SLURM, Kubernetes, Ray, etc.).
- 4+ years of experience building scalable services on top of public cloud infrastructure such as Azure, AWS, or GCP.
Preferred Qualifications:
- Experience with LLM training clusters.
- Experience working with AI platforms, frameworks, and APIs.
- Experience using machine learning frameworks, including experience using, deploying, and scaling large language models, either personally or professionally.
- Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience.
- Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security.
- Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers.
- Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in web development and AI.
- Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.
Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications and process offers for these roles on an ongoing basis.
Responsibilities
- Build secure and performant AI Platform services that power Copilot.
- Work collaboratively with other platform, infrastructure, and application engineers, as well as AI researchers, to build next-generation AI products and services.
- Ship high-quality, well-tested, secure, and maintainable code.
- Find a way past roadblocks to get your work into the hands of users quickly and iteratively.
- Enjoy working in a fast-paced, design-driven product development cycle.
- Embody our Culture and Values.