Member of Technical Staff - Voice & Vision
Microsoft
This job is no longer accepting applications
See open jobs at Microsoft.See open jobs similar to "Member of Technical Staff - Voice & Vision" Tech:NYC.IT
USD 137,600-267k / year
Posted on Mar 21, 2025
Member of Technical Staff – Voice & Vision
Mountain View, California, United States
Date posted
Mar 20, 2025
Job number
MIC010558
Work site
Microsoft on-site only
Role type
Individual contributor
Profession
Software Engineering
Discipline
Software Engineering
Employment type
Full-time
CLICK HERE TO APPLY
Job Description
Overview:
As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate individuals to work with us on the most interesting and challenging AI questions of our time. Our vision is bold and broad — to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It’s also inclusive: we aim to make AI accessible to all — consumers, businesses, developers — so that everyone can realize its benefits.
We are seeking a highly skilled and experienced Member of Technical Staff – Voice & Vision to join our team and drive the development of voice and vision capabilities for our Copilot product. The ideal candidate will have a strong background in engineering, with a focus on voice recognition, natural language processing, and computer vision technologies. As a Member of Technical Staff – Voice & Vision, you will be a founding engineer in a team that is at the forefront of generative AI, making significant strides in audio while aggressively pushing into the nascent field of video.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
By applying to this U.S. Mountain View, CA position, you are required to be local to the San Francisco area and in office 3 days a week.
Key Responsibilities:
- Work on cutting-edge technologies with a focus on voice and vision, including super resolution and real-time video streaming.
- Collaborate with a dynamic team to deliver innovative solutions that enhance the user experience across various platforms.
- Apply expertise in audio and video technologies to new AI contexts, focusing on traditional methods such as noise suppression and echo cancellation to enhance voice quality across mobile platforms.
- Develop and implement advanced techniques for video and image manipulation, including upscaling and super resolution.
- Lead the integration and rollout of vision capabilities, ensuring seamless collaboration between the vision and voice teams to deliver next-level improvements and expand functionality across various contexts.
- Drive hands-on development of voice-heavy and video-heavy features, pushing the boundaries of generative AI in both audio and video domains.
- Collaborate with product managers, designers, and other stakeholders to define technical requirements and deliverables.
- Design, develop, and optimize voice recognition algorithms, including acoustic modeling, language modeling, and speech-to-text conversion.
- Implement and enhance natural language processing (NLP) techniques, such as named entity recognition, sentiment analysis, and intent detection.
- Develop and refine computer vision algorithms for tasks such as image classification, object detection, and facial recognition.
- Ensure the scalability, performance, and reliability of voice and vision systems through rigorous testing and validation.
Required Qualifications
- Bachelor's Degree in Computer Science, or related technical discipline AND 6 years technical engineering experience with coding in languages including, but not limited to, C, C , C#, Java, JavaScript, or Python
- OR equivalent experience.
Preferred Qualifications
- Bachelor's degree in computer science, or related technical discipline AND 8 years technical engineering experience with coding in languages including, but not limited to, C, C , C#, Java, JavaScript, or Python
- OR equivalent experience.
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications and processes offers for these roles on an ongoing basis.
#MicrsoftAI #Copilot
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
This job is no longer accepting applications
See open jobs at Microsoft.See open jobs similar to "Member of Technical Staff - Voice & Vision" Tech:NYC.