Post your job offer for free on H1BConnect with no upfront cost!

Logo

Hire with Us
Microsoft Corporation logo

Research Intern - AI Systems and Architecture

Microsoft Corporation

2/7/2025

Mountain View, CA

Internship

Salary: $6,550 - $13,920 per month


Job Description

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. Join our Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems & Infrastructure (AHSI) organization and be a part of the organization behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live. The SPARC organization manages Azure’s hardware roadmap from architecture concept through production for all of Microsoft’s current and future on-line services.

Requirements

  • Accepted or currently enrolled in a PhD program in Computer Science or a related STEM field.
  • At least 1 year of experience with performance analysis for AI accelerators.
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.
  • Ability to collaborate effectively with other researchers and product development teams.
  • Proficient interpersonal skills, cross-group, and cross-culture collaboration.
  • Ability to think unconventionally to derive creative and innovative solutions.

Responsibilities

  • Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.
  • Responsible for developing and contributing to an in-house performance modeling tool for large scale machine learning systems.
  • Responsible for evaluation of ideas for performance improvement along with bottleneck analysis and feature enhancement.
  • Responsible for building framework for running large scale parallel performance simulations using cloud-based compute infrastructure.
  • Developing a testing framework and testbenches for enabling operator level unit tests and end-to-end application tests for the performance model.
  • Integrate performance model with power & TCO model to project application level Perf/W and Perf/$ metrics across workloads.
  • Develop cloud-based performance simulation database for storing large scale data from design-space exploration experiments.
  • Develop data-analytics framework along with debug tools and automation for easier retrieval of performance data based on user queries.
  • Develop and maintain performance dashboards and visualization tools for improving the analysis framework.
  • Formalize and improve general software development practices including codebase maintenance, code review, feature development and software design reviews.
  • Integrating CI/CD pipeline into Azure devops software development process.
  • General troubleshooting and debug processes including common performance bottleneck limiters and developing performance comparison tools.
  • Collaborate with larger team to define product requirements, feature improvements and implementation.
  • Responsible for developing and contributing to an in-house performance modeling tool for large scale machine learning systems.
  • Responsible for evaluation of ideas for performance improvement along with bottleneck analysis and feature enhancement.
  • Responsible for building framework for running large scale parallel performance simulations using cloud-based compute infrastructure.
  • Developing a testing framework and testbenches for enabling operator level unit tests and end-to-end application tests for the performance model.
  • Integrate performance model with power & TCO model to project application level Perf/W and Perf/$ metrics across workloads.
  • Develop cloud-based performance simulation database for storing large scale data from design-space exploration experiments.
  • Develop data-analytics framework along with debug tools and automation for easier retrieval of performance data based on user queries.
  • Develop and maintain performance dashboards and visualization tools for improving the analysis framework.
  • Formalize and improve general software development practices including codebase maintenance, code review, feature development and software design reviews.
  • Integrating CI/CD pipeline into Azure devops software development process.
  • General troubleshooting and debug processes including common performance bottleneck limiters and developing performance comparison tools.
  • Collaborate with larger team to define product requirements, feature improvements and implementation.

Benefits

  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
Logo

© 2024 H1BConnect. All rights reserved.

Check out our sister site LatamDev for tech jobs in Latin America! 🌎