Senior Solutions Architect, GPU - Cloud Service Providers

NVIDIA

2.7

(9)

Multiple Locations

#JR1991569

Position summary

pects related to tasks like large scale LLM training and inference.

  • Conducting regular technical customer meetings for project/product details, feature discussions, introductions to new technologies, performance advice, and debugging sessions.

  • Collaborating with customers to build Proof of Concepts (PoCs) for solutions to address critical business needs and support cloud service integration for NVIDIA technology on hyperscalers.

  • Analyzing and developing solutions for customer performance issues for both AI and systems performance.

What we need to see:

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.

  • 3+ years of engineering (performance/system/solution) experience.

  • Hands-on experience building performance benchmarks for data center systems, including large scale AI training and inference.

  • Understanding of systems architecture including AI accelerators and networking as it relates to the performance of an overall application.

  • Effective engineering program management with the capability of balancing multiple tasks.

  • Ability to communicate ideas clearly through documents, presentations, and in external customer-facing environments.

Ways to stand out from the crowd:

  • Hands-on experience with Deep Learning frameworks (PyTorch, JAX, etc.), compilers (Triton, XLA, etc.), and NVIDIA libraries (TRTLLM, TensorRT, Nemo, NCCL, RAPIDS, etc.).

  • Familiarity with deep learning architectures and the latest LLM developments.

  • Background with NVIDIA hardware and software, performance tuning, and error diagnostics.

  • Hands-on experience with GPU systems in general including but not limited to performance testing, performance tuning, and benchmarking.

  • Experience deploying solutions in cloud environments including AWS, GCP, Azure, or OCI as well as knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes, data center deployments, etc. Command line proficiency.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.