[Remote] AI Principal Machine Learning Engineer (10189)
Note: This is a remote position open to candidates in the USA. Extreme Networks is a global networking leader that provides end-to-end, cloud-driven networking solutions. They are seeking a Principal Machine Learning Engineer to drive innovation in Generative AI and machine learning solutions, lead the software development lifecycle, and mentor engineers building intelligent systems for network management.
Responsibilities
- Be a thought leader and forward thinker: help drive an innovative vision for our products and platforms, design and launch strategic machine learning (ML) solutions, and drive business-wide innovation
- Lead the end-to-end software development lifecycle, including design, testing, deployment, and operations; lead technical discussions and strategy; and participate hands-on in design reviews, code reviews, and implementation
- Craft high-performance, high-scale microservices architectures, including synchronous and asynchronous web services
- Develop real-time online inference for highly complex models using Triton, TensorRT, and mixed-precision computing
- Mentor and develop other engineers on the team, establish technical direction, and foster team culture
- Uphold the highest standards of technical rigor and operational excellence; build highly resilient and scalable systems; and champion operational and process improvements
Skills
- Degree in mathematics/computer science or related discipline
- 5 to 10 years of experience in the complete software development lifecycle including design, coding, code reviews, testing, build processes, deployments and operations
- 5 to 10 years of experience in Python with an in-depth knowledge of its advanced features and libraries
- Expertise in designing RESTful APIs with hands-on experience with technologies such as FastAPI
- Proficient in Docker, Kubernetes, and modern CI/CD practices
- 3+ years of experience leading the design and architecture of large distributed systems, preferably on cloud platforms (e.g., AWS, Azure, Google Cloud)
- Experience as a mentor or tech lead, or leading an engineering team
- MS or PhD in Computer Science or equivalent experience in ML
- Experience working with ML technologies (PyTorch, SageMaker, Triton, TensorRT, etc.)
- Experience with NoSQL and document databases
- Proven ability to handle big data, optimize workflows, and improve system performance
Benefits
- Plus benefits
Company Overview