HPC Kubernetes Engineering Manager

Davanti
Full-time
On-site
Dallas, Texas, United States

Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?

G-Research is a leading quantitative research and technology firm. We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.

This is a hybrid role based in our new Dallas infrastructure hub where we work on the latest technologies in a cutting-edge environment. 

The role

We are seeking a highly skilled Kubernetes Engineering Manager, with a focus on HPC, to join our Platform Engineering function in Dallas. 

Kubernetes underpins all facets of our Research platforms and HPC. As HPC Kubernetes Engineering Manager you will take ownership of the strategic roadmap, design and delivery of our Kubernetes platform. In addition, you will focus on continuous optimisation and performance enhancement of our Kubernetes platform as Research demands grow.

We are looking for a highly experienced technical manager who can lead the significant scaling up of our existing compute platforms. You will excel working on the bleeding edge of technology, pushing the boundaries of HPC compute performance and providing an innovative approach to solving complex technical challenges that arise.

Working closely with the Kubernetes Platform Management team, you will ensure a smooth transition of new engineering capabilities, with a strong focus on operational excellence in all aspects of design and implementation. 

Key responsibilities of the role include: 

  • Designing, deploying and scaling a high-performance Kubernetes platform to meet current and future demands

  • Engaging proactively with stakeholders to ensure the Kubernetes platform aligns with and supports broader business and research demands

  • Driving cross-functional engineering initiatives across the Technology and Research organisations through confident communication and collaboration

  • Managing vendor relationships, providing continuous feedback to influence product roadmaps and ensuring efficient deployment, support and maintenance of critical platforms

  • Leading and developing a high-performing engineering team across the UK and US, fostering technical excellence and professional growth

  • Monitoring and evaluating emerging trends in the Kubernetes ecosystem, and working with Architecture and Innovation teams to assess and adopt relevant technologies

  • Ensuring platform reliability, availability and security by applying a DevOps mindset and managing infrastructure using Infrastructure-as-Code tools

  • Overseeing budgeting, capacity forecasting and resource management for Kubernetes platform operations and future scaling

Who are we looking for?

The ideal candidate will have the following skills and experience:  

  • Deep technical expertise in designing and scaling high-performance Kubernetes platforms for HPC and ML workloads in distributed environments

  • Strong capability in performance tuning for ML workloads across GPU and CPU clusters, including workload scheduling, GPU integration and resource optimisation

  • Skilled in managing multi-tenant compute environments and integrating distributed file systems and high-speed interconnects, such as InfiniBand and RoCE

  • Strong collaboration and stakeholder management skills, aligning engineering outcomes with business value and ensuring smooth capability handover

  • Proven leadership and project management abilities, fostering a high-performance culture and accountable engineering teams

  • Advocate of best practices across CI/CD, automation and tooling, configuration management, and Site Reliability Engineering (SRE)

  • Committed to designing and building secure, high-integrity systems with a security-first mindset

Why should you apply?

  • Sick days, military leave, and family and medical leave 

  • Generous 401(k) plan 

  • 16 weeks’ fully paid parental leave

  • Medical and Prescription, Dental, and Vision insurance 

  • Life and Accidental Death & Dismemberment (AD&D) insurance 

  • Employee Assistance and Wellness programs 

  • Generous relocation allowance and support 

  • Great selection of office snacks, and hot and cold drinks 

  • Free on-site gym and car parking 

This role is employed through our US affiliate.

G-Research is committed to cultivating and preserving an inclusive work environment. We are an ideas-driven business and we place great value on diversity of experience and opinions.

We want to ensure that applicants receive a recruitment experience that enables them to perform at their best. If you have a disability or special need that requires accommodation, please let us know in the relevant section.