Principal Infrastructure Engineer
About us
Ori was founded in 2018. We believe AI is radically transforming industries that drive the progress and betterment of humanity —from science, healthcare and commerce, to technology and the arts. However, the scarcity of GPU-accelerated chips and inefficient usage of GPU compute resources greatly limit how fast we can build the future of AI. Ori is solving these challenges by making AI infrastructure more accessible, available and affordable.
Meet the team behind this journey
We are growing very fast right now. This is an exciting time, and we need active support as soon as possible! Our Software Engineering team is pivotal to Ori's success, and this is where you come in with your expertise. As a Principal Infrastructure Engineer, you will analyse cutting-edge technologies, design scalable and resilient infrastructure solutions, and lead their implementation across a globally distributed environment. Combining deep technical expertise with solution-focused leadership, you will enable the infrastructure to power HPC and AI/ML workloads, delivering value through innovation and collaboration.
What You’ll Focus On
Infrastructure Design & Virtualisation:
- Architect, design and implement virtualisation solutions optimised for AI-native workloads, focusing on HPC environments and hypervisor tuning.
- Design infrastructure that dynamically adjusts to meet customer demands for storage and networking resources, ensuring resilience and scalability.
Bare-Metal and Operating System Management:
- Lead the lifecycle management of bare-metal hardware, ensuring efficient provisioning, orchestration, and optimization.
- Maintain deep expertise in Unix/Linux systems, delivering secure, performant configuration at scale.
Networking and High-Performance Storage:
- Design and implement high-performance, cloud-native storage and networking solutions for demanding workloads.
- Apply advanced knowledge of networking protocols (TCP, UDP, DNS, encryption) and software-defined networking (SDN) technologies.
Kubernetes and Cloud-Native Platforms:
- Deploy and manage Kubernetes clusters across hybrid and multi-cloud environments, leveraging CNIs and service meshes.
- Architect scalable CI/CD pipelines to automate infrastructure delivery and ensure reliable global operations.
Observability and Automation:
- Build observability pipelines to monitor and troubleshoot global systems integrating tools for logins, metrics, and tracing.
- Create automation frameworks to reduce operational toil, streamline deployments, and enhance scalability using tools like Terraform, Ansible, Go, and Python.
Architecture & Solution Design:
- Analyse emerging technologies and assess their potential t within our platform, considering scalability, security, and performance.
- Develop high- and low-level architectural proposals, balancing technical and business requirements.
- Collaborate with cross-functional teams to ensure successful implementation of infrastructure solutions.
- Provide thought leadership on architecture best practices, driving alignment across engineering and product teams.
Collaboration and Leadership:
- Foster a 'how do we achieve this?' mindset, championing a positive, solution-oriented approach to challenges.
- Mentor and guide team members in architectural principles, emerging technologies, and best practices.
- Partner with engineering leaders and stakeholders to align infrastructure projects with broader organisational goals.
You’ll be a great t if you bring a few of the below with you:
- Expertise in AI/ML workloads, GPU-accelerated systems, or HPC infrastructures.
- Familiarity with hybrid cloud and edge computing principles.
- Experience leading architecture initiatives in agile environments, working iteratively to deliver high-quality results.
It's the perfect role for you if you have:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Minimum of 8 years of experience in global scale infrastructure.
- Proven experience in infrastructure architecture and solution design, including the ability to produce detailed technical proposals.
- Demonstrated success in evaluating and adopting new toolsets and technologies.
- Extensive expertise in large-scale global infrastructure deployments, particularly in AI-native, HPC, and cloud-native environments.
- Advanced knowledge of Kubernetes, container networking (CNI), and service mesh technologies.
- Deep understanding of virtualisation and hypervisor technologies for high-performance workloads.
- Strong proficiency in networking protocols (TCP, UDP, DNS, BGP) and SDN solutions.
- Skilled in Infrastructure-as-Code (IaC) tools like Terraform, Ansible, and cloud orchestration frameworks.
- Hands-on coding experience in Go, Python, or similar languages for automation.
- Solid grasp of observability tools (Prometheus, Grafana) and distributed tracing systems.
- Strong communication and collaboration skills, with the ability to convey complex technical concepts to cross-functional teams.
- Demonstrate ability to create clear and concise documentation for infrastructure designs, processes and systems.
Qualities we look for:
- Architectural Leadership: You can analyse complex problems, evaluate solutions, and deliver clear architectural proposals.
- Solution-Oriented Mindset: You thrive on challenges, asking 'how do we make this work?' rather than dwelling on obstacles.
- Collaboration and Influence: You build strong partnerships across teams, bringing diverse perspectives into alignment.
- Adaptability: You embrace new technologies and methodologies, continually seeking ways to improve infrastructure solutions.
- Customer-Centric Focus: You design systems that address the evolving needs of customer workloads, ensuring performance and reliability.
Why should you join us?
What sets us apart is our blend of modern technology, competitive benefits, and an open, welcoming work culture that enables our people to thrive.
Here are just some of the great things you can expect from us:
- Remote work, flexible hours: we offer a fully remote work schedule, with flexible working hours and trust in your productivity, we are in sync with your team’s general locations and time zones to foster effective and seamless collaboration.
- A culture that emphasises results over hierarchy, process & ego: we place great emphasis on the quality, ingenuity and creativity of work.
- Open communication, regular feedback: we value smooth collaboration, direct and actionable feedback, and believe that leading with empathy and a growth mindset makes us better together.
- Learning Time: we all have dedicated learning time to focus on new skills, projects or interests that lay outside of your day-to-day job.
- Health & Wellbeing: We're committed to your health. Our private medical insurance covers a range of services, from routine check-ups to specialist care.
- Participation in the company shares program.
Diversity, Equity, Inclusion and Belonging
We are an equal opportunity employer and we strive to reduce unconscious bias throughout our hiring process. All applicants will be considered for employment without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, veteran, neurodiversity status or disability status. To ensure our recruitment processes provide an equal opportunity for all applicants to succeed, we encourage you to let us know if there are any adjustments that we can make.
- Department
- Engineering
- Locations
- UK Remote Working, EMEA, Remote working
- Remote status
- Fully Remote
- Yearly salary
- £90,000 - £130,000
- Employment type
- Full-time
Principal Infrastructure Engineer
Loading application form
Already working at Ori Industries?
Let’s recruit together and find your next colleague.