Principal Cloud Infrastructure Engineer
Atreides
Job Title: Principal Cloud Infrastructure Engineer (Remote, UK)
Company Overview:
Atreides helps organizations transform large and complex multi-modal datasets into information-rich, geo-spatial and digital data subscriptions used across various use cases. We focus on providing defense intelligence professionals with high-fidelity data solutions to derive insights quickly. As a fast-growing, high-performance early-stage company, we value diversity and inclusion, trust, and autonomy. A mission-driven mindset and entrepreneurial spirit are essential as we work to unlock the power of massive-scale data for a safer, stronger, and more prosperous world.
Team Overview:
We are a passionate team of technologists, data scientists, and analysts with backgrounds in operational intelligence, law enforcement, large multinationals, and cybersecurity operations. We obsess about designing products that will change the way global companies, governments and nonprofits protect themselves from external threats and global adversaries.
Position Overview:
We are seeking a Principal Cloud Infrastructure Architect to design and actively build and scale the systems that form the backbone of our cloud platform. You will own the design, deployment, and operation of production infrastructure across AWS (and potentially Azure), delivering secure, reliable, and self-service environments that empower product teams to deploy and iterate rapidly.
This is a hands-on technical leadership role—you’ll design solutions, write infrastructure code, and guide the evolution of our platform through direct implementation and mentorship. You’ll collaborate deeply with software and product teams to ensure that new products are built on sound, scalable infrastructure foundations. You’ll also be a key voice in product planning sessions where infrastructure requirements and capabilities influence the path forward.
We’re looking for someone who can take broad goals, translate them into technical designs, and execute end-to-end—someone who has built and operated large-scale systems and is excited to do it again, better.
Team Principles:
At Atreides, we believe that teams work best when they:
- Remain curious and passionate in all aspects of our work
- Promote clear, direct, and transparent communication
- Embrace the 'measure twice, cut once' philosophy
- Value and encourage diverse ideas and technologies
- Lead with empathy in all interactions
Responsibilites:
- Architect, deploy, and maintain scalable, secure infrastructure across AWS (and potentially Azure) using Infrastructure as Code (Pulumi, Teraform etc).
- Collaborate closely with internal engineering teams to understand product requirements, identify system constraints, and design solutions that balance performance, reliability, and cost.
- Own AWS EKS clusters and Kubernetes workloads, including autoscaling (Karpenter), FluxCD-based GitOps, Helm charts, and custom SDKs for reusable deployments.
- Design and manage core cloud services—VPCs, VPNs, IAM, S3, RDS, ECR, load balancing, and network security controls.
- Build, maintain, and document infrastructure templates and developer enablement tooling to allow teams to deploy independently.
- Implement observability and monitoring systems using Grafana, Prometheus, and Loki for infrastructure and application metrics.
- Establish and contribute to CI/CD best practices using GitHub Actions and related automation pipelines.
- Participate in on-call rotations and respond to infrastructure alerts, RFIs, and external requests.
- Author and review technical design documents and proposals for infrastructure initiatives, architecture changes, and new services.
- Provide technical leadership and mentorship to peers, helping elevate infrastructure craftsmanship across the organization.
Desired Qualification:
- 10+ years of experience in DevOps, platform, or infrastructure engineering roles, including 2+ years in a senior or principal capacity.
- Proven experience designing and delivering complete cloud infrastructure solutions independently.
- Expert-level understanding of AWS (EKS, VPC, EC2, S3, RDS, IAM, KMS, Route53, etc.).
- Experience with Azure cloud services (VMs, networking, AKS, identity, monitoring) is a strong plus.
- Proficiency with Pulumi (TypeScript or Python) or Terraform is required.
- Strong Kubernetes expertise: workload management, autoscaling, GitOps (FluxCD), Helm, Kustomize.
- Strong experience building CI/CD pipelines (GitHub Actions, GitLab CI, or similar).
- Skilled in containerization (Docker) and maintaining image registries and artifact repositories.
- Solid understanding of networking and security fundamentals (VPNs, firewall rules, IAM policies, encryption).
- Familiarity with observability and alerting stacks (Prometheus, Grafana, Loki).
- Excellent communicator, able to write and present technical designs and proposals clearly.
- Experience mentoring others and leading by example in fast-paced, high-autonomy environments.
- Proficiency in Python for automation, scripting, or tooling development. Experience with Go is a plus.
- Display friendliness and empathy towards others :)
Compensation and Benefits:
- Competitive salary
- Comprehensive health, dental, and vision insurance plans
- Flexible remote work environment
- Additional benefits like flexible hours, work travel opportunities, competitive vacation time and parental leave
While meeting all of these criteria would be ideal, we understand that some candidates may meet most, but not all. If you're passionate, curious and ready to "work smart and get things done," we'd love to hear from you.