Skip to main content Skip to footer

AI Infrastructure Lead Architect

AI Infrastructure Architecture Associate Manager | Mid-Level | Full time
Job No. R00339281 | Multiple Locations
Inscreve-te agora

YOU ARE 

As a Lead and Principal Infrastructure Architect, you own end-to-end responsibility for designing optimized compute infrastructure for large-scale AI and machine learning systems, including large-scale distributed training environments.

You are the authority who translates business goals, SLAs, and client standards into infrastructure architectures that perform at scale while being deliberately engineered for cost-efficiency. Drawing on deep experience, you weigh multiple viable solutions for any given problem — across compute, networking, storage, orchestration, and model serving — and make rational, well-justified architectural decisions tailored to each client's situation, constraints, and standards. You architect and optimize the full computational stack for performance, power, cost, and scalability; design and tune large-scale GPU clusters and distributed training systems; and ensure infrastructure meets security, compliance, and regulatory requirements.

As the recognized AI infrastructure expert in at least one hyperscaler cloud (such as AWS, Azure, or Google Cloud), you bring authoritative knowledge of that platform's AI/ML services, accelerators, networking, and cost levers, and apply it to deliver best-in-class solutions. Beyond design, you set technical direction and standards, lead and mentor engineers and architects, partner with clients and stakeholders to shape the infrastructure roadmap, and are ultimately accountable for delivering AI/ML infrastructure that meets business SLAs, controls cost, and scales to enterprise and frontier workloads. 

 

THE WORK 

  • Own the end-to-end architecture and design of optimized compute infrastructure for large-scale AI/ML systems, including large-scale distributed training environments, from concept through delivery. 

  • Develop and evaluate architecture alternatives, weighing trade-offs across compute, networking, storage, orchestration, and model serving to make rational, well-justified decisions tailored to each client's situation and standards. 

  • Lead architecture assessments and reviews of existing and proposed environments, identifying gaps, risks, bottlenecks, and optimization opportunities, and recommending remediation. 

  • Drive architectural decision-making, documenting rationale, trade-offs, and assumptions so decisions are transparent, defensible, and aligned with business SLAs and standards. 

  • Define and maintain the AI infrastructure roadmap, planning capacity, scaling, and technology evolution in step with business and product goals. 

  • Architect and optimize the full computational stack for performance, power, cost, and scalability, ensuring infrastructure meets business SLAs while being deliberately engineered for cost-efficiency. 

  • Design and tune large-scale GPU clusters and distributed training systems, including accelerator selection, interconnect/networking, and storage for high-throughput training workloads. 

  • Serve as the authoritative AI infrastructure expert in at least one hyperscaler cloud (AWS, Azure, or GCP), applying deep knowledge of its AI/ML services, accelerators, networking, and cost levers. 

  • Design deployment, automation, and CI/CD strategies for reliable, repeatable, and scalable releases of AI systems, models, and data pipelines into production. 

  • Establish AI monitoring and observability strategy across InfraOps and MLOps, defining SLAs, SLOs, alerting, and performance/cost tracking, and driving continuous optimization. 

  • Integrate AI/ML systems into enterprise environments, ensuring interoperability, security, compliance, and adherence to regulatory and client standards. 

  • Lead capacity planning and cost modeling, forecasting compute needs and engineering cost-efficiency into the architecture without compromising performance. 

  • Collaborate with clients, stakeholders, and engineering teams to align infrastructure decisions with business outcomes, translating requirements into actionable architecture and standards. 

  • Set technical direction, standards, and best practices, mentoring engineers and architects and leading design and code reviews across the team. 

EDUCATION 

  • • Bachelor's Degree in Computer Science,  Computer Engineering, related Engineering field 

BASIC (REQUIRED) QUALIFICATION 

  • Solid background in coding, building, monitoring, troubleshooting  applications of AI/ML models; selecting, designing and infrastructure for  deploying and running them on  premise or on public cloud. 

  • Strong understanding of AI and machine learning as a subject. 

  • Strong understanding of computing infrastructure  a subject, preferred knowledge of AI infrastructure. 

  • Good proficiency in programming languages such as Python, Java, or C++. 

  • Experience with data pipeline and workflow management tools (e.g., Apache Airflow, Kubeflow). 

  • Strong problem-solving skills and ability to work in a fast-paced environment. 

  • Excellent communication and collaboration skills. 

  • Significant experience in AI/ML infrastructure engineering or related roles on a hyperscaler platform for deploying large scale solutions. 

  • Proven experience in leading and managing AI projects and teams. 

  • Strong project management skills, with the ability to manage multiple projects simultaneously. 

  • Demonstrated experience in evaluating and selecting AI technologies and frameworks. 

  • Ability to work with cross-functional teams and drive project alignment. 

London

Berlin

Madrid

Paris

Igualdade de oportunidade de emprego

A Accenture rege-se pelos princípios da meritocracia e da igualdade de oportunidades, não discriminando nem tolerando qualquer discriminação em razão de ascendência, origem étnica, raça, cor, sexo, estado civil, território de origem, grau de instrução, posição social, orientação sexual, religião, convicções e/ou opções políticas ou ideológicas, situação económica ou social, ou em função de qualquer outro factor considerado discriminatório e proibido nos termos da lei.

Para mais informações sobre as oportunidades de emprego na Accenture e se necessitas de assistência especial, material ou infraestrutura adaptada, por favor envia um email para Portugal_careers@accenture.com.

We work with one shared purpose: to deliver on the promise of technology and human ingenuity. Every day, more than 775,000 of us help our stakeholders continuously reinvent. Together, we drive positive change and deliver value to our clients, partners, shareholders, communities, and each other.

We believe that delivering value requires innovation, and innovation thrives in an inclusive and diverse environment. We actively foster a workplace free from bias, where everyone feels a sense of belonging and is respected and empowered to do their best work.

At Accenture, we see well-being holistically, supporting our people’s physical, mental, and financial health. We also provide opportunities to keep skills relevant through certifications, learning, and diverse work experiences. We’re proud to be consistently recognized as one of the World’s Best Workplaces™.

Join Accenture to work at the heart of change. Visit us at www.accenture.com.

Áreas de atuação

Sztuczna inteligencja

Oferty pracy związane z AI i nauką o danych: Odkryj nowe możliwości

Uwolnij potencjał AI i danych, aby zreformować wszystkie aspekty działalności w odpowiedzialny sposób.

Dowiedz się więcej