AI Infrastructure Lead Architect

AI Infrastructure Architecture Associate Manager | Mid-Level | Full time

Job No. R00339281 | Multiple Locations

Job Description

YOU ARE

As a Lead and Principal Infrastructure Architect, you own end-to-end responsibility for designing optimized compute infrastructure for large-scale AI and machine learning systems, including large-scale distributed training environments.

You are the authority who translates business goals, SLAs, and client standards into infrastructure architectures that perform at scale while being deliberately engineered for cost-efficiency. Drawing on deep experience, you weigh multiple viable solutions for any given problem — across compute, networking, storage, orchestration, and model serving — and make rational, well-justified architectural decisions tailored to each client's situation, constraints, and standards. You architect and optimize the full computational stack for performance, power, cost, and scalability; design and tune large-scale GPU clusters and distributed training systems; and ensure infrastructure meets security, compliance, and regulatory requirements.

As the recognized AI infrastructure expert in at least one hyperscaler cloud (such as AWS, Azure, or Google Cloud), you bring authoritative knowledge of that platform's AI/ML services, accelerators, networking, and cost levers, and apply it to deliver best-in-class solutions. Beyond design, you set technical direction and standards, lead and mentor engineers and architects, partner with clients and stakeholders to shape the infrastructure roadmap, and are ultimately accountable for delivering AI/ML infrastructure that meets business SLAs, controls cost, and scales to enterprise and frontier workloads.

THE WORK

Own the end-to-end architecture and design of optimized compute infrastructure for large-scale AI/ML systems, including large-scale distributed training environments, from concept through delivery.

Develop and evaluate architecture alternatives, weighing trade-offs across compute, networking, storage, orchestration, and model serving to make rational, well-justified decisions tailored to each client's situation and standards.

Lead architecture assessments and reviews of existing and proposed environments, identifying gaps, risks, bottlenecks, and optimization opportunities, and recommending remediation.

Drive architectural decision-making, documenting rationale, trade-offs, and assumptions so decisions are transparent, defensible, and aligned with business SLAs and standards.

Define and maintain the AI infrastructure roadmap, planning capacity, scaling, and technology evolution in step with business and product goals.

Architect and optimize the full computational stack for performance, power, cost, and scalability, ensuring infrastructure meets business SLAs while being deliberately engineered for cost-efficiency.

Design and tune large-scale GPU clusters and distributed training systems, including accelerator selection, interconnect/networking, and storage for high-throughput training workloads.

Serve as the authoritative AI infrastructure expert in at least one hyperscaler cloud (AWS, Azure, or GCP), applying deep knowledge of its AI/ML services, accelerators, networking, and cost levers.

Design deployment, automation, and CI/CD strategies for reliable, repeatable, and scalable releases of AI systems, models, and data pipelines into production.

Establish AI monitoring and observability strategy across InfraOps and MLOps, defining SLAs, SLOs, alerting, and performance/cost tracking, and driving continuous optimization.

Integrate AI/ML systems into enterprise environments, ensuring interoperability, security, compliance, and adherence to regulatory and client standards.

Lead capacity planning and cost modeling, forecasting compute needs and engineering cost-efficiency into the architecture without compromising performance.

Collaborate with clients, stakeholders, and engineering teams to align infrastructure decisions with business outcomes, translating requirements into actionable architecture and standards.

Set technical direction, standards, and best practices, mentoring engineers and architects and leading design and code reviews across the team.

Qualification

EDUCATION

• Bachelor's Degree in Computer Science, Computer Engineering, related Engineering field

BASIC (REQUIRED) QUALIFICATION

Solid background in coding, building, monitoring, troubleshooting applications of AI/ML models; selecting, designing and infrastructure for deploying and running them on premise or on public cloud.

Strong understanding of AI and machine learning as a subject.

Strong understanding of computing infrastructure a subject, preferred knowledge of AI infrastructure.

Good proficiency in programming languages such as Python, Java, or C++.

Experience with data pipeline and workflow management tools (e.g., Apache Airflow, Kubeflow).

Strong problem-solving skills and ability to work in a fast-paced environment.

Excellent communication and collaboration skills.

Significant experience in AI/ML infrastructure engineering or related roles on a hyperscaler platform for deploying large scale solutions.

Proven experience in leading and managing AI projects and teams.

Strong project management skills, with the ability to manage multiple projects simultaneously.

Demonstrated experience in evaluating and selecting AI technologies and frameworks.

Ability to work with cross-functional teams and drive project alignment.

Locations

London

Berlin

Madrid

Paris

Additional Information

Equal Employment Opportunity Statement

All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law.

Please read Accenture’s Recruiting and Hiring Statement for more information on how we process your data during the Recruiting and Hiring process.

About Accenture

We work with one shared purpose: to deliver on the promise of technology and human ingenuity. Every day, more than 775,000 of us help our stakeholders continuously reinvent. Together, we drive positive change and deliver value to our clients, partners, shareholders, communities, and each other.

We believe that delivering value requires innovation, and innovation thrives in an inclusive and diverse environment. We actively foster a workplace free from bias, where everyone feels a sense of belonging and is respected and empowered to do their best work.

At Accenture, we see well-being holistically, supporting our people’s physical, mental, and financial health. We also provide opportunities to keep skills relevant through certifications, learning, and diverse work experiences. We’re proud to be consistently recognized as one of the World’s Best Workplaces™.

Join Accenture to work at the heart of change. Visit us at www.accenture.com.

Important Notice

We have been alerted to the existence of fraudulent messages asking job seekers to set up payment to cover various costs associated with establishing employment at Accenture. No one is ever required to pay for employment at Accenture. If you are contacted by someone asking for payment, please do not respond, and contact us at india.fc.check@accenture.com immediately.

Discover where this job fits at Accenture

AI and data science jobs: Uncover new possibilities

Unlock the power of AI and data to reinvent all facets of business–responsibly.

Learn more