Skip to main content Skip to footer

AI Infrastructure Lead Architect

AI Infrastructure Architecture Associate Manager | Mid-Level | Full time
Job No. R00339281 | Multiple Locations
立即申请

YOU ARE 

As a Lead and Principal Infrastructure Architect, you own end-to-end responsibility for designing optimized compute infrastructure for large-scale AI and machine learning systems, including large-scale distributed training environments.

You are the authority who translates business goals, SLAs, and client standards into infrastructure architectures that perform at scale while being deliberately engineered for cost-efficiency. Drawing on deep experience, you weigh multiple viable solutions for any given problem — across compute, networking, storage, orchestration, and model serving — and make rational, well-justified architectural decisions tailored to each client's situation, constraints, and standards. You architect and optimize the full computational stack for performance, power, cost, and scalability; design and tune large-scale GPU clusters and distributed training systems; and ensure infrastructure meets security, compliance, and regulatory requirements.

As the recognized AI infrastructure expert in at least one hyperscaler cloud (such as AWS, Azure, or Google Cloud), you bring authoritative knowledge of that platform's AI/ML services, accelerators, networking, and cost levers, and apply it to deliver best-in-class solutions. Beyond design, you set technical direction and standards, lead and mentor engineers and architects, partner with clients and stakeholders to shape the infrastructure roadmap, and are ultimately accountable for delivering AI/ML infrastructure that meets business SLAs, controls cost, and scales to enterprise and frontier workloads. 

 

THE WORK 

  • Own the end-to-end architecture and design of optimized compute infrastructure for large-scale AI/ML systems, including large-scale distributed training environments, from concept through delivery. 

  • Develop and evaluate architecture alternatives, weighing trade-offs across compute, networking, storage, orchestration, and model serving to make rational, well-justified decisions tailored to each client's situation and standards. 

  • Lead architecture assessments and reviews of existing and proposed environments, identifying gaps, risks, bottlenecks, and optimization opportunities, and recommending remediation. 

  • Drive architectural decision-making, documenting rationale, trade-offs, and assumptions so decisions are transparent, defensible, and aligned with business SLAs and standards. 

  • Define and maintain the AI infrastructure roadmap, planning capacity, scaling, and technology evolution in step with business and product goals. 

  • Architect and optimize the full computational stack for performance, power, cost, and scalability, ensuring infrastructure meets business SLAs while being deliberately engineered for cost-efficiency. 

  • Design and tune large-scale GPU clusters and distributed training systems, including accelerator selection, interconnect/networking, and storage for high-throughput training workloads. 

  • Serve as the authoritative AI infrastructure expert in at least one hyperscaler cloud (AWS, Azure, or GCP), applying deep knowledge of its AI/ML services, accelerators, networking, and cost levers. 

  • Design deployment, automation, and CI/CD strategies for reliable, repeatable, and scalable releases of AI systems, models, and data pipelines into production. 

  • Establish AI monitoring and observability strategy across InfraOps and MLOps, defining SLAs, SLOs, alerting, and performance/cost tracking, and driving continuous optimization. 

  • Integrate AI/ML systems into enterprise environments, ensuring interoperability, security, compliance, and adherence to regulatory and client standards. 

  • Lead capacity planning and cost modeling, forecasting compute needs and engineering cost-efficiency into the architecture without compromising performance. 

  • Collaborate with clients, stakeholders, and engineering teams to align infrastructure decisions with business outcomes, translating requirements into actionable architecture and standards. 

  • Set technical direction, standards, and best practices, mentoring engineers and architects and leading design and code reviews across the team. 

EDUCATION 

  • • Bachelor's Degree in Computer Science,  Computer Engineering, related Engineering field 

BASIC (REQUIRED) QUALIFICATION 

  • Solid background in coding, building, monitoring, troubleshooting  applications of AI/ML models; selecting, designing and infrastructure for  deploying and running them on  premise or on public cloud. 

  • Strong understanding of AI and machine learning as a subject. 

  • Strong understanding of computing infrastructure  a subject, preferred knowledge of AI infrastructure. 

  • Good proficiency in programming languages such as Python, Java, or C++. 

  • Experience with data pipeline and workflow management tools (e.g., Apache Airflow, Kubeflow). 

  • Strong problem-solving skills and ability to work in a fast-paced environment. 

  • Excellent communication and collaboration skills. 

  • Significant experience in AI/ML infrastructure engineering or related roles on a hyperscaler platform for deploying large scale solutions. 

  • Proven experience in leading and managing AI projects and teams. 

  • Strong project management skills, with the ability to manage multiple projects simultaneously. 

  • Demonstrated experience in evaluating and selecting AI technologies and frameworks. 

  • Ability to work with cross-functional teams and drive project alignment. 

London

Berlin

Madrid

Paris

平等就业机会声明

所有聘用决定均不考虑年龄、种族、信仰、肤色、宗教、性别、国籍、血统、残疾状况、退伍军人身份、性取向、性别认同或表达、基因信息、婚姻状况、公民身份或任何其他受联邦、州或地方法律保护的因素。

求职者在招聘过程中没有义务披露已封存或已删除的定罪或逮捕记录。

埃森哲致力于为我们的男女军人提供退伍军人就业机会。

请阅读埃森哲的招聘和聘用声明,了解更多关于我们在招聘和聘用过程中如何处理您的数据的信息。

We work with one shared purpose: to deliver on the promise of technology and human ingenuity. Every day, more than 775,000 of us help our stakeholders continuously reinvent. Together, we drive positive change and deliver value to our clients, partners, shareholders, communities, and each other.

We believe that delivering value requires innovation, and innovation thrives in an inclusive and diverse environment. We actively foster a workplace free from bias, where everyone feels a sense of belonging and is respected and empowered to do their best work.

At Accenture, we see well-being holistically, supporting our people’s physical, mental, and financial health. We also provide opportunities to keep skills relevant through certifications, learning, and diverse work experiences. We’re proud to be consistently recognized as one of the World’s Best Workplaces™.

Join Accenture to work at the heart of change. Visit us at www.accenture.com.

埃森哲专业领域

人工智能

人工智能与数据科学职位:挖掘全新机遇

挖掘人工智能和数据的巨大潜力,以负责任的方式重塑业务的方方面面。

了解更多