Skip to main content Skip to footer

Security Platform Eng Senior Analyst

Chennai Job No. 14029768 Full-time - Remote

工作描述

Job Title: Site Reliability Engineer (SRE) - Frontline (L1 Operations)

Summary

  • We are seeking a highly motivated Frontline Site Reliability Engineer (SRE) to support and enhance the reliability, scalability, and security of our production platforms. This role acts as the first line of defense for incidents, monitoring alerts, and operational tasks across multi-cloud environments.

  • The ideal candidate brings strong cloud operational knowledge (GCP preferred), hands-on experience with monitoring/observability tools, familiarity with SIEM/SOAR workflows, and the ability to troubleshoot distributed systems in real time. You will work closely with platform engineering, cloud operations, and security teams - including SOC- to ensure uninterrupted availability of business-critical systems.

Key Responsibilities

🔹 Incident Response & Operational Support

  • Serve as the primary responder for production incidents, alerts, and service interruptions.

  • Perform initial triage, diagnosis, and resolution of platform and application issues.

  • Act as PIM (Production Incident Manager) during SEV1/SEV2 incidents - driving coordination, communication, timelines, and stakeholder updates.

  • Ensure SLA adherence and maintain clear communication during escalations.

  • Handle incidents, service requests, and changes using Jira Service Management (JSM).

  • Follow ITSM processes (Incident, Request, Problem, Change Management).

🔹 Monitoring, Observability & Platform Visibility

  • Monitor infrastructure and applications using tools such as:
    GCP Monitoring, Datadog, JSM dashboards

  • Analyze recurring alerts, reduce false positives, and contribute to alert tuning.

  • Build or improve dashboards, log queries, and monitoring pipelines.

  • Ensure operational visibility through metrics, logs, traces, and health checks.

🔹 Cloud Platform Operations (GCP, AWS Preferred)

  • Provide frontline operational support for:

    • GCP Compute Engine, Networking, IAM, Cloud Functions, GKE basics

    • GCP Patch Management (OS patching, compliance, reporting)

  • Support AWS environments as needed, including:

    • AWS Workspaces provisioning and troubleshooting

    • AWS SSM Patch Management

  • Assist with deployments, configuration changes, and policy implementations.

  • Perform day-to-day platform administration tasks across cloud infrastructure.

🔹 Security Operations (Frontline Support)

  • Collaborate with SOC and engineering teams on security alerts.

  • Support operational workflows involving SIEM (Chronicle, Splunk, ELK) and SOAR platforms.

  • Identify the operational impact of security events and escalate appropriately.

  • Apply basic cloud security principles (IAM hygiene, least privilege, MFA enforcement).

  • Support the configuration and management of Microsoft Entra Connect (Azure AD Connect) for hybrid identity synchronization.

  • Validate or triage security-related alerts that affect platform reliability.

🔹Automation & Scripting

  • Automate operational tasks, repetitive workflows, and platform checks using:

    • Python (preferred)

    • YAML for CI/CD and configuration files

  • Build simple tools or scripts to reduce manual toil and improve efficiency.

🔹 Platform Tools & Systems Management

  • Provide support for platform and operational tools including:

    • GCP Patch Management

    • AWS SSM, AWS Workspaces

    • Terraform / Ansible (basic understanding)

    • Git-based workflows (branches, PRs, version control operations)

  • Support maintenance windows, patch cycles, and release operations.

职位要求

Skills & Experience

Required

  • 2 - 4 years in SRE, Cloud Operations, Production Support, or DevOps roles.

  • Strong hands-on experience with GCP (preferred); AWS exposure / working knowledge is a plus

  • Solid understanding of:

    • Cloud fundamentals (compute, networking, IAM, security)

    • Monitoring and observability concepts

    • Distributed system troubleshooting

    • Patch management in cloud environments

  • Practical experience with ITSM processes and JSM ticketing workflows.

  • Hands-on scripting with Python

  • Experience working with SIEM/SOAR tools (Chronicle, Splunk, ELK, etc.).

  • Comfortable working in a 24×7 global operations support environment.

Nice to Have

  • Knowledge of GKE / Kubernetes fundamentals.

  • Understanding of Terraform, Ansible, or other IaC frameworks.

  • Understand and perform basic interactions with RESTful APIs, such as the Microsoft Graph API, to retrieve or modify directory data under supervision.

  • Use tools like Postman or Swagger to test and document basic API calls.

  • Experience with SLO/SLI concepts and reliability engineering practices.

  • Exposure to platform engineering, cloud automation, or DevSecOps workflows.

  • Cloud certifications (GCP Associate Cloud Engineer, AWS SysOps, Azure Administrator).

Soft Skills

  • Excellent communication skills for cross-functional coordination.

  • Strong analytical and troubleshooting abilities.

  • Calm and effective under pressure during incidents.

  • Customer-focused mindset with proactive thinking.

  • Ability to collaborate with globally distributed teams.

Education / Additional Info

  • Bachelor’s degree in Computer Science, IT, or equivalent professional experience.

  • Role is based in Chennai, India..

更多了解埃森哲

我们的专长

我们秉承“科技融灵智,匠心承未来”的企业使命,致力于通过引领变革创造价值,为我们的客户、员工、股东、合作伙伴与整个社会创造美好未来。

认识我们的团队

从业务服务部门到各个行业领域, 从职场新人到卓越领袖,我们一直在运用科技创造非凡!

联系我们

加入我们的团队

搜索与你的技能和兴趣匹配的空缺职位。我们希望招聘充满激情、求知若渴、富有创意、专注于解决方案且喜欢团队合作的员工。

埃森哲职位博客

关注埃森哲职业博客,在职场中先人一步,从真正的业内人士处,获取职业建议、内部观点以及可以即学即用的行业真知。