Esta oportunidade de emprego já não se encontra disponível. Continua a tua pesquisa de oportunidades aqui.
Application Support Engineer
Bengaluru
Job No. atci-5538724-s2020692
Full-time
Descrição
Project Role : Application Support Engineer
Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems.
Must have skills : Linux Containers Administration
Good to have skills : NA
Minimum 3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As an Application Support Engineer, a typical day involves acting as a software detective by actively monitoring and investigating issues across various components of essential business systems. This role requires a proactive approach to identifying problems, analyzing system behavior, and ensuring the smooth operation of critical applications. The engineer collaborates with different teams to maintain system stability and performance, while continuously seeking ways to enhance service reliability and user experience through thorough problem-solving and system analysis.
Roles & Responsibilities:
Provide L2 support for Red Hat OpenShift clusters (OCP 3.x / 4.x) across on-prem and cloud environments.
Perform advanced administration, troubleshooting, and day-to-day management of OpenShift clusters to ensure availability, scalability, performance, and security.
Act as a technical bridge between L1 support and L3 platform engineering teams handle escalations and drive issue resolution.
Monitor cluster health, diagnose incidents, and restore service in production environments.
Support multi-environment setups (Dev/QA/Prod) and manage platform stability across environments.
Perform cluster upgrades (minor/major), including pre/post health checks and validations.
Support cluster expansion activities, including adding master/worker nodes as required.
Manage and troubleshoot nodes handle conditions such as NotReady, DiskPressure, and MemoryPressure manage scheduling, taints, and labels.
Troubleshoot workload and deployment issues (e.g., rollout failures, ImagePullBackOff, CrashLoopBackOff, OOMKilled) and support scaling/performance optimization.
Manage and troubleshoot networking (SDN/OVN-Kubernetes), Ingress/Routes, service connectivity, DNS, and traffic routing.
Support high availability and scalability, including HPA/VPA and resource utilization optimization.
Operate and troubleshoot monitoring/logging solutions (Prometheus, Grafana, Alertmanager EFK/ELK) and address performance bottlenecks.
Manage storage (PV/PVC lifecycle, storage classes, dynamic provisioning) and troubleshoot binding/performance issues.
Administer security controls including RBAC (RoleBindings/ClusterRoleBindings), least-privilege access, and authorization troubleshooting.
Support authentication/identity integrations (OAuth, AD, LDAP) and troubleshoot login/authentication issues.
Support CI/CD and GitOps deployments (ArgoCD, OpenShift pipelines), GitHub integrations, and sync/deployment issues.
Support OpenShift workloads on AWS (ROSA/self-managed) and handle cloud infrastructure and networking dependencies.
Manage certificates and TLS (Ingress/API/internal TLS), including renewals/expiry and TLS troubleshooting.
Perform root cause analysis (RCA) for recurring issues and contribute to runbooks/knowledge base.
Professional & Technical Skills:
OpenShift & Kubernetes: Strong expertise in OpenShift (OCP 3.x / 4.x) and Kubernetes architecture/ecosystem.
Linux: Strong Linux administration skills.
Containers: Container runtime concepts (CRI-O / Docker).
Platform lifecycle: Cluster upgrades and lifecycle management.
Networking: SDN/OVN-Kubernetes, Ingress, Routes, DNS troubleshooting.
Security & IAM: RBAC, OAuth, LDAP, Active Directory least privilege implementation.
CI/CD & GitOps: ArgoCD, GitHub integration, OpenShift pipelines troubleshooting deployment/sync issues.
Cloud: AWS (preferred), including basics of EC2, IAM, and VPC.
Monitoring & Logging: Prometheus, Grafana, Alertmanager ELK/EFK performance analysis.
Storage: PV/PVC, storage classes, dynamic provisioning.
ITSM/Operations: ServiceNow or similar ITSM tools.
Soft skills: Strong analytical/troubleshooting mindset, good communication and stakeholder management, ownership, ability to work under pressure and in a 24x7 production support environment.
Experience: 2 to 5+ years in OpenShift/Kubernetes/Cloud environments production support or platform engineering experience preferred.
Certifications (preferred): Red Hat Certified Specialist in OpenShift Administration (EX280), OpenShift Administration II (EX288), AWS Certified Solutions Architect Associate (SAA-C03).
Additional Information:
- The candidate should have minimum 3 years of experience in Linux Containers Administration.
- This position is based at our Bengaluru office.
- A 15 years full time education is required.
Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems.
Must have skills : Linux Containers Administration
Good to have skills : NA
Minimum 3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As an Application Support Engineer, a typical day involves acting as a software detective by actively monitoring and investigating issues across various components of essential business systems. This role requires a proactive approach to identifying problems, analyzing system behavior, and ensuring the smooth operation of critical applications. The engineer collaborates with different teams to maintain system stability and performance, while continuously seeking ways to enhance service reliability and user experience through thorough problem-solving and system analysis.
Roles & Responsibilities:
Provide L2 support for Red Hat OpenShift clusters (OCP 3.x / 4.x) across on-prem and cloud environments.
Perform advanced administration, troubleshooting, and day-to-day management of OpenShift clusters to ensure availability, scalability, performance, and security.
Act as a technical bridge between L1 support and L3 platform engineering teams handle escalations and drive issue resolution.
Monitor cluster health, diagnose incidents, and restore service in production environments.
Support multi-environment setups (Dev/QA/Prod) and manage platform stability across environments.
Perform cluster upgrades (minor/major), including pre/post health checks and validations.
Support cluster expansion activities, including adding master/worker nodes as required.
Manage and troubleshoot nodes handle conditions such as NotReady, DiskPressure, and MemoryPressure manage scheduling, taints, and labels.
Troubleshoot workload and deployment issues (e.g., rollout failures, ImagePullBackOff, CrashLoopBackOff, OOMKilled) and support scaling/performance optimization.
Manage and troubleshoot networking (SDN/OVN-Kubernetes), Ingress/Routes, service connectivity, DNS, and traffic routing.
Support high availability and scalability, including HPA/VPA and resource utilization optimization.
Operate and troubleshoot monitoring/logging solutions (Prometheus, Grafana, Alertmanager EFK/ELK) and address performance bottlenecks.
Manage storage (PV/PVC lifecycle, storage classes, dynamic provisioning) and troubleshoot binding/performance issues.
Administer security controls including RBAC (RoleBindings/ClusterRoleBindings), least-privilege access, and authorization troubleshooting.
Support authentication/identity integrations (OAuth, AD, LDAP) and troubleshoot login/authentication issues.
Support CI/CD and GitOps deployments (ArgoCD, OpenShift pipelines), GitHub integrations, and sync/deployment issues.
Support OpenShift workloads on AWS (ROSA/self-managed) and handle cloud infrastructure and networking dependencies.
Manage certificates and TLS (Ingress/API/internal TLS), including renewals/expiry and TLS troubleshooting.
Perform root cause analysis (RCA) for recurring issues and contribute to runbooks/knowledge base.
Professional & Technical Skills:
OpenShift & Kubernetes: Strong expertise in OpenShift (OCP 3.x / 4.x) and Kubernetes architecture/ecosystem.
Linux: Strong Linux administration skills.
Containers: Container runtime concepts (CRI-O / Docker).
Platform lifecycle: Cluster upgrades and lifecycle management.
Networking: SDN/OVN-Kubernetes, Ingress, Routes, DNS troubleshooting.
Security & IAM: RBAC, OAuth, LDAP, Active Directory least privilege implementation.
CI/CD & GitOps: ArgoCD, GitHub integration, OpenShift pipelines troubleshooting deployment/sync issues.
Cloud: AWS (preferred), including basics of EC2, IAM, and VPC.
Monitoring & Logging: Prometheus, Grafana, Alertmanager ELK/EFK performance analysis.
Storage: PV/PVC, storage classes, dynamic provisioning.
ITSM/Operations: ServiceNow or similar ITSM tools.
Soft skills: Strong analytical/troubleshooting mindset, good communication and stakeholder management, ownership, ability to work under pressure and in a 24x7 production support environment.
Experience: 2 to 5+ years in OpenShift/Kubernetes/Cloud environments production support or platform engineering experience preferred.
Certifications (preferred): Red Hat Certified Specialist in OpenShift Administration (EX280), OpenShift Administration II (EX288), AWS Certified Solutions Architect Associate (SAA-C03).
Additional Information:
- The candidate should have minimum 3 years of experience in Linux Containers Administration.
- This position is based at our Bengaluru office.
- A 15 years full time education is required.
Requisitos
15 years full time education