SRE / Observability Engineer

Kuala Lumpur | Job No. 12695264 | Full-time | On-Site

Job Description

  • Configure monitoring, alerting, dashboards, and reporting for application performance using the Dynatrace APM tool

  • Analyze application performance issues by performing root cause analysis within Dynatrace

  • Make technical recommendations on monitoring improvements (by creating technical PoCs to demonstrate performance improvements), and partner with the Engineering team on new standards for logs, business events, and tags

  • Build out new Dynatrace monitoring solutions, including testing and implementing new features and metrics

  • Monitor frontend application performance for RUM (Real User Monitoring) applications, and create and monitor Synthetic transactions

  • Perform assessment analysis to identify the scope of problems and escalate recurring issues to management

  • Train application teams to use Dynatrace for root cause analysis and issue resolution, and provide guidance to address their needs

  • Maintain application and product expertise.

  • Keep abreast of new Dynatrace features and how they affect licensing and monitoring opportunities

  • Communicate effectively with all levels, including customers, technical personnel, and management.

  • Thoroughly document processes and standard operating procedures

Qualifications

Job Qualifications:

  • Bachelor's degree in computer science or a related discipline, or equivalent work experience is required.

  • 2-3 years of experience in APM tools such as Dynatrace

  • Expertise in Python/Shell scripting/automation

  • Expertise in application instrumentation and monitoring

  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)

  • Ability to collaborate with development and support teams to resolve performance-related issues

  • Able to function rationally and methodically in a high-productivity environment.

  • Proven understanding of web technologies and distributed application architecture is required

  • Proven understanding of full life cycle software development methodologies is required

Preferred Skills:

  • Must have experience with APM tools such as Dynatrace

  • Must have excellent debugging and troubleshooting skills

  • Strong communication skills

  • Good to have knowledge of log analysis tools such as Splunk or ELK

  • Should have knowledge of Windows Server 2008-2019, Linux, Solaris, and AIX

  • Experience working with the Service Reliability Engineering team to design SLIs and SLOs for their respective applications

  • Experience designing and building Service Level Indicator (SLI) metrics and related measures, including but not limited to Service Level Objectives (SLOs), error budgets, and burn-rate alerts
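
For readers less familiar with the SLO terms in the last item, the arithmetic behind an error budget and a burn rate can be sketched as follows. This is a minimal illustration in Python, not a Dynatrace API: the function names, the 99.9% target, and the alert threshold are all hypothetical choices made for the example.

```python
def error_budget(slo_target: float) -> float:
    """Fraction of requests allowed to fail under the SLO (0.999 -> about 0.001)."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    return 1.0 - slo_target


def burn_rate(failed: int, total: int, slo_target: float) -> float:
    """Observed error rate divided by the error budget.

    A value of 1.0 means the budget is being spent at exactly the pace that
    would use it up by the end of the SLO period; higher means faster.
    """
    if total <= 0:
        return 0.0  # no traffic in the window, so nothing was burned
    return (failed / total) / error_budget(slo_target)


def should_alert(long_window_rate: float, short_window_rate: float,
                 threshold: float) -> bool:
    """Hypothetical multi-window rule: alert only when both windows exceed
    the threshold, so a brief spike that has already ended does not page."""
    return long_window_rate >= threshold and short_window_rate >= threshold


if __name__ == "__main__":
    # Assumed example numbers: 50 failures out of 10,000 requests, 99.9% SLO.
    rate = burn_rate(failed=50, total=10_000, slo_target=0.999)
    print(f"burn rate: {rate:.2f}")
    print("alert:", should_alert(rate, rate, threshold=2.0))
```

With a 99.9% target, 50 failures in 10,000 requests is a 0.5% error rate against a 0.1% budget, so the budget is being spent about five times faster than the SLO allows.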

Life at Accenture

Training and Development

Take time away to learn and learn all the time in our regional learning hubs, connected classrooms, online courses and learning boards.

Work Environment

Be your best every day in a work environment that helps drive innovation in everything you do.
