Provide full scope of the day to day Production Support including resolution of the live incidents as well as post-resolution problem management activities;
Drive issue owners with right progress that fits the requirements for Tier 0 services in incident management;
Provide timely incident communication internally and externally as required;
Provide technical expertise in finding root cause of major system problems;
Work as the owner of production environment, participant in various changes deployment and related post implementation verifications, including new link testing;
Work closely with various stakeholders located in different regions of the world;
Review, design and re-engineer monitoring rules with new technologies to improve the efficiency of daily application support;
Automate daily manual routine work;
Contribute productively to the team's objectives, sometimes supporting and sometimes leading activities to achieve shared goals.
任职条件
Bachelor’s degree, with equivalent experience in software engineering discipline.
Fluent English communication skills.
Knowledgeable cross domain and able to operate a global level solving complex, mission critical problems.
Demonstrable experience of distributed compute, Linux, Unix, oracle, WebSphere, MQ, Java, and cloud Native Technologies like GKE, Ali-cloud, Kubernetes, Kafka, Service Mesh…
Working knowledge of infrastructure, networks, load balancers, storage – cloud & containers highly beneficial.
Experience with broad range of technologist tools e.g. Application Performance Monitoring, App Dynamics, Splunk; Automation, Puppet.
Strong understanding of core Service Management disciplines.
Excellent technical debugging and trouble shooting skills.