Job Description
Summary
The NOC Administrator is responsible for ensuring maximum service availability and performance by monitoring the entire infrastructure, troubleshooting alerts and incidents, and resolving or escalating them. They are also responsible for internal communications about any activities performed on the infrastructure, as well as any unplanned downtime or degradation. The reliability and health of the monitoring platform are their responsibility as well.
Mission
The main responsibilities and routine tasks of the NOC Administrator are to:
- Monitor system events to ensure health, maximum system availability and service quality;
- Identify and apply corrective measures to prevent any type of service degradation or outage;
- Initiate the incident management process when an incident cannot be resolved;
- Perform thorough technical analysis of any incident on Microsoft, Linux, network infrastructure and in-house systems, then apply corrective measures;
- Work closely with the monitoring specialists in order to continually improve the accuracy and reliability of our monitoring systems;
- Respond to, prioritize and resolve incident tickets;
- Determine the exact impact of an incident from the end user's standpoint;
- Perform root cause analysis on ongoing or past incidents and suggest corrective or preventive measures;
- Communicate the impact of any incident or maintenance internally;
- Escalate or liaise with other teams/departments when necessary in order to resolve an incident;
- Identify opportunities to automate service restoration and implement automated responses through scripting;
- Maintain documentation regarding configuration, operation and troubleshooting procedures related to Microsoft, Linux and in-house platforms;
- Carry out all other related tasks.
Qualifications
Training
- A college degree in Network Administration, Information Systems or Computer Science, or equivalent work experience in a related field.
Relevant experience
- 3+ years of experience in an infrastructure technical support role;
- 3+ years of experience in IT operations; NOC experience is highly recommended.
Skills
- Ability to stay focused in stressful situations;
- Strong ability to communicate clearly and simply in any kind of situation;
- Must be a self-starter who requires only limited supervision or guidance;
- Ability to take initiative and find creative solutions to problems;
- Ability to assess and prioritize faults and respond or escalate accordingly;
- Ability to work efficiently during non-standard shifts (evenings, nights, weekends);
- Must be driven by challenges;
- Must be interested in learning new technologies in a fast-paced environment.
Knowledge
Must have:
- Strong knowledge of Red Hat or CentOS and comfort navigating via the CLI;
- Fundamental understanding of recent Windows Server environments (2008, 2012, 2016);
- Experience managing firewall rules;
- Familiarity with Bash scripting;
- General knowledge of monitoring solutions (such as Zabbix, Grafana, Splunk);
- Well-rounded understanding of the OSI model.
Nice to have:
- Knowledge of FortiGate;
- Experience scripting with PowerShell;
- Experience configuring network components (routers/switches) at an enterprise level;
- Knowledge of EMC and ScaleIO, SAN and NAS management;
- Advanced proficiency in Python, Ruby, Node.js or JavaScript;
- Knowledge of CDNs and Akamai.
Job category: Systems Engineer
Contact information
Work address: 3rd Floor, Building 3, G7 Park, 7 Guiqing Road, Xuhui District, Shanghai