Responsibilities:
Oversee and manage the network operations center (NOC), ensuring 24/7 reliability and performance.
Lead and mentor a team of NOC professionals, providing technical guidance and escalation support
Develop and implement quality assurance initiatives to enhance platform and service reliability, optimizing availability and performance.
Assume the role of Incident Manager as needed, assessing the impact on customers, coordinating with the appropriate teams, and ensuring incidents are resolved efficiently with minimal disruption.
Collaborate cross-functionally to Work in tandem with Level 1 (Customer Support) and both Development and Operations teams to provide support and share knowledge. Focus on quickly escalating and resolving critical issues while ensuring feedback is integrated into product development.
Campaign Manager
Plan system capacity and coordinate scaling schedules to ensure readiness during campaign peaks.
Ensure all monitoring dashboards, alerts, and end-to-end tracing tools are fully set up and validated.
Organize manpower planning to provide 24/7 on-call and on-site support throughout the campaign period.
Conduct campaign briefings, run rehearsals, and define internal/external communication channels.
Prepare and align all teams on contingency plans and customer communication procedures in case of incidents.
Manage welfare arrangements including hotel bookings, food, venue setup, and expense reimbursements for the campaign team.
Qualifications:
Bachelor’s degree in Information Technology, Computer Science, or related field (or equivalent work experience).
Proven experience (5+ years) in network operations or NOC environments, with hands-on experience in network monitoring, incident management, and troubleshooting.
Strong leadership and interpersonal skills, with the ability to motivate and mentor a team in a fast-paced environment.
Ability to use observability tools such as Datadog, Grafana, Elasticsearch, and Kibana, as well as cloud services like AWS
Good command in English both spoken and written
Preferred qualifications:
Experience with incident management and ticketing systems (e.g., ServiceNow, JIRA) to track and manage incidents.
Understanding of ITIL framework and incident management best practices.
Strong knowledge of networking principles, protocols, and technologies (TCP/IP, DNS, DHCP, VPN, routing, switching, etc.).- Familiarity with Linux/Unix and Windows operating systems, including command-line troubleshooting.
Knowledge of scripting or automation using languages like Python, Bash, or PowerShell is a plus.
ทักษะ
สายงาน
Full-time
บริษัท
16 ตำแหน่งงานที่เปิดรับ
Singapore
อุตสาหกรรม:
พร้อมที่จะสมัครแล้ว?
ส่งใบสมัครของคุณตอนนี้เพื่อก้าวต่อไปในอาชีพของคุณ
ตำแหน่งงานที่คล้ายกัน