Handle incidents by investigating, analyzing and coordinating to resolve incidents within pre-defined SLAs
Leverage understanding and expertise in coding languages to investigate and resolve issues/ incidents
Collaborate with other engineers, customers and key stakeholders to determine root cause of incidents, while providing technical support and assisting with system integration when required
Execute disaster recovery plans and procedures to prevent loss of data and recover applications functionality
Be on standby for systems which requires 24/7 availability and resolving of critical incidents which require immediate response, provide information support to relevant parties upon service requests
Test applications after maintenance to ensure that application is fully functioning
Manage all production issues and engage with external vendors to provide L3 support when necessary
Manage ticketing query system, maintain and update database of incident queries, technical documents & procedures to resolve incidents
Prepare maintenance plan and upgrading schedules for the organization's applications and systems
Qualification and Experience Required :
5+ years experience in the role of a Production Reliability Engineer or software engineer
Strong understanding of multiple software applications design tools and languages
Able to perform development, testing and debugging work
Proven technical aptitude in one or more application programming domains, or “T-shaped” skills
Ability to work in an agile development environment
Skills
Root Cause
Debugging
Troubleshooting
Reliability Test
Functions
Engineering
Job Overview
Job Type:
Full-Time
Company
Arise by INFINITAS
34 active jobs
Industry:
Technology
Ready to Apply?
Submit your application now and take the next step in your career journey.