Roles & Responsibilities
Design, develop, and implement core components of the deployment and monitoring platform, including deployment pipelines, configuration management, and metrics collection.
Build a user-friendly interface for managing deployments, viewing metrics, and generating reports.
Develop advanced analytics and visualization capabilities to provide actionable insights into application performance.
Integrate the platform with existing CI/CD pipelines and infrastructure monitoring tools.
Ensure high availability, scalability, and security of the platform.
Collaborate with development and operations teams to optimize deployment processes and improve application performance.
Build robust APIs and SDKs for interacting with the platform.
Develop and maintain efficient algorithms for cluster management, load balancing, auto-scaling, and bin packing.
Collaborate with cross-functional teams to integrate the platform with other systems and services.
Conduct thorough testing, performance optimization, and security audits to ensure platform reliability, scalability, and resilience.
Stay up-to-date with industry trends and emerging technologies in the cloud computing and orchestration space.
Define strategies to provide world class support to end-users’ queries
Logical thinking and problem-solving skills along with an ability to collaborate
Act as the point of contact for escalations and handle them as per the escalation procedures documented
Take responsibility to coach, train and help team members to meet the skills required to do the job
Ready to learn and adapt to new technologies, tools / applications, processes, and escalation procedures
Participation in ad-hoc and recurring meetings
Should be able to manage a team and handle cross functional communications
Shifts:
24*5 (Rotational Monthly)
Criteria
Graduate and above in any field
10+ years of experience in application development framework and DevOps
Good interpersonal, communication (English - verbal and written) and presentation skills
Able to analyze the data, make data-driven decisions and should have an eye for detail
Strong communication and collaboration abilities.
Technical Skillsets
Strong proficiency in at least one programming language such as Go, Python, or Java.
Experience in building and deploying cloud-native applications.
Deep understanding of software development lifecycle and deployment methodologies.
Proficiency in data structures, algorithms, and system design.
Experience with time-series databases and data visualization tools.
Strong understanding of metrics, KPIs, and performance monitoring concepts.
Experience with containerization technologies (Docker, Jenkins) is a must.
Experience with Kubernetes is an added advantage.