Site Reliability Engineer

  • Hyderabad
  • Insight Global
Required Skills & Experience
  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 3+ years of experience in Systems Engineering or Site Reliability Engineering.
  • Strong proficiency in GoLang programming.
  • Experience with Red Hat OpenShift and container technologies (Docker, Kubernetes).
  • Understanding of cloud platforms (AWS, Azure, GCP).
  • Monitoring using Prometheus and Grafana.
  • Experience with Linux system administration.
  • Knowledge of scripting languages (Bash, Python).
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and interpersonal skills.
  • Ability to work independently and as part of a team.

Nice to Have Skills & Experience
  • Certifications in Red Hat OpenShift or related technologies.
  • Experience with DevOps practices and tools (Git, Jenkins, CI/CD pipelines).
  • Knowledge of security best practices and compliance frameworks (ISO 27001, PCI DSS).
  • Contributions to open-source GoLang projects.

Job Description
As a Red Hat OpenShift Site Reliability Engineer with GoLang, you will play a crucial role in ensuring the reliability, performance, and security of our critical Red Hat OpenShift platform. You will use your strong GoLang skills to develop and maintain tools and automation solutions that streamline infrastructure management and incident response.

Responsibilities:

Infrastructure Automation:
  • Develop and maintain GoLang-based tools and scripts to automate infrastructure provisioning, configuration, and management tasks.
  • Integrate automation solutions with Red Hat OpenShift and other cloud platforms.
  • Optimize automation workflows for efficiency and scalability.

Testing:
  • Ensure thorough code coverage across a wide range of scenarios and edge cases.
  • Perform end-to-end testing.
  • Write detailed test cases.
  • Test the integration between different components.

Incident Response:
  • Contribute to incident response efforts, using GoLang to develop tools for diagnosing and resolving issues.
  • Automate repetitive tasks and streamline incident workflows.
  • Analyze incident data to identify root causes and implement preventive measures.

Monitoring and Alerting:
  • Identify metrics that give visibility into components.
  • Develop and maintain GoLang-based monitoring and alerting systems to proactively identify and address infrastructure issues.
  • Integrate monitoring tools with Red Hat OpenShift and other systems.
  • Create custom alerts and dashboards to visualize key performance indicators.

Performance Optimization:
  • Use GoLang to analyze and optimize infrastructure performance, identifying bottlenecks and areas for improvement.
  • Develop tools to measure and benchmark application and infrastructure performance.

Security:
  • Contribute to security initiatives by developing GoLang-based tools for vulnerability scanning, intrusion detection, and compliance enforcement.
  • Implement security best practices and automate security tasks.