
Job Information
IBM Site Reliability Engineer in BANGALORE, India
Introduction
At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in AI, analytics, security, commerce, and quantum computing and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of enterprise cloud computing. We are looking for a Site Reliability Engineer to join our IBM Cloud VPC Observability team. This team is dedicated to ensuring that IBM Cloud is at the forefront of reliable enterprise cloud technology. We are building platforms to deliver performance, reliability and predictability for our customers' most demanding workloads, at global scale and with leadership efficiency, resiliency and security. If you are someone who wants to know what it takes to build a scalable, secure and reliable service and want to grow your technical depth in all aspects of Site Reliability Engineer including security, monitoring, automation, development, infrastructure, self-healing, troubleshooting and are a go-getter with an ownership mindset, we may be the right team for you.
Your role and responsibilities
*
Implement and administrate infrastructure and solutions that support our team
*
Work in a Kubernetes based micro services environment to support our leading edge cloud services. This will include custom solutions, as well as open source DevOps tools (build and deploy automation, monitoring and data gathering for our software delivery pipeline)
*
Contribute to our continuous improvement and continuous delivery while increasing maturity of DevOps and our agile adoption practices
*
Support the compliance and security integrity of the environment through your work
*
Partner with other teams, functional managers and program managers to deliver mission-critical services
*
Support development of new and enhanced pipeline capabilities
*
Adopt and build on automation solutions governed by SRE principles including CI CD pipelines, configuration management, immutable infrastructure deployment, auto healing systems etc.
*
Provide technical escalation support
*
Conceptualize, Design, implement, manage and create a reliable, highly performant, scalable automation solutions that can build consistency across our infrastructure
*
Work with and adopt open source technologies as well as participate in new IBM innovations across IaaS
*
A self-driven attitude to propose, test and implement solutions and improvements for review and consideration with your peers
Required technical and professional expertise
Over 7 years of hands-on experience with programming languages such as Go, Python, Bash Scripting
Familiarity with using Jenkins / Tekton for CI and ArgoCD / Jenkins for CD
Knowledge of security tools, including static code and dynamic code analysis, vulnerability scanners, and intrusion detection/prevention systems.
Experience on Containers and Container Orchestration tools such as Docker and Kubernetes.
Delivering micro services reliable at scale with horizontal pod scaling etc.
Familiarity with cloud platforms and their orchestration using IAC tools like Terraform and Ansible and automated deployment using Helm
Container performance and security
Familiarity with automation using scripting languages like Python
7+ years working with designing, developing and deploying software with Cloud technologies like AWS, Azure, IBM Cloud or GCP.
Understanding of secure principles
Understanding of version control systems like Git and artifact management tools such as JFrog Artifactory.
5+ years experience with Monitoring technologies: Sydig, Grafana, ELK, Mimir, Zabbix etc.
Release Engineering (Git Branching, versioning, tagging)
Experience with Agile software development
Preferred technical and professional experience
*
Familiarity with Open Telemetry concepts, Tracing, Metrics, Events and other Observability principles
*
Familiarity with using Grafana stack like Mimir, Grafana Alloy agent etc.
*
CI CD implementation experience using Tekton and ArgoCD
*
Expertise with end to end infrastructure automation using Python, Terraform, Ansible
*
Familiar with adopting secure practices and processes
*
Familiar with Linux systems and troubleshooting on them
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.