Senior Site Reliability Engineer
Hyderabad, TG, IN, 5000019
Overview
The Zurich-based Barry Callebaut Group is the world’s leading manufacturer of chocolate and cocoa products, from sourcing and processing cocoa beans to producing the finest chocolates, including chocolate fillings, decorations and compounds. The Group runs more than 60 production facilities worldwide and employs a diverse and dedicated global workforce of more than 13,000 people. The Barry Callebaut Group serves the entire food industry, from industrial food manufacturers to artisanal and professional users of chocolate, such as chocolatiers, pastry chefs, bakers, hotels, restaurants and caterers. The global brands catering to the specific needs of these Gourmet customers are Callebaut®, Cacao Barry®, Carma® and the decorations specialist Mona Lisa®. The Barry Callebaut Group is committed to making sustainable chocolate the norm to help ensure future supplies of cocoa and improve farmer livelihoods. It supports the Cocoa Horizons Foundation in its goal to shape a sustainable cocoa and chocolate future.
Barry Callebaut Digital (BCD) is on a mission to lead the digital revolution in the chocolate industry, and we're looking for a Senior Site Reliability Engineer to support this transformative journey. Reporting directly to the Head of Cloud Management Operations, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications. You will be responsible for monitoring availability, taking a holistic view of the health of our production environment, and maintaining the configuration of cloud deployments. In addition, you will collaborate closely with development teams to design and implement robust infrastructure solutions, automate deployment processes, and maintain high availability across our cloud-based platform while those teams ship frequent updates.
MAIN RESPONSIBILITIES & SCOPE
- Ensure scalability, performance, and reliability of large-scale, cloud-based applications and infrastructure
- Establish monitoring and observability solutions and address performance bottlenecks, errors and other issues
- Develop and maintain automated deployment pipelines to facilitate seamless and efficient delivery of software updates while minimizing downtime
- Develop and implement strategies to enable zero downtime deployments
- Resolve incidents promptly to minimize service disruptions
- Create and enforce best practices and standards for the deployment and management of applications, databases, and other resources
- Work closely with cross-functional teams, including developers, DevOps engineers, and QA engineers, to drive continuous improvement and innovation
ESSENTIAL EXPERIENCE & KNOWLEDGE / TECHNICAL OR FUNCTIONAL COMPETENCIES
- Minimum of 10 years of relevant experience
- Good knowledge of IT infrastructure and cloud operations, as well as the design, implementation, and management of highly available and scalable infrastructure
- Proficiency in Azure services, Terraform, observability tools, and techniques for monitoring and troubleshooting distributed systems
- Experience with zero downtime deployment strategies and DevOps tools (e.g. Jenkins, CircleCI, GitHub)
- Independent and self-driven personality, taking responsibility and owning tasks
- Good problem-solving skills and a structured way of working
- Openness to trying and learning new technologies and skills
- Good written and verbal communication skills, including the ability to explain problems to non-technical audiences
At Barry Callebaut, we are committed to Diversity & Inclusion. United by our strong values, we thrive on the diversity of who we are, where we come from, what we’ve experienced and how we think. We are committed to nurturing an inclusive environment where people can truly be themselves, grow to their full potential and feel they belong. #oneBC - Diverse People, Sustainable Growth.