We offer
The Cloud Engineering unit is looking for Software Engineers to join its SRE team and apply software development principles and practices to run our infrastructure effectively and solve difficult operations problems efficiently, and in doing so continuously meet or improve our customer experience. As an SRE, you will evangelize reliability, build tools and implement reliability priorities across infrastructure, product, data and DevOps. The team will play a significant role in building capabilities by productising observability, automation, integrations, performance, DevOps and much more related to the high availability and reliability of services, partnering with multiple teams to help establish and drive a reliability engineering culture. You will apply SRE principles to maintain and improve the performance of our services in both on-premise and cloud environments, thereby balancing the priorities between Perform (production on-premise) and Transform (the move towards cloud).
Key responsibilities
Playing a pivotal role in connecting Technology, Products, Process and End Users, your key responsibilities will include:
• Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement
• Develop and implement capabilities around observability, status pages, chaos engineering, toil reduction, deployment methods and much more, from the ground up
• Evangelize, prioritize and implement operational requirements for high availability, fault tolerance and other resiliency patterns needed for a delightful customer experience, using SRE principles
• Manage and optimize the infrastructure and monitoring ecosystem. This includes leveraging the technologies defined by Enterprise Architecture and applying your development skills to improve the performance of the operating environments
• Lead triaging of complex issues at enterprise level and manage stakeholders’ expectations during incidents and escalations, while troubleshooting and proactively driving problem management through blameless postmortems and owning some of the resulting RCA actions. Participate in on-call rotations as needed
• Own and support both production run and transformation initiatives to push services towards best-in-class deployment and delivery methodologies, leveraging technologies across on-premise and cloud
• Collaborate with technology and other stakeholders to set Service Level Objectives (SLOs) and maintain Service Level Indicators (SLIs) that are representative of our customer experience and committed SLAs
• Exploit data, innovate through data engineering and derive insights to improve operations, engineering enhancements and feature development
We are looking for
We are looking for a passionate engineer with strong experience in enterprise-grade environments: a self-starter and continuous learner who thinks from first principles and can make independent, analytical decisions to navigate ambiguity. You have "been there and done that" and lead by example. An ideal candidate will bring:
• 4-6 years in an SRE role, with 10+ years of overall experience in heterogeneous environments (data center and cloud, preferably Azure)
• Deep expertise in one or more areas of full-stack development and production run, with experience evangelizing open-source technologies and applying design of experiments or proofs of concept
• Experience working with various software development and production run architectures, including monolithic, SOA, microservices and distributed systems. Experience in transformation journeys is a strong plus
• Hands-on experience developing, implementing and integrating various alerting and monitoring tools to create enterprise observability
• Hands-on experience writing high-quality code in one or two programming languages used in modern software engineering environments
• Good breadth of operational experience, with an eye for software design, performance, scalability and automation, and involvement in debugging major problems across the production stack
• Excellent communication skills and ability to explain complex technical issues to both technical and non-technical audiences