Our client represents the connected world, offering innovative and customer-centric information technology experiences, enabling Enterprises, Associates, and Society to Rise™.
They are a USD 6 billion company with 163,000+ professionals across 90 countries, helping 1279 global customers, including Fortune 500 companies. They focus on leveraging next-generation technologies, including 5G, Blockchain, Metaverse, Quantum Computing, Cybersecurity, Artificial Intelligence, and more, to enable end-to-end digital transformation for global customers.
Our client is one of the fastest-growing brands and among the top 7 IT service providers globally. Our client has consistently emerged as a leader in sustainability and is recognized among the ‘2021 Global 100 Most Sustainable Corporations in the World’ by Corporate Knights.
We are currently searching for a Site Reliability Engineer (SRE):
Responsibilities:
- Develop and maintain systems and services for the Cash Management Platform, ensuring scalability and reliability.
- Automate deployment processes and implement preventive measures proactively.
- Design and implement dashboards for real-time insights into platform key metrics.
- Collaborate with software developers, DevOps, and infrastructure engineers to ensure robust software development and operations integration.
- Optimize on-call rotations and incident management processes, ensuring incidents are resolved within SLAs.
- Document alarms in Knowledge Base Articles and conduct post-incident reviews to assess platform status.
Requirements:
- Bachelor’s degree in computer science or equivalent experience in SRE, automation, or development roles.
- 7+ years of experience in Site Reliability Engineering or related positions, preferably on major cloud platforms.
- Expertise in automating multi-tenant systems, particularly in cloud environments.
- Strong knowledge of SRE philosophies, tools, and practices, including SLO management and incident resolution.
- Proficiency in Infrastructure-as-Code tools and practices.
- Hands-on experience with Docker, Kubernetes, and networking concepts.
- Experience with monitoring tools like Grafana, Prometheus, Dynatrace, and Splunk.
- Familiarity with integration tools such as PagerDuty, ServiceNow, and Datadog.
- Excellent communication skills, including the ability to explain technical concepts to non-technical stakeholders.
Desired:
- Advanced proficiency in system performance monitoring tools and automation techniques.
- Experience with additional cloud platforms and microservices architectures.
Languages:
- Advanced spoken English.
- Native Spanish.
Note:
- On-site in CDMX, Monterrey, and Guadalajara.
If you meet these qualifications and are pursuing new challenges, start your application to join an award-winning employer.