Location
Mexico City
Posted
June 08, 2026
Commute
Local Area
Job Description
Overview
The role is responsible for eyes-on-glass monitoring, triage and incident ownership, troubleshooting and restoration, cross-team collaboration, platform and application stack awareness, and service quality and process excellence.
Responsibilities
Perform rapid intake, triage, and prioritization of alerts, tickets, and incidents.
Act as Incident Owner during high-severity events, ensuring clear communication, timely updates, and swift restoration of service.
Maintain accurate, real-time incident timelines and post-incident documentation.
Execute root-cause isolation across application, middleware, API, data, and infrastructure layers.
Use observability and monitoring tools (e.g., Kibana, Dynatrace, CloudWatch, Grafana) to correlate logs, metrics, and traces and to identify anomalies, performance bottlenecks, and failure patterns.
Perform targeted mitigations, rollbacks, and config fixes, and coordinate hotfixes to restore service quickly.
Engage with App Dev, DevOps, Database, Network, Security,...