SRE Observability Engineer
Tata Consultancy ServicesFull Description
Inclusion without Exception:
Tata Consultancy Services (TCS) is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to create a workforce that reflects the societies we operate in. Our continued commitment to Culture and Diversity is reflected in our people stories across our workforce and implemented through equitable workplace policies and processes.
About TCS:
TCS is an IT services, consulting, and business solutions organization that has been partnering with many of the world’s largest businesses in their transformation journeys for over 55 years. Its consulting-led, cognitive-powered portfolio of business, technology, and engineering services and solutions is delivered through its unique Location Independent Agile delivery model, recognized as a benchmark of excellence in software development. A part of the Tata group, India's largest multinational business group, TCS operates in 55 countries and employs over 607,000 highly skilled individuals, including more than 10,000 in Canada. The company generated consolidated revenues of US $ 30 billion in the fiscal year ended March 31, 2025, and is listed on the BSE and the NSE in India. TCS' proactive stance on climate change and award-winning work with communities across the world have earned it a place in leading sustainability indices such as the MSCI Global Sustainability Index and the FTSE4Good Emerging Index.
Required Skill Set:
• We are looking for a Mid-Level Observability Engineer to help implement, operate, and improve observability capabilities across our applications and platforms.
• This role focuses on hands-on onboarding, instrumentation, dashboarding, and alerting, working under established standards and guidance from senior engineers.
• You will collaborate with application, SRE, and operations teams to ensure systems are observable, supportable, and production ready.
• Observability Implementation Implement and maintain metrics, logs, and traces for applications and infrastructure
• Assist with onboarding applications into observability platforms (e.g., Dynatrace, ELK, Datadog)
• Configure dashboards, alerts, and basic anomaly detection Application Support Instrumentation
• Work with development teams to enable structured logging, basic distributed tracing, and core metrics
• Validate observability requirements during Production Readiness Reviews (PRR)Troubleshoot missing or low-quality telemetry
• Monitoring Alerting Configure alerts based on golden signals (latency, errors, traffic, saturation)
• Help reduce alert noise by tuning thresholds and alert logic
• Support incident response by gathering logs, metrics, and traces Operations Reliability Support root cause analysis using observability tools
• Maintain dashboards and documentation used by on-call and support teams
• Participate in on-call rotations (as applicable) Automation Continuous Improvement Assist in automating observability onboarding and validation tasks
• Create and maintain reusable dashboards and alert templates
• Follow established observability standards and best practices
• Required Qualifications Good years of experience in Observability, or SRE
• Working knowledge of metrics, logs, and basic tracing concepts
• Hands-on experience with at least one observability platform (Dynatrace, Elastic ELK, Datadog, New Relic, etc.)
• Basic understanding of SLIs SLOs and service health indicators
• Experience with cloud platforms or hybrid environments
• Ability to write scripts (Python, Bash, PowerShell) for automation and troubleshooting
• Preferred Qualifications Experience with Open Telemetry or APM agents
• Familiarity with Kubernetes or containerized workloads
• Experience working with incident management tools (PagerDuty, ServiceNow)
• Exposure to Dynatrace Kibana ELK or similar cloud-native monitoring
• Experience in regulated or enterprise environments
Salary Range - CA$ 90,000 - CA$ 120,000 Per Year
Note:
* TCS does not use artificial intelligence tools for candidate screening or evaluation.
* This posting is for a current vacancy
* The hiring process includes an initial screening by the TCS Hiring Team, followed by a technical evaluation and managerial discussion conducted by the Business Team, and concluding with the final HR evaluation.
Tata Consultancy Services Canada Inc. is committed to meeting the accessibility needs of all individuals in accordance with the Accessibility for Ontarians with Disabilities Act (AODA) and the Ontario Human Rights Code (OHRC). Should you require accommodation during the recruitment and selection process, please inform Human Resources.
Thank you for your interest in TCS. Candidates that meet the qualifications for this position will be contacted within a 2-week period. We invite you to continue to apply for other opportunities that match your profile.