Careers at PANTHERx Rare Pharmacy

Site Reliability Engineer

Full Time
11 Oct 2025
Pittsburgh

About the job

The Site Reliability Engineer (SRE) will lead the implementation and management of observability, monitoring, and reliability practices across our hybrid infrastructure. This role requires hands-on expertise with Datadog or similar observability platforms, strong Azure administration skills, and a deep understanding of incident response and system performance. The SRE will work closely with Infrastructure, Support, and Application teams to ensure high availability and operational excellence across on-prem and cloud environments.

• Designs, implements, and manages observability solutions using Datadog or equivalent platforms.
• Develops and maintains monitoring dashboards, alerts, and telemetry pipelines for critical systems.
• Leads incident response efforts, including root cause analysis and postmortem documentation.
• Collaborates with Infrastructure and Application teams to improve system reliability and performance.
• Supports Azure administration tasks including resource monitoring, performance tuning, and cost optimization.
• Defines and enforces best practices for system health, uptime, and scalability.
• Contributes to automation of operational tasks and reliability improvements.
• Documents observability standards, incident workflows, and operational runbooks.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or equivalent.
  • Minimum of five (5) years of experience in Site Reliability Engineering, Infrastructure Monitoring, or DevOps.
  • Proficiency with Datadog or similar observability platforms (e.g., Prometheus, New Relic, Splunk).
  • Strong Azure administration experience including monitoring, resource management, and automation.
  • Solid understanding of on-prem infrastructure and hybrid cloud environments.
  • Experience with incident response, root cause analysis (RCA), and operational documentation.
  • Strong scripting skills (e.g., PowerShell, Python) for automation and integration.
  • Excellent communication and collaboration skills across technical teams.



