Senior Site Reliability Engineer (SRE) - Cloud & Distributed Systems Job at Dutech Systems, inc, Austin, TX

  • Dutech Systems, inc
  • Austin, TX

Job Description

Skills:

SRE, DevOps, AWS, GCP, Kubernetes, Docker, Python, Go, Linux, Distributed Systems, Monitoring, Logging, SLIs, SLOs, CI/CD, Observability

We are seeking an experienced Senior Site Reliability Engineer (SRE) to design, build, and operate highly scalable and reliable cloud-based systems. The ideal candidate will have a strong background in DevOps, distributed systems, and cloud infrastructure, with a focus on automation, observability, and system reliability.

This role involves working in a fast-paced environment to ensure system uptime, performance, and operational excellence.

Key Responsibilities:

  • Design, implement, and manage highly available, distributed systems
  • Maintain and optimize cloud infrastructure (AWS/GCP)
  • Develop automation scripts using Python, Go, Java, or Bash
  • Manage containerized environments using Docker and Kubernetes
  • Define and monitor SLIs, SLOs, and error budgets
  • Implement monitoring, logging, and alerting solutions
  • Lead incident management, root cause analysis (RCA), and postmortems
  • Ensure system security and compliance within operational workflows
  • Improve system reliability through performance tuning and optimization
  • Collaborate with engineering teams to enhance deployment and release processes
  • Create and maintain runbooks, dashboards, and operational documentation

Required Qualifications:

  • 8+ years of experience in SRE, DevOps, or Systems Engineering
  • Strong expertise in Linux/Unix systems and system internals
  • Proficiency in at least one programming/scripting language (Python, Go, Java, Bash)
  • Experience designing and operating distributed systems
  • Hands-on experience with cloud platforms (AWS or GCP)
  • Experience with Docker and Kubernetes
  • Strong understanding of monitoring, alerting, and logging concepts
  • Experience managing SLIs, SLOs, and error budgets
  • Experience with incident management and RCA processes

Preferred Qualifications:

  • Experience with observability tools (Prometheus, Grafana, Datadog, Splunk, Application Insights)
  • Experience supporting 24x7 production environments and on-call rotations
  • Knowledge of chaos engineering and resiliency testing
  • Experience with canary deployments, feature flags, and progressive delivery
  • Strong documentation and communication skills

Job Tags

Contract work
