Site Reliability Engineer (SRE) Level II Job at KTek Resourcing, Columbus, OH

b2pSdjN6LytidTdjMGlHMENBY1Nwa292enc9PQ==
  • KTek Resourcing
  • Columbus, OH

Job Description

Position : Site Reliability Engineer (SRE) Level II

Location : Columbus, Ohio( 3 days a week)

As a Site Reliability Engineer (SRE) Level II, you will play a key role in maintaining the availability, scalability, and performance of critical infrastructure and services. You will be responsible for building and automating solutions that enhance system reliability and support continuous delivery. In this role, you will handle more complex operational tasks and incidents, provide mentorship to junior SREs, and collaborate with development teams to ensure systems are designed for reliability from the ground up.

Incident Management :

  • Complex incidents and ensure service uptime.
  • Lead troubleshooting efforts for high-impact production issues, providing detailed root cause analysis (RCA) and preventative measures.
  • Participate in on-call rotations, acting as an escalation point for Level 1/2 SREs during major incidents.

Performance & Scalability:

  • Analyze system performance and recommend optimizations for scalability and reliability.
  • Support capacity planning efforts by monitoring system metrics, traffic patterns, and usage trends to predict future resource needs.

System Design & Architecture:

  • Collaborate with software engineering teams to influence the design of new services and applications, ensuring they are scalable, reliable, and resilient from the start.
  • Contribute to architectural decisions, ensuring alignment with best practices in fault tolerance, redundancy, and recovery.

Monitoring & Observability:

  • Build and maintain robust monitoring, alerting, and observability solutions to proactively detect and resolve issues before they impact end users.
  • Optimize existing monitoring tools (e.g., Prometheus, Grafana, Datadog, Dynatrace) and build custom dashboards for better visibility into system health.

Security & Compliance:

  • Ensure systems and infrastructure are secure, compliant, and aligned with organizational policies and industry best practices.
  • Assist with vulnerability management, system patching, and implementing security measures to protect the integrity and availability of services.

Continuous Improvement:

  • Lead efforts to continuously improve operational processes, tools, and workflows.
  • Implement and enforce best practices in deployment, monitoring, and incident management to improve overall system reliability and reduce downtime.

Basic Qualifications:

  • Bachelor’s degree in computer science, Information Technology, or a related field, or equivalent work experience.
  • 3 years of experience in site reliability engineering, DevOps, systems administration, or related roles.
  • Proven track record of managing complex infrastructure, troubleshooting production issues, and optimizing system performance

Preferred Qualifications:

  • Strong experience with Linux/Unix administration and proficiency in scripting (e.g., Python, Bash, Go).
  • 3+ years knowledge of databases – Oracle / MS-SQL Server / DB2
  • 5 years of experience in site reliability engineering, DevOps, systems administration, or related roles.
  • Experience with containerization and orchestration technologies like Docker and Kubernetes.
  • Proficiency with monitoring and observability tools such as Dynatrace, Prometheus, Grafana, Datadog, ELK Stack, or similar platforms.
  • Strong understanding of networking fundamentals (DNS, TCP/IP), load balancing, and CDNs.
  • Experience with CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure automation (Terraform, Ansible, Puppet).
  • Familiarity with distributed systems and microservices architecture.
  • Excellent problem-solving and troubleshooting skills, especially in diagnosing production issues in high-scale environments.
  • Microsoft Office experience
  • Experience working in multi-platform environment
  • Ability to balance both development and support roles
  • Experience in working on projects that involve business segments
  • Strong analytical, strong troubleshooting skills and excellent communication skills
  • Strong interpersonal skills, focus on customer service, and the ability to work well with other IT, vendor, and business groups

Job Tags

Work experience placement, 3 days per week,

Similar Jobs

Ardor Health Solutions

Travel MRI Technologist (Siemens scanners) - $2,705 per week Job at Ardor Health Solutions

Ardor Health Solutions is seeking a travel MRI Technologist for a travel job in Cleveland, Ohio. Job Description & Requirements ~ Specialty: MRI Technologist ~ Discipline: Allied Health Professional ~ Start Date: 07/07/2025~ Duration: 26 weeks ~40 hours ...

Hillcrest Health Services

Weekend Receptionist Job at Hillcrest Health Services

Part Time Concierge Saturday & Sunday 9:30am-6pm Hillcrest Health & Living is looking for an experienced Concierge that excels at being a team player while utilizing their great phone/computer skills at our Hillcrest Firethorn community! The Concierge is responsible...

HNTB

Environmental Monitor/Wetland Protection Specialist Job at HNTB

 ...permitting including, but not limited to: Section 404, Section 401, Wetlands Protection Act, EPA Construction General Permit, and Conservation and Management Permits Provide technical expertise, oversight, and quality control for projects in construction with respect... 

Supplemental Health Care

Travel LPN / LVN - Skilled Nursing Facility (SNF) Long Term Care - $1,379 per week - Urgently Hiring Job at Supplemental Health Care

Supplemental Health Care is seeking a LPN / LVN Skilled Nursing Facility (SNF) Long Term Care for a travel job in Terra Alta, West Virginia. Job Description & Requirements ~ Specialty: Long Term Care ~ Discipline: LPN / LVN ~ Start Date: ASAP ~ Duration: ...

Host Healthcare

Travel Cath Lab Technician - $2,434 per week Job at Host Healthcare

 ...Host Healthcare is seeking a travel Cath Lab Technologist for a travel job in Milwaukee, Wisconsin. Job Description & Requirements ~ Specialty: Cath Lab Technologist ~ Discipline: Allied Health Professional ~ Start Date: 06/10/2025~ Duration: 13 weeks ~4...