Senior site reliability engineer

R 400000 ZA Per annum

We are Quadcode, a fintech company excelling in financial brokerage activities and delivering advanced financial products to our global clientele. Our flagship product, an internal trading platform, is offered as a Software-as-a-Service (SaaS) solution to other brokers.

We are currently looking for a Senior SRE to join our Service Desk team. The team's main area of responsibility is oversight of ITSM processes within the company. The team also develops monitoring tools and monitors the status of our product.

The team has 5 System Engineers, 2 Technical Support Specialists, and a Team Leader.

We have more than 600 servers and more than 2000 virtual machines. We run our own infrastructure as well as private and public clouds (OpenStack, AWS, GCP, DO) and bare metal. Our trading platform has more than 80 million users.

We work with Agile, Scrum (1–2-week sprints, grooming, planning, retrospective), and the SAFe framework.

Daily scrum standups are held at 12:30 pm EET (Cyprus time zone). The team engages in peer code reviews and uses collaboration tools like Slack, Google Meet, and Zoom.

You will be responsible for building ITSM processes and applying your experience as a Site Reliability Engineer. You will interact with the product teams and the IT Operations branch (Software Development, Infrastructure, etc.).

Tech stack

  • OS: Linux Ubuntu
  • Web server: Nginx
  • Monitoring: Grafana, Prometheus, Graylog, Jaeger
  • CI/CD: Jenkins, Git, GitLab, Docker
  • Automation: Python, Bash
  • SCM: Ansible, Chef
  • IaC: Terraform, Pulumi
  • DB: PostgreSQL, Redis, KeyDB, MySQL
  • Cloud: OpenStack, AWS, GCP, DO

Examples of first tasks in the role:

  • Review processes, platform, and infrastructure
  • Implementation of Grafana OnCall
  • Review and rework ITSM processes if needed

Responsibilities in the role:

  • Identification of bottlenecks and preparation of recommendations to improve the reliability of services
  • Responding to platform emergencies, localizing and resolving the causes of failures, compiling postmortem reports
  • Development of monitoring and alerting tools that ensure high availability and quick detection of potential issues (Grafana, Grafana OnCall, Prometheus Alertmanager, etc.)
  • Active participation in change management processes, including assessment and coordination of changes to the infrastructure within Change Advisory Board (CAB) sessions
  • Implementation and support of ITSM processes to optimize team workflow and enhance service quality
  • Development of documentation and keeping it up to date

Requirements:

  • 3+ years of experience in SRE/DevOps
  • Understanding of SRE principles and practical experience in implementing SRE practices
  • Understanding of the principles of building resilient systems and practical experience in doing so
  • Experience with monitoring and logging systems (Prometheus, Graylog, Grafana)
  • Experience with automation tools for software build and deployment (CI/CD): GitLab, Jenkins
  • Understanding of virtualization and containerization principles
  • Understanding of and experience with Infrastructure as Code (IaC) approaches
  • Proficiency in a programming language for automation script development (Python, Node.js, Golang, etc.) and the ability to understand service code
  • Understanding of network protocols, topologies, and network models
  • Experience with configuration management tools: Ansible, Chef
  • Basic experience with relational databases, such as PostgreSQL
  • Experience in administering Linux operating systems
  • Fluency in English and Russian (B2 minimum)

As an advantage:

  • Experience in implementing monitoring and logging systems from scratch
  • Experience with k8s, OpenStack
  • Advanced programming skills in any language

We offer full-time remote work as a Service Provider in the following countries:

Bulgaria, Georgia, Belarus, Hungary, Romania, Latvia, Lithuania, Moldova, Azerbaijan, Armenia, Kyrgyzstan, Greece, Croatia, Montenegro, Serbia, or Estonia (a residence permit is a must, except for Georgia).

Currently, over 700 employees and service providers are stationed across Quadcode's seven global offices located in the UK, Gibraltar, the UAE, the Bahamas, Australia, and the headquarters in Cyprus. By broadening its international presence, Quadcode not only offers a remote or hybrid work model but also presents a wide range of interesting tasks and challenges for employees. Join us today, and let's shape the future of fintech together!

Note: All applications will be treated with strict confidence. We thank all applicants for their interest; however, only those candidates selected for interviews will be contacted.
