Create Email Alert

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

Stitch.money

Site reliability engineer

,
- Ending Soon
Role The Platform Engineer helps build infrastructure as code into the deployment processes in order to ensure Stitch can scale fast while strengthening reliability and security. Division: Product & Engineering Department Responsibilities Work closely with engineering to improve development and deployment processes Secure, monitor, and maintain
Job Source: Stitch.money
Telecom North America

Site reliability engineer

,
Telna provides Mobile Networks, CSPs and OEMs with a managed global network infrastructure for cellular connectivity. Telna has the largest LTE and LTE-M footprint in the world. Its multi-network platform enables simplified billing and localization, utilizing 6+ telco pops globally. Telna’s Cronus connectivity platform allows instant access to its
Job Source: Telecom North America
Mentormate

Senior site reliability engineer - contractor

,
- Ending Soon
Mentor Mate creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company has completed over 1,500 projects and has global technological hubs in Europe and North and South America. With mature and established practices in enterprise web a
Job Source: Mentormate
Gitlab

Senior site reliability engineer, observability, emea

,
Senior Site Reliability Engineer, Observability, EMEASite Reliability Engineers (SREs) are responsible for keeping all user-facing services and other Git Lab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to
Job Source: Gitlab
Tumaini

Reliability engineer

, KwaZulu-Natal
- Ending Soon
Job Description A wonderful opportunity is available for a Reliability Engineer at a leading sugar manufacturing company based in Kwa Zulu-Natal. Requirements: Degree in Electrical or Mechanical Engineering Government Certificate of Competency (GCC) Factories is advantageous Must be experienced in asset care management, maintenance planning - reli
Job Source: Tumaini
E-merge

Site reliability engineer - pretoria/ hybrid - r800k per annum

,
- Ending Soon
Are you a seasoned Site Reliability Engineer with a passion for solving complex challenges and delivering cutting-edge software solutions? If you have expertise in cloud-native technologies and fluency in Java, Java Script, Golang, or Python, we want you on our team! Apply Now!!Requirements:3+ years of experience as a Dev Ops or site reliability en
Job Source: E-merge
Goldman Tech Resourcing

Site engineer

Durban, KwaZulu-Natal
- Ending Soon
These jobs were popular with other job seekers Redheads Engineering Solutions (Pty) Ltd Redheads Engineering Solutions (Pty) Ltd We have an amazing opportunity for a Site Engineer based in KZN, Durban Key Requirements: BSc Degree or relevant qualification in Civil Engineering. Registration with ECSA/SACPCMP. Minimum 5 years of experien
Job Source: Goldman Tech Resourcing
Easy Recruit SA

site engineer

Durban
- Ending Soon
bachelor in civil engineering 10 years experience construction supervision experience
Job Source: Easy Recruit SA

Senior site reliability engineer

R 400000 ZA Per annum

We are Quadcode, a fintech company excelling in financial brokerage activities and delivering advanced financial products to our global clientele. Our flagship product, an internal trading platform, is offered as a Software-as-a-Service (Saa S) solution to other brokers.

We are currently looking for Senior SRE to join our Service Desk team. The main area of responsibility is the team's oversight of ITSM processes within the company. They are involved in the development of monitoring tools and also monitor the status of our product.

The team has 5 System Engineers, 2 Technical Support Specialists, and a Team Leader.

We have more than 600 servers and more than 2000 virtual machines. We have as own infrastructure, as private and public clouds (Openstack, AWS, GCP, DO), and bare metal. Our trading platform has more than 80 million users.

Working with Agile, Scrum (1–2-week sprints, grooming, planning, retrospective), and SAFe framework.

Daily scrum standups are held at 12:30 pm EET (Cyprus time zone), engaging in peer code reviews, and using collaboration tools like Slack, Google Meet, and Zoom.

You will be responsible for building ITSM processes and applying their experience as a Site Reliability Engineer. You will have interactions with the product teams and IT Operations branch (Software Development, Infrastructure etc.).

Tech stack

OS: Linux Ubuntu; Web server: Nginx; Monitoring: Grafana, Prometheus, Graylog, Jaeger; CI/CD: Jenkins, Git, Gitlab, Docker; Automation: Python, Bash; SCM: Ansible, Chef; Ia C: Terraform. Pulumi; DB: Postgre SQL, Redis, Keydb, My SQL; Cloud: Openstack, AWS, GCP, DO.

Examples of first tasks in the role:

Review processes, platform and infrastructure; Implementation of Grafana On Call; Review and rework ITSM processes if needed.

Responsibilities in the role:

Identification of bottlenecks and preparation of recommendations to improve the reliability of services; Responding to platform emergencies, localizing and resolving the causes of failures, compiling postmortem reports; Development of monitoring and alerting tools ensuring high availability and quick detection of potential issues: (Grafana, Grafana On Call, Prometheus Alert manager, etc.); Active participation in change management processes, including assessment and coordination of changes to the infrastructure within Change Advisory Board (CAB) sessions; Implementation and support of ITSM processes to optimize team workflow and enhance service quality. Development and maintenance of documentation in an up-to-date state.

Requirements:

3+ years of experience in SRE/Dev Ops; Understanding of SRE principles, practical experience in implementing SRE practices; Understanding of principles and practical experience in building resilient systems; Experience with monitoring and logging systems (Prometheus, Graylog, Grafana). Experience with automation tools for software build and deployment (CI/CD): Git Lab, Jenkins; Understanding of virtualization and containerization principles; Understanding of Infrastructure as Code (Ia C) approaches and experience; Proficiency in a programming language for automation script development (Python, Nodejs, Golang, etc.), ability to understand service code; Understanding of network protocols, topologies, and network models; Experience with configuration management tools: Ansible, Chef; Basic experience with relational databases, such as Postgre SQL; Experience in administering Linux operating systems; Fluency in English and Russian (B2 minimum).

As an advantage:

Experience in implementing monitoring and logging systems from scratch; Experience with k8s, Openstack; Advanced programming skills in any language.

We offer full-time remote work as a Service Provider in the following countries:

Bulgaria, Georgia, Belarus, Hungary, Romania, Latvia, Lithuania, Moldova, Azerbaijan, Armenia, Kyrgyzstan, Greece, Croatia, Montenegro, Serbia, or Estonia (a residence permit is a must, except for Georgia).

Currently, over 700 employees and service providers are stationed across its seven global offices located in the UK, Gibraltar, the UAE, the Bahamas, Australia, and the headquarters in Cyprus. By broadening its international presence, Quadcode not only offers a remote or hybrid work model but also presents a myriad of intriguing tasks and challenges for employees. Join us today, and let's shape the future of fintech together!

Note: All applications will be treated with strict confidence. We thank all applicants for their interest, however only those candidates selected for interviews will be contacted.

#LI-JM1 #LI-Remote

#J-18808-Ljbffr

Name	Expiration	Description
ATTBCookie*	2 years	These cookies are used to remember a user’s choice about cookies on thebigjobsite.com. Where users have previously indicated a preference, that user’s preference will be stored in these cookies.
last-search search redirect-stage original-keyword	1 day Session 1 hour 1 hour	These cookies are used by thebigjobsite.com to pass search data between our own pages.
datadome	1 year	DataDome is a cybersecurity solution to detect bot activity
jjap	1 days	Used to track if you have seen the Job Alerts prompt. Job Alerts is a service you can subscribe to to receive information about new jobs.

What job

... and where?

Similar Jobs

Site reliability engineer

Site reliability engineer

Senior site reliability engineer - contractor

Senior site reliability engineer, observability, emea

Reliability engineer

Site reliability engineer - pretoria/ hybrid - r800k per annum

Site engineer

site engineer

Senior site reliability engineer

Share this job

Create Email Alert