Senior DevOps Engineer at Thelix Holdings

Posted on Wed 03rd Jun, 2026 - www.hotnigerianjobs.com

At Thelix Holdings, our mission is to drive transformative change across diverse industries by strategically investing in education, healthcare, financial services, and tourism. We are dedicated to creating long-term value for our stakeholders by fostering innovation, promoting sustainable growth, and empowering communities globally through impactful programs and cutting-edge solutions.

We are recruiting to fill the position below:

Job Title: Senior DevOps Engineer

Location: Lagos
Employment Type: Full-time (Hybrid)

About the Role

  • We are looking for a highly skilled and execution-driven Senior DevOps Engineer to lead the reliability, scalability, security, automation, and operational excellence of our infrastructure and engineering environments.
  • This role is critical to ensuring platform stability, deployment efficiency, infrastructure scalability, monitoring, incident response, and secure system operations across multiple products and environments.
  • The ideal candidate is hands-on, highly technical, proactive, ownership-oriented, and experienced in building production-grade infrastructure for fast-growing technology companies, preferably within Edtech, Fintech, SaaS, payments, lending, or high-availability systems.
  • You will work closely with Engineering, Product, Security, Compliance, and Infrastructure teams to design resilient systems, optimize deployment pipelines, strengthen platform security, and improve operational efficiency.

Key Responsibilities
Infrastructure & Cloud Management:

  • Design, deploy, manage, and optimize cloud infrastructure environments
  • Maintain scalable, secure, and highly available production systems
  • Manage infrastructure across AWS, GCP, Azure, or hybrid environments
  • Implement infrastructure-as-code (IaC) practices using tools like Terraform or CloudFormation
  • Optimize cloud resource usage, performance, and cost efficiency
  • Ensure redundancy, disaster recovery, and business continuity readiness

CI/CD & Deployment Automation:

  • Build and maintain robust CI/CD pipelines for rapid and reliable deployments
  • Automate build, testing, release, and deployment workflows
  • Improve deployment reliability, rollback strategies, and release monitoring
  • Reduce manual operational tasks through scripting and automation
  • Collaborate with engineering teams to improve release velocity and developer productivity

Monitoring, Reliability & Incident Management:

  • Establish proactive monitoring, alerting, and observability systems
  • Manage uptime, system performance, latency, and service reliability
  • Lead incident response, root cause analysis, and postmortem reviews
  • Implement logging and monitoring solutions using tools such as:
    • Grafana
    • Prometheus
    • Datadog
    • ELK Stack
    • New Relic
    • CloudWatch
  • Drive operational excellence and reduce production incidents

Security & Compliance:

  • Implement DevSecOps best practices across infrastructure and deployments
  • Strengthen platform security, secrets management, and access controls
  • Manage vulnerability scanning, patching, and infrastructure hardening
  • Ensure infrastructure aligns with compliance and regulatory requirements
  • Support security audits, penetration testing remediation, and compliance reviews
  • Enforce least-privilege access and secure authentication practices

Containers & Orchestration:

  • Manage containerized environments using Docker and Kubernetes
  • Optimize orchestration, scaling, networking, and workload reliability
  • Maintain production-grade Kubernetes clusters and deployment environments
  • Implement autoscaling, failover, and self-healing infrastructure capabilities

Collaboration & Technical Leadership:

  • Collaborate cross-functionally with engineering, product, QA, and security teams
  • Mentor junior DevOps and infrastructure engineers
  • Drive best practices for operational excellence and system reliability
  • Participate in architecture reviews and infrastructure planning
  • Improve internal engineering workflows and platform tooling

Required Qualifications

  • 5+ years of experience in DevOps, Site Reliability Engineering, Cloud Infrastructure, or Platform Engineering
  • Strong hands-on experience with AWS, GCP, or Azure
  • Deep understanding of CI/CD pipelines and deployment automation
  • Experience with Infrastructure as Code (Terraform, CloudFormation, Ansible, etc.)
  • Strong Linux systems administration experience
  • Experience managing Docker and Kubernetes environments
  • Strong scripting skills using Bash, Python, or similar languages
  • Experience with monitoring and logging tools
  • Strong understanding of networking, DNS, SSL, load balancing, and system security
  • Experience handling production incidents and root cause analysis
  • Familiarity with high-availability and scalable distributed systems

Preferred Qualifications

  • Experience working in edtech, fintech, payments, banking, lending, or regulated industries
  • Experience implementing DevSecOps practices
  • Familiarity with SOC 2, PCI-DSS, NDPR/GDPR, or ISO compliance requirements
  • Experience with microservices architecture
  • Experience with Kafka, Redis, RabbitMQ, or event-driven systems
  • Experience managing multi-environment deployment pipelines
  • Certifications such as:
    • AWS Certified DevOps Engineer
    • Certified Kubernetes Administrator (CKA)
    • Google Professional Cloud DevOps Engineer

Key Competencies
We are looking for someone who:

  • Takes strong ownership without constant supervision
  • Can operate effectively under pressure and during incidents
  • Thinks proactively and prevents problems before they occur
  • Communicates clearly with both technical and non-technical stakeholders
  • Balances speed, reliability, and security effectively
  • Has strong troubleshooting and systems thinking abilities
  • Can scale infrastructure alongside business growth

Success Metrics
Success in this role will be measured by:

  • Platform uptime and reliability
  • Deployment frequency and stability
  • Incident response effectiveness
  • Infrastructure scalability and cost optimization
  • Security posture improvements
  • Reduced operational bottlenecks
  • Improved engineering productivity

Nice-to-Have Experience

  • AI infrastructure deployment
  • MLOps pipelines
  • FinOps/cloud cost optimization
  • Multi-region deployment architecture
  • Zero-downtime deployments
  • Blue-green or canary release strategies
  • API gateway and service mesh experience

Application Closing Date
Not Specified.

How to Apply
Interested and qualified candidates should:
Click here to apply online