At Thelix Holdings, our mission is to drive transformative change across diverse industries by strategically investing in education, healthcare, financial services, and tourism. We are dedicated to creating long-term value for our stakeholders by ensuring innovation, promoting sustainable growth, and empowering communities globally through impactful programs and cutting-edge solutions.
We are recruiting to fill the position below:
Job Title: Senior DevOps Engineer
Location: Lagos
Employment Type: Full-time (Hybrid)
About the Role
- We are looking for a highly skilled and execution-driven Senior DevOps Engineer to lead the reliability, scalability, security, automation, and operational excellence of our infrastructure and engineering environments.
- This role is critical to ensuring platform stability, deployment efficiency, infrastructure scalability, monitoring, incident response, and secure system operations across multiple products and environments.
- The ideal candidate is hands-on, highly technical, proactive, ownership oriented, and experienced in building production-grade infrastructure for fast-growing technology companies, preferably within Edtech, Fintech, SaaS, payments, lending, or high-availability systems.
- You will work closely with Engineering, Product, Security, Compliance, and Infrastructure teams to design resilient systems, optimize deployment pipelines, strengthen platform security, and improve operational efficiency.
Key Responsibilities
Infrastructure & Cloud Management:
- Design, deploy, manage, and optimize cloud infrastructure environments
- Maintain scalable, secure, and highly available production systems
- Manage infrastructure across AWS, GCP, Azure, or hybrid environments
- Implement infrastructure-as-code (IaC) practices using tools like Terraform or CloudFormation
- Optimize cloud resource usage, performance, and cost efficiency
- Ensure redundancy, disaster recovery, and business continuity readiness
CI/CD & Deployment Automation:
- Build and maintain robust CI/CD pipelines for rapid and reliable deployments
- Automate build, testing, release, and deployment workflows
- Improve deployment reliability, rollback strategies, and release monitoring
- Reduce manual operational tasks through scripting and automation
- Collaborate with engineering teams to improve release velocity and developer productivity
Monitoring, Reliability & Incident Management:
- Establish proactive monitoring, alerting, and observability systems
- Manage uptime, system performance, latency, and service reliability
- Lead incident response, root cause analysis, and postmortem reviews
- Implement logging and monitoring solutions using tools such as:
- Grafana
- Prometheus
- Datadog
- ELK Stack
- New Relic
- CloudWatch
- Drive operational excellence and reduce production incidents
Security & Compliance:
- Implement DevSecOps best practices across infrastructure and deployments.
- Strengthen platform security, secrets management, and access controls.
- Manage vulnerability scanning, patching, and infrastructure hardening.
- Ensure infrastructure aligns with compliance and regulatory requirements.
- Support security audits, penetration testing remediation, and compliance reviews.
- Enforce least-privilege access and secure authentication practices.
Containers & Orchestration:
- Manage containerized environments using Docker and Kubernetes.
- Optimize orchestration, scaling, networking, and workload reliability.
- Maintain production-grade Kubernetes clusters and deployment environments.
- Implement autoscaling, failover, and self-healing infrastructure capabilities.
Collaboration & Technical Leadership:
- Collaborate cross-functionally with engineering, product, QA, and security teams.
- Mentor junior DevOps and infrastructure engineers.
- Drive best practices for operational excellence and system reliability.
- Participate in architecture reviews and infrastructure planning.
- Improve internal engineering workflows and platform tooling.
Required Qualifications
- 5+ years of experience in DevOps, Site Reliability Engineering, Cloud Infrastructure, or Platform Engineering
- Strong hands-on experience with AWS, GCP, or Azure
- Deep understanding of CI/CD pipelines and deployment automation
- Experience with Infrastructure as Code (Terraform, CloudFormation, Ansible, etc.)
- Strong Linux systems administration experience
- Experience managing Docker and Kubernetes environments
- Strong scripting skills using Bash, Python, or similar languages
- Experience with monitoring and logging tools
- Strong understanding of networking, DNS, SSL, load balancing, and system security
- Experience handling production incidents and root cause analysis
- Familiarity with high-availability and scalable distributed systems
Preferred Qualifications:
- Experience working in edtech,fintech, payments, banking, lending, or regulated industries
- Experience implementing DevSecOps practices
- Familiarity with SOC 2, PCI-DSS, NDPR/GDPR, or ISO compliance requirements
- Experience with microservices architecture
- Experience with Kafka, Redis, RabbitMQ, or event-driven systems
- Experience managing multi-environment deployment pipelines
Certifications such as:
- AWS Certified DevOps Engineer
- Certified Kubernetes Administrator (CKA)
- Google Professional Cloud DevOps Engineer
Key Competencies
We are looking for someone who:
- Takes strong ownership without constant supervision
- Can operate effectively under pressure and during incidents
- Thinks proactively and prevents problems before they occur
- Communicates clearly with both technical and non-technical stakeholders
- Balances speed, reliability, and security effectively
- Has strong troubleshooting and systems thinking abilities
- Can scale infrastructure alongside business growth
Success Metrics
Success in this role will be measured by:
- Platform uptime and reliability
- Deployment frequency and stability
- Incident response effectiveness
- Infrastructure scalability and cost optimization
- Security posture improvements
- Reduced operational bottlenecks
- Improved engineering productivity
Nice-to-Have Experience:
- AI infrastructure deployment
- MLOps pipelines
- FinOps/cloud cost optimization
- Multi-region deployment architecture
- Zero-downtime deployments
- Blue-green or canary release strategies
- API gateway and service mesh experience
Application Closing Date
Not Specified.
https://www.hotnigerianjobs.com/hotjobs/904048/senior-devops-engineer-at-thelix-holdings.html