About Company:
TasklyAfrica is a remote staffing platform that connects skilled professionals in Africa with companies around the world. We place talent across customer service, software engineering, DevOps, healthcare, AI, and more.
Job Description:
We are seeking a high-velocity and security-minded Senior Infrastructure Engineer to anchor the global AWS ecosystem of TasklyAfrica. In a high-growth SaaS environment, success is defined by "Architectural Resilience", the ability to harmonize rapid feature delivery with the clinical precision required for SOC 2 compliance and 99.9% uptime. This role is designed for a "System Architect" with 5+ years of SRE/DevOps experience who can blend "Infrastructure-as-Code Mastery" (Terraform) with the "Operational Grit" required to manage high-traffic ECS clusters and Cloudflare security layers.
Requirements:
1. Infrastructure Sovereignty & IaC Governance
Cloud Architecture: Own and operate TasklyAfrica’s AWS + Cloudflare infrastructure, ensuring every node, VPC, and S3 bucket is optimized for cost, security, and millimetric performance.
IaC Evolution: Manage and evolve our Terraform configurations to ensure environment consistency and repeatable provisioning across production, staging, and dev.
Scale Management: Lead capacity planning and resource tuning (EC2, ECS, RDS/Aurora) to support a rapidly growing global user base without compromising system latency.
2. Pipeline Engineering & Deployment Velocity
CI/CD Orchestration: Build and maintain high-fidelity automated pipelines (CircleCI preferred) to support modern deployment patterns such as Blue/Green, Canary, and Feature Flags.
Container Governance: Oversee Docker containerization strategies, ensuring seamless orchestration within Amazon ECS for scalable, predictable service delivery.
Developer Productivity: Build internal tools and documentation that reduce manual "toil," allowing cross-functional engineering teams to move faster and more securely.
3. Observability, Security & Compliance
Telemetry & Insights: Maintain and enhance our observability stack—Prometheus, Grafana, and CloudWatch—delivering actionable dashboards and tracking SLO/SLA metrics with clinical accuracy.
Security Hardening: Drive SOC 2 and compliance initiatives, including IAM hardening, secrets management (Vault/AWS Secrets Manager), and encryption-at-rest.
Incident Leadership: Serve as the primary driver for operational readiness, leading structured Root Cause Analysis (RCA) and implementing long-term reliability improvements.
Qualifications and Skills:
Core Infrastructure Profile
Cloud: 2+ years of production AWS expertise (ECS, RDS, VPC, IAM, S3).
IaC/DevOps: Expert-level Terraform and CircleCI proficiency.
Code/Scripting: Strong abilities in Ruby, Python, Go, or Bash.
Observability: Hands-on experience with Grafana, Prometheus, and Linux systems tuning.
Professional Requirements
Experience: 5+ years in DevOps/SRE roles, preferably within Fintech or high-compliance sectors (SOC 2, ISO 27001).
Leadership: Proven ability to lead incident responses and collaborate across cross-functional engineering and data science teams.
Salary
Very attractiveApplication Closing Date: Not specified
Application Instructions:
Click the button below to apply
Job Information
Deadline
Not specified
Job Type
Full-time
Industry
ICT/TECH
Work Level
Experienced
State
Nigeria
Country
Nigeria