Responsibilities:
Design, implement and maintain Ansible playbooks for infrastructure provisioning, configuration management and application deployment.
Develop and manage CI/CD pipelines using tools like Jenkins, GitLab CI or GitHub Actions.
Collaborate with data science teams to operationalize machine learning models and workflows.
Build and maintain event-driven automation.
Monitor system performance, troubleshoot issues and ensure high availability and scalability.
Implement infrastructure as code (IaC).
Ensure security, compliance and best practices across all DevOps processes.
Document processes, configurations and architectural decisions.
Requirements:
5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role.
Proficiency in Ansible for automation and configuration management.
Experience with cloud platforms (AWS, Azure or GCP).
Familiarity with machine learning pipelines and tools.
Strong understanding of event-driven systems and messaging platforms.
Proficiency in scripting languages such as Python, Bash or Go.
Experience with containerisation and orchestration (Docker, Kubernetes).
Solid understanding of networking, security and system administration.
EA License # 14C****