Role
You will be a pivotal leader in Suitmedia's infrastructure and operations team, responsible for designing, building, and maintaining highly reliable, robust, and secure infrastructures with a strong emphasis on integrating security practices throughout the entire development and operations lifecycle. This role demands deep technical expertise in cloud platforms, automation, and cybersecurity.
Responsibilities
- Design and implement cost-optimized, secured, scalable, and highly-available cloud architectures (utilizing AWS, GCP, Azure) with high performance for all digital products.
- Lead the provisioning, configuration, and maintenance of infrastructure across all environments (production, staging, development, etc.) in a consistent, automated, and auditable manner (Infrastructure-as-Code).
- Establish and optimize comprehensive CI/CD pipelines, working closely with development teams to fully automate their build, test, and release procedures securely and efficiently.
- Implement advanced security best practices across infrastructure and applications, conduct standard penetration testing (e.g., using OWASP ZAP), and proactively identify and remediate security vulnerabilities.
- Design and execute performance testing strategies including load testing and stress testing (e.g., using Locust), and drive performance improvement initiatives at all levels of infrastructure.
- Design and implement robust automatic backup and disaster recovery plans, ensuring business continuity and data integrity.
- Design and implement comprehensive logging and monitoring solutions for all infrastructures, applications, and services with relevant metrics and alerting (e.g., Grafana, Prometheus, NewRelic, DataDog).
- Lead incident response and troubleshooting efforts in production environments, collaborating effectively with development and other teams to swiftly resolve issues.
- Establish and champion DevSecOps engineering best practices including systems, application security, processes, and comprehensive documentation for long-term sustainability.
Requirements
- Educational Background: Bachelor's degree in Computer Science, Software Engineering, Information Technology, or a related field.
- Professional Experience: Minimum of 4-6 years of progressive experience in DevOps or SRE (Site Reliability Engineering) roles, with at least 2+ years specifically in a DevSecOps or security-focused infrastructure role.
- Technical/Hard Skills: Excellent working knowledge of Linux server operating systems. Strong proficiency with CI/CD tools (e.g., Jenkins, AWS CodePipeline, GitLab CI, GitHub Actions). In-depth experience with FaaS (Function-as-a-Service) and CaaS (Container-as-a-Service) infrastructure, including Docker, Docker Swarm, and Kubernetes. Expert-level familiarity with the products and services of at least one major cloud platform (AWS, GCP, Azure). Strong understanding of infrastructure-as-code (e.g., Terraform, CloudFormation) and configuration management (e.g., Ansible, Puppet). Proficient with server monitoring and application performance management (APM) tools (e.g., Grafana, Prometheus, PagerDuty, NewRelic, DataDog). Familiarity with scalable database technologies (RDBMS and NoSQL).
- Soft Skills: Excellent attention to detail, strong curiosity, and proactive problem-solving abilities. Excellent teamwork and communication skills (written & verbal), capable of collaborating across development and operations teams. Strong understanding of web security best practices and secure coding principles.
- Bonus Points: Having cloud architect/engineer certifications (e.g., AWS Certified DevOps Engineer - Professional, Azure DevOps Engineer Expert). Having project experience in ISO 27001 information security standard compliance. Experience with microservices technology and distributed systems.
Thank you your interest in Suitmedia. Please note that while we are accepting applications, this posting is part of our ongoing effort to build a talent pool for future opportunities. We will keep your application on file and contact you should a suitable position become available.