Role
You will be a key contributor to our infrastructure and operations team, responsible for building, maintaining, and enhancing the reliable, secure infrastructure that supports Suitmedia's digital products and services. This role emphasizes hands-on implementation of automation, CI/CD, and system monitoring.
Responsibilities
- Provision, configure, and maintain infrastructure for all environments (production, development, etc.) using automation and Infrastructure-as-Code principles.
- Set up and manage CI/CD pipelines, collaborating closely with development teams to automate their build, test, and release procedures.
- Implement security best practices at the infrastructure level, assist in basic penetration tests, and promptly identify and report security findings.
- Perform load testing and stress testing of applications and infrastructure to ensure performance and scalability.
- Implement automatic backup and disaster recovery mechanisms for critical systems.
- Configure and maintain logging and monitoring solutions for all infrastructures, ensuring relevant metrics are collected and alerts are functional.
- Perform preventive infrastructure maintenance and contribute to performance improvement initiatives across the infrastructure stack.
- Troubleshoot and resolve incidents in production and non-production environments, collaborating with the relevant teams to diagnose and fix issues.
- Contribute to the documentation of DevOps engineering best practices, systems, applications, and processes.
Qualifications
- Educational Background: Bachelor's degree in Computer Science, Software Engineering, Information Technology, or a related field.
- Professional Experience: 2-4 years of experience in DevOps, Cloud Engineering, or System Administration roles.
- Technical/Hard Skills:
  - Excellent working knowledge of Linux/Unix server operating systems, plus familiarity with Windows.
  - Familiarity with a range of CI/CD tools (e.g., GitLab CI, Google Cloud Build, AWS CodeBuild, GitHub Actions, Jenkins).
  - Experience with products of top-tier cloud platforms (AWS, GCP, Azure, Aliyun).
  - Proficiency with virtualization and containerization technologies (Docker, Kubernetes).
  - Familiarity with infrastructure-as-code concepts and configuration management tools (e.g., Ansible, Puppet, Terraform).
  - Experience with server monitoring and application performance management (APM) tools (e.g., Grafana, Prometheus, New Relic, Datadog).
  - Familiarity with scalable database technologies (RDBMS and NoSQL).
- Soft Skills: Excellent attention to detail, strong curiosity, and a proactive approach to problem-solving. Excellent teamwork and communication skills (written & verbal).
- Bonus Points: Cloud engineering certifications (e.g., AWS Certified Solutions Architect - Associate). Experience with microservices technology. Basic understanding of web security principles.