The Company
Our client is a local IT service and consulting company specialising in delivering customised enterprise solutions.
Location
Cyberjaya, Selangor, Malaysia
Job Summary
The Enterprise Cloud Infrastructure Specialist will lead the design, deployment, migration, integration and management of various cloud infrastructure solutions for enterprise clients. They will play a pivotal role in transforming client IT landscapes by migrating critical on-premises workloads (servers, databases, applications) to secure, scalable, and high-performance Virtual Private Cloud (VPC) environments.
Key Responsibilities
1. Cloud Infrastructure Design & Architecture:
- Design secure, scalable, and highly available Virtual Private Cloud (VPC) architectures tailored to enterprise client requirements.
- Develop detailed infrastructure blueprints, network topologies (VPCs, subnets, firewalls, VPNs, load balancers), and security models.
- Select optimal cloud services (IaaS, PaaS) based on workload characteristics (compute, storage, networking).
- Integrate cloud solutions with existing on-premises infrastructure (hybrid cloud designs).
2. Cloud Deployment, Integration & Automation:
- Implement and manage OpenStack private cloud environments (KVM stack, Microsoft Hyper-V, Proxmox VE).
- Deploy, configure, and optimize hypervisors and VMs within cloud or hybrid environments.
- Automate provisioning, configuration management, and operational tasks to ensure consistency and efficiency.
3. Enterprise Workload Migration:
- Plan and execute migration strategies (rehosting, replatforming, and refactoring) for enterprise workloads (physical and virtual servers, applications) moving from on-premises environments to target VPCs.
- Utilize migration tools (e.g., Fivetran, Airbyte, Cloudsfer) effectively.
- Minimize downtime and ensure data integrity during migration events.
- Perform thorough pre-migration assessments and post-migration validation.
4. Managed Infrastructure & Operations:
- Implement monitoring, logging, and alerting solutions (e.g., Prometheus/Grafana, ELK Stack, CloudWatch, PRTG) for proactive infrastructure management.
- Establish operational procedures for patch management, security hardening, capacity planning, and cost optimization within managed cloud environments.
- Provide Tier 3/4 support escalation for complex cloud infrastructure issues.
5. Backup, Disaster Recovery & Business Continuity:
- Design, implement, and manage robust cloud-based backup strategies using enterprise tools (e.g., Veeam, Commvault, Rubrik) and cloud-native solutions such as AWS Backup and Azure Backup.
- Architect, deploy, and test comprehensive Disaster Recovery (DR) and Business Continuity Planning (BCP) solutions for cloud and hybrid environments.
- Provide expert consultancy to clients on Disaster Recovery planning, including DR strategy, RPO/RTO definition, and solution selection.
- Lead DR drills and ensure recovery plans are effective and documented.
6. Security & Compliance:
- Implement cloud security best practices (identity & access management, network security groups/firewalls, encryption at rest & in transit, vulnerability management).
- Ensure infrastructure designs and deployments adhere to relevant compliance standards (e.g., ISO 27001, SOC 2, PDPA, CSA, PCI DSS) as required by clients.
7. Collaboration & Documentation:
- Collaborate closely with project managers, solution architects, application teams, and client stakeholders.
- Create and maintain comprehensive technical documentation (design docs, runbooks, as-built configurations, operational procedures).
- Mentor junior engineers and share knowledge within the team.
8. Operations & Life-Cycle Management:
- Establish operational procedures for monitoring, maintenance, and upgrades.
- Conduct performance tuning, fault isolation, and root cause analysis.
- Provide post-deployment support and optimization services.
9. Client Engagement & Delivery:
- Collaborate with the solutions team to gather requirements and present technical solutions.
- Lead technical workshops, presentations, and training sessions.
- Act as a trusted advisor for cloud services strategy and transformation initiatives.
10. Key Deliverables:
- Cloud Design Documents (HLD/LLD)
- Implementation Plans and Configuration Templates
- Test and Validation Reports
- Operational Runbooks and SOPs
- Technical Presentations and Client Reports
11. Associated Deliverables:
- Produce, update, review, and approve the Troubleshooting Knowledge Base, Operations & Maintenance technical documentation, Operation Readiness Checklist, UAT checklist, MOP documentation, etc., in compliance with Managed Services industry certifications (ISO 27001/2, etc.).
- Review and approve Technical Support Bulletins and other technical documentation in the Knowledge Base for initial investigation by Tier 1 & 2 Cloud Ops.
- Review and approve technical documentation that forms part of the Operations & Maintenance SOPs, such as training materials, manuals, and troubleshooting guides.
Technical Expertise
- Proven experience building and migrating to Virtual Private Cloud (VPC/VNet) environments is essential.
- Virtualization: Expert-level knowledge of, and hands-on experience in building, implementing, and operating, NFV and VNF frameworks and architectures. Experience with other hypervisors (KVM, Hyper-V) is a plus.
- Migration Expertise: Proven track record of successfully planning and executing large-scale migrations of enterprise workloads (servers, applications) from on-premises to cloud environments using industry standard tools and methodologies.
- Networking: Strong understanding of core networking concepts (TCP/IP, DNS, DHCP, Routing, VPN) and cloud networking (VPCs, Subnets, Security Groups/NSGs/NACLs, Load Balancers, Gateways).
- Backup & Disaster Recovery: Hands-on experience designing, implementing, and managing enterprise-grade backup solutions and Disaster Recovery strategies in the cloud. Experience with major backup tools (Veeam, Commvault, etc.) and cloud-native services.
- Operating Systems: Strong Linux (RHEL, CentOS, Ubuntu, etc.) and Windows Server administration skills.
- Security: Solid understanding of cloud security principles and best practices.
Qualifications
- Cloud Platforms: Deep, hands-on expertise with at least two major public clouds (AWS, Azure, GCP) *AND/OR* significant experience with OpenStack private cloud deployment and management.
- Relevant cloud certifications are an advantage: AWS Solutions Architect, Google Cloud Professional, Apache CloudStack.
- Experience with configuration management and infrastructure-as-code tools (e.g., Ansible, Puppet, Chef, Terraform).
- Deep knowledge of specific industry compliance requirements.
- Experience with multi-cloud or hybrid cloud management platforms.
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field (or equivalent demonstrable experience).
- 5+ years of hands-on experience in designing, deploying, and managing enterprise IT infrastructure, with at least 3 years focused on public/private cloud platforms.