Principal Engineer, Infrastructure & Cloud Operations

Mediacorp Pte. Ltd.

COMPANY DESCRIPTION

Mediacorp is Singapore's largest content creator and national media network, operating a suite of TV channels, radio stations, and multiple digital platforms. Its mission is to engage, entertain, and enrich audiences by harnessing the power of creativity.

We are committed to creating an inclusive and diverse workplace where talent thrives. Our hiring decisions are made based on merit and fit-to-role. If you have a disability or special need that requires accommodation to participate in the recruitment process, please inform us when you submit your online application. We will be happy to support you as necessary.

Thank you for your interest in, and application for, this role. Please note that only shortlisted candidates will be contacted.

DESIGNATION: Principal Engineer, Infrastructure & Cloud Operations

RESPONSIBILITIES

We are looking for a hands-on Principal Engineer, Infrastructure & Cloud Operations to manage and improve Mediacorp's cloud and hybrid infrastructure platforms. This role will focus on infrastructure reliability, security, automation, and operational excellence, with a strong emphasis on Terraform-based Infrastructure-as-Code to enable consistent, secure, and scalable deployments. The successful candidate will work closely with application, security, engineering, and vendor teams to support platform delivery, incident response, and continuous service improvement.

Cloud & Hybrid Infrastructure Operations

  • Operate and support Mediacorp's cloud and hybrid infrastructure across AWS, Azure and/or GCP, as well as the supporting on-premises platforms.
  • Provide L2/L3 operational support for Linux and Windows environments, including troubleshooting, incident response, root cause analysis, and service restoration.
  • Manage infrastructure lifecycle activities including patching, upgrades, vulnerability remediation, backup and recovery, and operational health checks.
  • Ensure infrastructure platforms are reliable, secure, scalable, cost-conscious, and aligned with business and operational requirements.

Infrastructure-as-Code & Automation

  • Design, build, and maintain Infrastructure-as-Code capabilities using Terraform.
  • Develop reusable Terraform modules, standard infrastructure patterns, and environment promotion workflows.
  • Integrate infrastructure provisioning with CI/CD pipelines to improve consistency, auditability, and deployment reliability.
  • Drive automation initiatives to reduce manual effort, operational risk, and repetitive support activities.

Reliability, Observability & Incident Management

  • Improve infrastructure reliability through monitoring, alert optimisation, runbook development, and post-incident reviews.
  • Strengthen observability across infrastructure and platform services, including logging, metrics, alerting, and operational dashboards.
  • Support major incident management, root cause analysis, and follow-up actions to prevent recurrence.
  • Contribute to disaster recovery, high availability, and resilience planning where required.

Security, Governance & Compliance

  • Embed security and compliance requirements into infrastructure operations and deployment workflows.
  • Implement secure-by-default practices including least-privilege access, system hardening, logging, monitoring, backup controls, and audit readiness.
  • Support vulnerability remediation, configuration compliance, and operational risk assessments.
  • Work with security and governance teams to ensure infrastructure standards are practical, enforceable, and aligned with delivery needs.

Stakeholder Management & Technical Leadership

  • Act as a subject matter expert for cloud and hybrid infrastructure operations.
  • Partner with application teams, security teams, project stakeholders, vendors, and service providers to deliver stable and secure infrastructure services.
  • Provide technical guidance and mentoring to engineers on infrastructure operations, Terraform, automation, and secure operational practices.
  • Contribute to continuous improvement initiatives that raise operational maturity and service quality.

On-call Support

  • Participate in a 24x7 on-call rotation to support critical platforms and services.
  • Provide timely escalation support, incident response, and service recovery during production issues.

QUALIFICATIONS

  • At least 7 years of relevant experience in infrastructure, cloud operations, platform engineering, or systems engineering.
  • Strong hands-on experience managing cloud and hybrid infrastructure across AWS, Azure and/or GCP.
  • Strong Terraform and Infrastructure-as-Code experience, including reusable modules, state management, and CI/CD integration.
  • Solid Linux and Windows administration skills, including production troubleshooting, patching, upgrades, and vulnerability remediation.
  • Good knowledge of core infrastructure services including compute, storage, networking, IAM, load balancing, monitoring, backup, and disaster recovery.
  • Proficiency in scripting and automation using Python, PowerShell, Bash, or equivalent.
  • Strong understanding of secure infrastructure operations, including hardening, least-privilege access, compliance controls, and audit readiness.
  • Good communication, ownership, problem-solving, stakeholder management, and mentoring skills, with willingness to support a 24x7 on-call rotation.

Good to Have

  • Relevant cloud certifications in AWS, Azure, GCP, or equivalent.
  • Experience with DevSecOps, policy-as-code, observability, ITSM, CI/CD, or CDN technologies.
  • Interest or experience in AI-enabled operations, automation, or intelligent workflow improvement.

How to apply

To apply for this job, you need to log in on our website. If you don't have an account yet, please register.