VP, Production Support Lead (Investment Services), Technology Group
Singapore, SG
GIC is one of the world’s largest sovereign wealth funds. With over 2,000 employees across 11 locations around the world, we invest in more than 40 countries globally across asset classes and businesses. Working at GIC gives you exposure to an extraordinary network of the world’s industry leaders. As a leading global long-term investor, we Work at the Point of Impact for Singapore’s financial future, and the communities we invest in worldwide.
Technology Group
We experiment, design, and lead a 24×7 global business where we support core capabilities in asset management, trading, investment operations, and risk management. We deliver secure, reliable, and integrated solutions, and provide insights on new and emerging technologies.
Business Partner & Solutions
You will help to explore new and existing technology to support our strategic and operational business needs across our Public Markets, Private Markets, Total Portfolio Risk, Corporate Services, and Enterprise Solutions.
What impact can you make in this role?
We are looking for a talented and driven Production Support Lead to join our dynamic team. The Production Support Lead will be responsible for ensuring the reliability, availability, and performance of all systems and services related to investment services in both public and private markets. Investment services is responsible for providing portfolio services for private market investments in real estate, private equity, infrastructure, cross-strategy investments, and services to support public markets investment. This role includes overseeing the operations team and collaborating with various teams to implement initiatives aimed at enhancing system stability and reliability.
What will you do as a VP, Production Support Lead (Investment Services)?
Monitoring & Observability:
• Define and lead the observability strategy to ensure comprehensive visibility into the health of all investment services applications in both public and private markets.
• Establish standard operating procedures (SOPs) to monitor system performance and availability, using metrics and logs to identify and resolve issues proactively.
• Oversee the architecture, engineering, and integration of events, logging, metrics, tracing, dashboards, alerting, and Service Level Objectives (SLOs).
• Implement best practices for observability to enhance incident detection and facilitate root cause analysis.
• Collaborate with DevOps teams to develop scalable, resilient, and secure systems.
• Promote the adoption of enterprise observability tool standards, including Grafana, Prometheus, OpenTelemetry, Splunk, and Datadog.
• Drive the automation of operational tasks to minimise toil and integrate reliability throughout the software delivery lifecycle.
• Establish and maintain performance Service Level Agreements (SLAs), Service Level Indicators (SLIs), and SLOs that align with product objectives.
• Communicate effectively with stakeholders, providing updates on system reliability and performance metrics.
Reliability:
• Design disaster recovery and business continuity plans.
• Work with software architects to design high-availability architecture for existing and new systems.
• Develop and maintain incident response playbooks and documentation.
Transformation & Efficiency:
• Manage incidents and outages, conducting post-mortems to identify root causes and prevent future occurrences.
• Drive capacity planning and performance tuning efforts to ensure systems can handle expected loads.
• Spearhead the modernisation of legacy systems and platforms, including cloud migration and tool consolidation, in collaboration with functional leads.
• Implement data-driven decision-making and performance benchmarking using advanced metrics and scorecards.
• Lead initiatives for cost optimisation and performance tuning across platforms and environments.
• Lead initiatives to enhance monitoring and alerting frameworks, ensuring proactive detection of issues, and minimising downtime through automated remediation processes.
Team Leadership & Development:
• Oversee the production support team and implement best practices in its day-to-day operations.
• Promote a culture of innovation, collaboration, continuous improvement, and engineering excellence.
What makes you a successful candidate?
• Bachelor’s degree in computer science, engineering, or a related field; or equivalent experience.
• 8+ years of experience in Site Reliability Engineering, DevOps, or a related field.
• Proven expertise in monitoring and observability tools and frameworks, along with hands-on knowledge of cloud platforms (AWS, GCP, Azure).
• Experience implementing and scaling SRE practices, including automation, incident response, and performance optimisation.
• Strong knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud), containers (e.g., Docker), and container orchestration (e.g., Kubernetes).
• Proficiency in programming and scripting languages (e.g., Python, Go, Bash).
• Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
• Proficiency with infrastructure-as-code platforms such as HashiCorp Terraform is a plus.
• Strong understanding of modern software delivery practices, CI/CD pipelines, and agile methodologies.
• Excellent problem-solving skills and the ability to work under pressure.
• Strong communication and collaboration skills.
• Certifications in cloud or SRE practices are a plus.
Work at the Point of Impact
We need to be forward-looking to attract the right people to help us become the Leading Global Long-term Investor. Join our ambitious, agile, and diverse teams: be empowered to push boundaries and pursue innovative ideas, share your views, and be heard. Be anchored on our PRIME Values: Prudence, Respect, Integrity, Merit and Excellence, which guide us in how we make our day-to-day decisions. We strive to inspire. To make an impact.
Flexibility at GIC
At GIC, our offices are vibrant hubs for ideation, professional growth, and interpersonal connection. At the same time, we believe that flexibility allows us to do our best work and be our best selves. Thus, our teams come into the office four days per week to harness the benefits of in-person collaboration, but have the flexibility to choose which days they work from home and adjust this arrangement as situational needs arise.
We are an equal opportunity employer
GIC is an equal opportunity employer, and we value diversity. We do not discriminate based on race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment.