Director, Network Reliability Engineering - 11506
Coupang · India
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.
Why join Coupa?
🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend. 🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence. 🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.
Learn more on Life at Coupa blog and hear from our employees about their experiences working at Coupa.
What You’ll Do : Strategic Leadership & Vision: Define the long-term technical strategy and roadmap for Coupa's global cloud network infrastructure, aligning with business objectives and product goals. Global Team Leadership: Hire, mentor, and lead global teams of high-performing network development engineers, managers, and architects across numerous geographies. Foster a culture of continuous learning, automation, and engineering excellence. Project Planning & Execution: Lead complex, multi-quarter infrastructure projects from conception through detailed planning and execution, ensuring timely delivery, optimal resource allocation, and alignment with cross-functional dependencies. Network Architecture & Deployment: Oversee the architectural design, deployment, and scaling of complex network solutions strictly in public cloud environments (AWS, GCP, and Azure). Monitoring & Observability: Define and drive a comprehensive network observability strategy. Implement robust monitoring, alerting, and telemetry frameworks to ensure proactive issue detection, rapid troubleshooting, and deep visibility into global network health and performance. Security & Compliance: Champion cloud network security initiatives, ensuring the robust deployment of cloud-native security tooling (IDS/IPS, WAFs) to maintain strict compliance and data protection standards. Automation & IaC Strategy: Drive an "infrastructure as code" (IaC) and automation-first mindset across the organization to eliminate toil, streamline deployments, and improve network reliability. Cross-Functional Collaboration: Partner closely with Site Reliability Engineering (SRE), Cloud Operations, Security, and Product teams to deliver seamless and highly available services. Operational Excellence & Budgeting: Manage department budgets, cloud infrastructure spending, and vendor relationships. Ensure operational excellence by establishing SLAs, SLOs, and leading high-level incident response.
What You Will Bring to Coupa : Education & Experience: Bachelor's or Master's degree in Computer Science, Engineering or equivalent experience. 10+ years of professional experience in enterprise SaaS environments, including at least 5+ years in a Senior Manager or Director-level role. Global Leadership: Proven experience leading and scaling distributed engineering teams across numerous geographies and time zones. Cloud & Networking Expertise: Deep, authoritative knowledge of public cloud networking constructs with hands-on technical experience in AWS and GCP (Azure is a plus). Expert in cloud-native routing, DNS, VPNs, Transit Gateways, VPCs/Shared VPCs, and global traffic management. Observability Tools: Strong experience designing and utilizing modern monitoring and observability platforms to maintain highly available cloud networks. Project & Delivery Management: Demonstrated success in project planning, agile methodologies, and managing large-scale infrastructure rollouts from end to end. Security Acumen: Strong background in cloud network security, including deep architectural understanding of zero-trust architectures and deploying enterprise security controls. Automation Leadership: Proven track record of leading teams that utilize coding languages (Python, Go, Java) and IaC tools (Terraform, Ansible, CloudFormation) to automate cloud infrastructure. Problem Solving & Communication: Exceptional critical thinking and problem-solving skills. Executive-level communication abilities to articulate complex technical concepts to non-technical stakeholders and the C-suite.