Cloud infrastructure Team Lead
About The Position
Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive monitoring of logs, metrics, trace and security events with features such as APM, RUM, SIEM, Kubernetes monitoring and more, all enhancing operational efficiency and reducing observability spend by up to 70%.
As Cloud infrastructure Team Leader, you will lead a specialized team responsible for the availability, stability, and efficiency. You will manage a growing team, providing technical leadership, mentorship, and operational excellence. This role requires a blend of people management (70%) and technical contribution (30%), with a focus on ensuring reliability, defining the release process.
What You'll Do
- Lead and Scale the Team: Manage and mentor a growing team of engineers, fostering a collaborative and high-performing culture.
- Ensure Platform Reliability: Oversee the design and implementation of systems to maintain the availability, stability, and efficiency of the platform.
- Define and Manage Release Processes: Develop robust release workflows to ensure seamless delivery of features and updates to the platform.
- Collaborate Across Teams: Work closely with product, engineering, and external stakeholders to align on priorities, requirements, and deliverables.
- Drive Technical Excellence: Contribute to architecture, troubleshooting, and scalability strategies, ensuring the platform meets high-performance standards.
- Capacity Planning and Optimization: Proactively address scalability and resource efficiency challenges to optimize costs and performance.
Requirements
- Extensive Cloud Experience: 5+ years working with cloud platforms (AWS, GCP, or Azure) and distributed systems.
- Leadership Experience: Proven experience managing and mentoring a team of at least 3 engineers, with plans for scaling teams.
- Proficiency with Kubernetes and Istio (or similar): Deep understanding of Kubernetes, service mesh technologies, and container orchestration.
- Kafka Expertise: Hands-on experience with Kafka, including scaling and performance optimization.
- Observability and Monitoring: Strong background with observability tools like Prometheus, Grafana, and OpenTelemetry or equivalents.
- System Scalability and Stability: Demonstrated experience designing, implementing, and maintaining highly scalable and stable systems.
- Strong Programming Skills: Solid experience in programming for system-level development.
- Familiarity with Release Processes: Experience defining and managing CI/CD pipelines and release workflows.
- Collaborative and Strategic Thinking: Ability to manage cross-functional collaboration and align technical efforts with business goals.
Nice-to-Have:
- Experience with white-label or B2B SaaS products.
- Familiarity with high-performance databases like PostgreSQL or ClickHouse.
- Background in managing partnerships with large enterprises
Cultural Fit
We’re seeking candidates who are hungry, humble, and smart. Coralogix fosters a culture of innovation and continuous learning, where team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we’d love to hear from you.
Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply.