Coralogix

Platform Engineer (Application SRE)

  • Engineering
  • Berlin, Germany
  • Senior
  • Full-time

Description

Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive monitoring of logs, metrics, trace and security events with features such as APM, RUM, SIEM, Kubernetes monitoring and more, all enhancing operational efficiency and reducing observability spend by up to 70%.

As an Application SRE at Coralogix, you’ll join our Platform Group, the team responsible for designing, building, and maintaining the critical infrastructure and services that power our observability platform. In this role, you will:

  • Influence platform architecture, establishing design patterns for resiliency, scalability, and maintainability.
  • Shape the group culture, fostering an environment of innovation, collaboration, and continuous improvement.
  • Work cross-functionally with engineering, product, security, QA, and other teams to ensure reliability, scalability, and performance across the organization.

You’ll be hands-on, combining software development, platform engineering, and advanced troubleshooting of complex production issues. You will also have opportunities to contribute to open-source projects aligned with Coralogix’s infrastructure needs.


Requirements


  • This role requires the candidate to be located in Europe due to time zone alignment and regional market focus.
  • Software Engineering Expertise: 5+ years of experience developing in GoRust, or Java.
  • Distributed Systems & Platform Skills: Hands-on experience with KafkaKubernetes, and microservices at scale.
  • Infrastructure/DevOps Mastery: Skilled in containerization, CI/CD workflows, and Infrastructure-as-Code (Terraform, Helm, etc.).
  • Data Systems Proficiency: Experience operating ClickHouseOpenSearch, and related caching or event-driven pipelines.
  • Reliability & Observability: Proficiency in monitoring, logging, alerting (Prometheus, Grafana, Coralogix, etc.) and defining SLIs/SLOs.
  • Open-Source Engagement: Evidence of contributions to open-source communities or tools (e.g., patches, PRs, discussions).
  • Troubleshooting & Production Expertise: Proven ability to debug large-scale, high-volume production systems with a strong understanding of distributed principles.


Preferred Qualifications

  • High-Volume Data Pipeline Experience: Familiarity with optimizing throughput and reliability in event-driven architectures.
  • Cloud Experience: Proficiency in AWS, GCP, or Azure for production environments.
  • Community Leadership: Past involvement in SRE/DevOps communities, conferences, or meetups; experience sharing and demonstrating best practices.


Cultural Fit

We’re seeking candidates who are hungry, humble, and smart. Coralogix fosters a culture of innovation and continuous learning, where team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we’d love to hear from you.

Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply.