Site Reliability Engineer (Data streaming and storage infrastructure) F/M

Job summary
Permanent contract
Paris
Salary: Not specified
No remote work
Skills & expertise
AWS
Slack
Java
Angular
Grafana

DataDome

The position

Job description

⭐ About the team:

Made up of six subteams (Dashboard, Engine, Infrastructure, Integrations, Threat Research & Security), the DataDome tech team is spread across Europe and the US. We handle over 2,000 billion events per day, responding within 3 ms at the 99th percentile. We are present in more than 25 data centers around the world, deployed using Docker.

We deploy on AWS, Scaleway, Vultr, and GCP, using Docker, Ansible and Terraform, and monitor with Grafana and Prometheus. We handle an average of 10 billion requests per day and manage more than 350 TB of data per month.

When it comes to our stack, we run a real-time detection layer in Java, a low-latency Stream Engine on Flink in Scala, ElasticSearch for storage, Kafka for communication between layers, HAProxy for load balancing, and Symfony & Angular for our dashboards. We use Slack, GitHub, Hangouts, & StackOverflow for Teams. Previous experience in cybersecurity is not a must: we'll pair you with mentors who will help you bridge the gaps. As #growth is part of our DNA, we’ll give you the resources and support you need to develop mastery.

The DevOps / SRE Team

Our DevOps / Site Reliability Engineering team currently consists of 12 SREs in charge of improving our delivery processes and providing a reliable and scalable platform for our customers.

The goal of one part of the team, called the Core team, is to handle data streaming and data storage in the most performant and efficient way. We are looking for an experienced Site Reliability Engineer with a background in distributed data storage solutions.

As a member of our Core team, you will:

    • Design, develop and optimise solutions based on ElasticSearch, Kafka and Flink clusters.
    • Develop efficient indexing and search strategies for large volumes of data.
    • Monitor ElasticSearch and Kafka performance and storage capacity.
    • Work with the team to integrate ElasticSearch, Kafka and Flink clusters with different cloud providers and bare-metal server providers.

Requirements

    • You have worked at very high scale with systems like ElasticSearch, Cassandra or MongoDB
    • You have extensive experience working in Unix/Linux environments and a good knowledge of networking
    • You care about code quality, simplicity and performance
    • You have a real passion for automation
    • You are familiar with, or ready to take, on-call duties
    • You are fluent in English

Bonus points

    • You have been building large platforms at scale and monitoring production environments for several years, in the cloud and/or on premises.
    • You have worked with Kubernetes.
    • You monitor your own house and/or homelab :-)
    • You have contributed to an open-source project

What’s in it for you?

    • Flex Life: We offer remote, hybrid, & in-office options, and each position specifies its level of flexibility. Our Parisian office is located next to the Opera Garnier. You will also receive a 500€ stipend to help you set up your ideal workspace if you work hybrid or remotely.
    • Generous Health Benefits: We have partnered with Alan for your healthcare needs.
    • Professional Development: #Growth is part of our DNA, so we have invested in an internal Learning and Development platform and offer the opportunity to request additional training and support via your manager.
    • Events & Team building: Feel the #TeamSpirit both virtually & onsite, with several events & workshops planned throughout the year, including an annual offsite event, quarterly online and offline events and parties, lunch & learns, & much more.
    • Parent Care: Gift & care packages for parents.
    • PTO: Depends on the country you are based in (e.g. 25 days in France).

🦄 DataDome’s bot and online fraud protection detects and mitigates attacks with unparalleled accuracy and zero compromise. Our machine learning solution analyzes 3 trillion signals per day to adapt to new threats in real time. Hundreds of enterprises worldwide—including Reddit, Rakuten, and AngelList—trust DataDome’s solution and 24/7 SOC experts to protect their mobile apps, websites, and APIs against online fraud, ATO, carding, scraping, layer 7 DDoS, credential stuffing, and more.

A force multiplier for IT and security teams, DataDome is fully transparent, easy to deploy, and frictionless for consumers. We offer the only secure, user-friendly, and privacy compliant CAPTCHA integrated with our complete 360° bot detection solution. With 25+ regional PoPs and autoscaling technology, DataDome responds to requests with zero latency and no impact on the speed of protected platforms.

DataDome was ranked top G2 Leader in Bot Detection & Mitigation for three consecutive periods: Fall 2022-Spring 2023, was named a Strong Performer in the 2022 Forrester Wave: Bot Management, and placed 21st in cybersecurity on the Inc. 5000. Certified a Great Place to Work in the US and France, DataDome’s dedicated team of 160+ BotBusters spans the globe as far as its high-profile customer base.

DataDome is an equal opportunity employer, and proud to be committed to diversity and inclusion. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.
