As a Data Scientist, you will integrate the Data Science team crossing expertise in Computer Vision, Time Series Forecasting and Data Insights. You will be at the heart of the development of the Heuritech code base and data stack. Working along with our Product team, you will help to build and provide insights from data to our clients, based on our data warehouse. We expect this to be done mainly through the development of industrialized data solutions (methodology, data transformations and analytics) that can be further leveraged internally or directly accessible by clients.
Our full data pipeline starts with data gathering and ingestion, handled by the Data Engineering team. The data is then processed through Computer Vision AI models producing a large amount of semantic labels. Those models and their deployment are handled by the Data Science team. These labels are also aggregated throughout time to build time series, from which we construct various metrics and compute forecasts.
Under the supervision and guidance of the Data Science Lead, you will lead various projects and key components of our stack, from defining methodology and transformations to apply in our Snowflake Data Warehouse to defining new metrics to plug in our products. At Heuritech we encourage ownership and proactivity, so you will be granted a lot of autonomy to carry out your mission.
You will take part in developing robust data analytics products, help the business & Product team deliver insights to our clients, and help build the Data Science toolbox to make our processes more efficient. Being the go-to person on our data and the metrics we give to our clients, you will sometimes take part of the technical followup of some clients needs and help translate it into a technical solution. You will also be asked to explore our data to retrieve insights out of it and improve the methodology around our processes using data. As such, you will play a pivotal role between the Data Science team, Product team and Data Engineering team from the crawling of raw data to the delivery of actionable insights.
You have several years of experience (3-4+) in the fields of Data Science. You are familiar with working around big data topics around cloud solutions (SnowFlake / Google Big Query / Amazon Redshift) and you really like building clean and strong data solutions. We are looking for someone eager to give business-driving answers for our clients using our semantic data stored in a newly built data warehouse. We are looking for someone who loves to tinker with the data and likes to strive for code quality and scalability.
We are also looking for someone that will not only work on prototyping on a notebook but also industrialize and integrate the solutions in long-living data software. You are proactive and like to rechallenge the existing. You’re eager to transform technical debt into reliable, clean and well orchestrated data pipelines. You like to mock up new data explorations & viz along with business experts, and to try new technologies for that.
Former experience with databases (either SQL or NoSQL) or data warehouses (Google Big Query, Snowflake, etc.), data modeling and querying. You are comfortable with SQL language.
Experience in developing industrialized and automated data pipelines. A former experience in DBT is a plus.
Knowledge of tools like Docker or Apache Airflow.
Experience in Python programming and with the usual Data Science python tools (Numpy, Pandas, Sklearn).
Code quality is not secondary for you (unit/integration tests, CI/CD, code formatting, pre-commit hooks, etc.)
You are an expert of descriptive statistics tools and techniques. Notably, you have a very good understanding of Machine Learning-related metrics (precision, recall, etc.).
Good knowledge of data exploration methods (PCA, high dimension visualization methods, statistical tests, etc.).
Having a prior experience with a deep learning framework is a plus but definitely not mandatory (TensorFlow, Pytorch)
As we are an international company, you must be very comfortable with english.
Have previous experience in the improvement of user experience in a data product.
You are eager to share results about complex processes to a novice public (internally and / or externally).
Last but not least, we appreciate people willing to share good practice of development and their advances across technical watches, or the results of the projects they are staffed on.
You are looking for a full-time position
Send your CV & cover letter (a few accurate points are better than a long letter), references are a plus
A first 30 minutes visio with the Data Science team lead and understand our common objectives
If it fits well, we will send you a technical test
A final meeting in our offices to debrief around the technical test and to meet our team
APPLY TO: paul.morel@heuritech.com and thomas.robert@heuritech.com
Ces entreprises recrutent aussi au poste de “Data / Business Intelligence”.
Voir toutes les offres