Senior Data Scientist NLP/GenAI - Catalog

Mirakl
Paris, FR
Full time | FR | Paris | On-site | Technology | Engineering | lesjeudis.com

Job Description

About Mirakl:

Founded in 2012, Mirakl has been at the forefront of marketplace innovation, empowering every business to compete in the platform economy.

Today, Mirakl's operating system combines an enterprise marketplace solution (Mirakl Platform) that enables retailers and B2B organizations to launch, scale, and operate marketplaces and dropship, AI-powered multichannel selling (Mirakl Connect), retail media (Mirakl Ads), and an agentic commerce infrastructure (Mirakl Nexus).

With dual headquarters in Boston and Paris, Mirakl supports a global ecosystem of 450+ marketplaces (B2C and B2B) and a network of over 100k third-party marketplace sellers. Brands like Macy's, Decathlon, Carrefour, Asos, and Airbus Helicopters use Mirakl to grow their businesses in new and remarkable ways.

For more information: www.mirakl.com.

Mirakl in Numbers:
- Founded in 2012 | Member of French Tech Next40
- 750+ employees in 9 offices worldwide: Paris, Barcelona, Bordeaux, Boston, London, Munich, New York, Sydney, Tokyo
- 350+ Mirakl Tech team members, mainly based in France
- 5 SaaS solutions

Our Values:

Working at Mirakl means accelerating your career alongside ambitious, passionate, and supportive colleagues. We're proud of the diversity of backgrounds, perspectives, and experiences that make our teams unique.

Our 5 values guide how we collaborate:
- Work Hard Together: Teamwork and collaboration are the foundation of our success
- Get Things Done: We prioritize action and efficiency for impactful results
- Go Above & Beyond: We tackle challenges proactively and always aim for excellence
- Succeed Through Expertise: Knowledge sharing and continuous learning are core to our culture
- Satisfy & Empower Clients: We're committed to our clients' success

About the job:

You'll join our Data Science team, where your main mission will be to prototype, iterate, and ship algorithms to production in close collaboration with Product, Data Engineering, and Software teams.
Your projects will focus on Marketplace catalog challenges, including NLP, Computer Vision, and large-scale Generative AI (custom LLMs). The topics you'll tackle will have a real impact on our customers: we aim to make the most of our rich, diverse data to grow their revenue, streamline marketplace operations, and ensure user and transaction safety.

Remote set-up:
- 4 days per week worked from our offices
- 1 day per week worked remotely

We're hiring on a permanent contract (CDI), based in our Paris or Bordeaux office (1 day remote per week). As part of our Data team (60+ people), you will work on:

Catalog topics:
- Automatic rewriting of marketing content based on business needs
- Extracting product attributes from images and free text
- Detecting product variants
- Product categorization
- Automated onboarding of sellers' products
- Merging product pages from multiple sources
- Predicting trending products

What's in it for you:
- Build algorithms that visibly impact 500+ e-commerce/marketplace sites in 40 countries, including some with very high volumes (millions of products, customers, and orders per year)
- Work with cutting-edge techniques (multimodal models, LLM fine-tuning, etc.). Mirakl is one of the few French players with fine-tuned LLMs in large-scale production. Join us and keep pushing that pioneering spirit
- Real autonomy and ownership over your projects

Our stack and tools:

Python, TensorFlow, PyTorch, Hugging Face, Databricks, Spark, AWS (Amazon Redshift, S3, etc.), SQL, Airflow, Delta Lake.
LLM-specific: Autotrain, Unsloth, Galileo, LangChain, Anyscale.

Day to day, you will:
- Analyze and prepare data, prototype algorithms
- Put them into production with Data Engineers and dev teams
- Build dashboards to demonstrate algorithm performance and monitor production
- Present results at the weekly data science meeting and join team brainstorms
- Partner with other teams to refine use cases, user experience, and integration paths

You'll love this job if:
- You have at least 4 years' experience as a Data Scientist, with strong hands-on NLP and applied ML in industry
- You've deployed Machine Learning algorithms to production
- You know NLP and Computer Vision algorithms and state-of-the-art architectures (e.g., Transformers). Knowledge of the latest LLMs is a plus
- You're fluent in Python and TensorFlow and/or PyTorch
- You have experience with Spark development
- You're pragmatic, data-driven, and business-oriented
- You take full ownership of your topics, work autonomously, and are a great team player
- You bring a positive mindset: respect and kindness are core to your values
- You enjoy sharing your work through internal talks, conferences, or writing

Meet Arthur Delaitre, Data Science Manager for the team.

Want to join us?
- A 30-minute phone call with one of our Tech recruiters. We'll discuss your background, expectations, and what Mirakl can offer you
- A 30-minute technical Zoom with someone from the Data Science team to dive into concrete aspects of your expertise and how it fits our projects
- A take-home assignment
- A 75-minute technical debrief and discussion with the Data Science team manager
- A final 1-hour Zoom with future Mirakl colleagues about our values and culture

We welcome collaborators with their diverse perspectives and experiences to power us forward. These often far exceed conventional job requirements and help us create a culture of continuous learning.
If you're ready to join a global leader powering digital transformation for 450+ of the world's most innovative retailers and B2B organizations..

We may use Artificial Intelligence (AI) solutions to help streamline our hiring process, including screening applications, analyzing resumes, and assessing responses. While AI helps us work efficiently, all final hiring decisions are made by humans. For more information, visit our AI Guidelines for Candidates and Interviews.
Posted 4/22/2026