back to home

Engineer1999 / A-Curated-List-of-ML-System-Design-Case-Studies

This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are organized to help you easily find relevant case studies based on industry or specific ML use cases.

9,200 stars
1,337 forks
5 issues

AI Architecture Analysis

This repository is indexed by RepoMind. By analyzing Engineer1999/A-Curated-List-of-ML-System-Design-Case-Studies in our AI interface, you can instantly generate complete architecture diagrams, visualize control flows, and perform automated security audits across the entire codebase.

Our Agentic Context Augmented Generation (Agentic CAG) engine loads full source files into context, avoiding the fragmentation of traditional RAG systems. Ask questions about the architecture, dependencies, or specific features to see it in action.

Embed this Badge

Showcase RepoMind's analysis directly in your repository's README.

[![Analyzed by RepoMind](https://img.shields.io/badge/Analyzed%20by-RepoMind-4F46E5?style=for-the-badge)](https://repomind-ai.vercel.app/repo/Engineer1999/A-Curated-List-of-ML-System-Design-Case-Studies)
Preview:Analyzed by RepoMind

Repository Summary (README)

Preview

ML System Design Case Studies Repository

Description

Welcome to the ML System Design Case Studies Repository! This repository is a comprehensive collection of 300+ case studies from over 80 leading companies, showcasing practical applications and insights into machine learning (ML) system design. Companies like Netflix, Airbnb, and Doordash have shared their experiences, providing a valuable resource for anyone interested in learning how ML is used to improve products and processes.


🌐 Featured Resource: <a href="https://horizonx.live" target="_blank" rel="noopener noreferrer">HorizonX.live</a>

Supercharge your Research journey with <a href="https://horizonx.live" target="_blank" rel="noopener noreferrer">HorizonX.live</a> — the all-in-one platform for all your research needs.

  • 🔍 Search & Explore: Instantly find trending ML papers, code, and datasets.
  • 🗂️ Personalized Libraries: Save, organize, and annotate your favorite research.
  • 📝 Interactive Paper Summaries: Concise, easy-to-digest summaries for each paper.

🚀 Upcoming Features

We are continuously working to enhance this resource! Here are some exciting features coming soon:

🌟 Upcoming Features

  • <span style="color: #39FF14; font-size: 1.2em; vertical-align: middle;">🟢</span> Context-Aware Literature Engine
    Speed up your literature review with AI that understands your research context.

  • 🤖 AI-Powered Brainstorming Engine
    Collaborate with your intelligent assistant to spark, shape, and elevate breakthrough ideas.

  • 🛠️ Low-Code Data Analysis
    Transform raw data into publication-ready charts and insights—no coding required.

  • 👥 Real-Time Collaboration
    Google Docs meets research-grade Overleaf for seamless teamwork.

  • 🔗 Unified Knowledge Nexus
    A single intelligent workspace that connects your ideas, insights, and research tools.

  • 📚 Automated Citation Manager
    Intelligent reference management system for effortless citations.

  • 📝 Smart Formatting Assistant
    Journal compliance made effortless.

  • 🔍 Pre-Publication Quality Check
    Your 24/7 peer review partner for manuscript quality assurance.

  • 💻 On-Demand HPC Access
    Computing power without the setup headaches.

Stay tuned for these updates and feel free to suggest features or contribute!

Try it now: <a href="https://horizonx.live" target="_blank" rel="noopener noreferrer">https://horizonx.live</a>


Features

  • Wide Range of Industries: Explore case studies from various industries such as tech, finance, healthcare, and more.
  • Diverse ML Applications: Learn about different ML use cases, including computer vision (CV), natural language processing (NLP), recommender systems, search and ranking, fraud detection, and many more.
  • Product Features: Discover how ML powers specific user-facing features, from grammatical error correction to generating outfit combinations.

Why This Resource is Valuable

  • Authentic and In-depth: Each case study is sourced from detailed blogs, papers, or articles about ML systems developed in-house, providing genuine and firsthand insights.
  • Practical Applications: The studies cover real-world ML systems that are actively used in production, offering practical and proven examples.
  • Focused and Detailed: The case studies focus on specific ML use cases, providing clear and comprehensive information on the target users, model designs, evaluation criteria, and deployment architectures.

How to Use

  • Short Description: Use the discription to quickly find case studies relevant to your interests.
  • Explore and Learn: Dive into the detailed descriptions and implementations to gain a deeper understanding of ML system design.
  • Share and Collaborate: If you find the database helpful, spread the word and contribute to the repository by suggesting new case studies.

Enjoy exploring the wealth of knowledge in these case studies and enhance your understanding of machine learning system design!

Real-world ML systems

IndexCompanyIndustryDescription (< 5 words)TitleYear
1StripeFintech and bankingPrevent fraudelent transactionsHow we built it: Stripe Radar2023
2WalmartE-commerce and retailRecommend complementary itemsPersonalized ‘Complete the Look’ model2023
3UberDelivery and mobilityForecast demand for airport ridesDemand and ETR Forecasting at Airports2023
4PinterestSocial platformsPrevent advertiser churnAn ML based approach to proactive advertiser churn prevention2023
5Stitch FixE-commerce and retailGenerate ad headlinesA New Era of Creativity: Expert-in-the-loop Generative AI at Stitch Fix2023
6SwiggyDelivery and mobilityRecommend items to orderBuilding a mind reader at Swiggy using Data Science2023
7MicrosoftTechDiagnose production incidents with LLMLarge-language models for automatic cloud incident management2023
8FoodpandaDelivery and mobilityOptimize menu sorting orderMenu Ranking2023
9ZillowE-commerce and retailEstimate the house market valueBuilding the Neural Zestimate2023
10AirbnbTravel,E-commerce and retailIdentify user interestsPrioritizing Home Attributes Based on Guest Interest2023
11GitHubTechGenerate code and code suggestionsInside GitHub: Working with the LLMs behind GitHub Copilot2023
12DoorDashDelivery and mobilityOptimize courier waiting timeLifecycle of a Successful ML Product: Reducing Dasher Wait Times2023
13LinkedinSocial platformsSelect best payment gatewayImproving the customer’s experience via ML-driven payment routing2023
14WayfairE-commerce and retailPredict delivery timesDelivery-Date Prediction2023
15LinkedinSocial platformsDetect viral spamViral spam content detection at LinkedIn2023
16LyftDelivery and mobilityRecommend content in appThe Recommendation System at Lyft2023
17HoneycombTechGenerate queries with natural languageAll the Hard Stuff Nobody Talks About when Building Products with LLMs2023
18ZalandoE-commerce and retailForecast demand in fashion e-commerceDeep Learning based Forecasting: a case study from the online fashion industry2023
19EtsyE-commerce and retailRecommend relevant marketplace itemsHow We Built a Multi-Task Canonical Ranker for Recommendations at Etsy2023
20YelpSocial platformsOrganize e-commerce content using embeddingsYelp Content As Embeddings2023
21MonzoFintech and bankingSelect relevant marketing messagesOptimising marketing messages for Monzo users2023
22MonzoFintech and bankingDetect patterns in text dataUsing topic modelling to understand customer saving goals2023
23WayfairE-commerce and retailPredict new product’s sales potentialHow Wayfair uses “Predicted Winners” Models to Accelerate Success for New Products2023
24AirbnbTravel,E-commerce and retailPersonalized listing searchLearning To Rank Diversely2023
25TwitterSocial platformsRecommend interesting tweetsTwitter's Recommendation Algorithm2023
26DoorDashDelivery and mobilityPredict if a store is openHow DoorDash Upgraded a Heuristic with ML to Save Thousands of Canceled Orders2023
27WayfairE-commerce and retailIdentify business customersHamlet: Wayfair's ML Approach to Identifying Business Shopper2023
28WayfairE-commerce and retailDetect fraud with embeddingsIntroducing Melange: A Customer Journey Embedding System for Improving Fraud and Policy Abuse Detection2023
29AirbnbTravel,E-commerce and retailImprove travel search experienceBuilding Airbnb Categories with ML & Human in the Loop2023
30SpotifyMedia and streamingAutomatically generate ad contentHow We Automated Content Marketing to Acquire Users at Scale2023
31InstacartE-commerce and retailPredict availability of food itemsHow Instacart Modernized the Prediction of Real Time Availability for Hundreds of Millions of Items While Saving Costs2023
32LinkedinSocial platformsPersonalize the homepage feedEnhancing homepage feed relevance by harnessing the power of large corpus sparse ID embeddings2023
33DoordashDelivery and mobilityForecast order volumes and deliveriesHow DoorDash Built an Ensemble Learning Model for Time Series Forecasting2023
34ExpediaTravel,E-commerce and retailForecast flight pricesUsing Synthetic Search Data for Flights Price Forecasting2023
35NextdoorSocial platformsGenerate engaging email subject linesLet AI Entertain You: Increasing User Engagement with Generative AI and Rejection Sampling2023
36CriteoTechFigure out users' preferencesRecommender systems need a user model2023
37AppleTechIdentify objects on imagesFast Class-Agnostic Salient Object Segmentation2023
38ZillowE-commerce and retailIdentify and block unwanted callersSpectroBrain: Detecting Phone Spam with Semi-Supervised Learning2023
39AlgoliaTechSuggest relevant search queriesFeature Spotlight: Query Suggestions2023
40NetflixMedia and streamingIn-video searchBuilding In-Video Search2023
41GrabDelivery and mobility,Banking and financeAutomatically tag sensitive dataLLM-powered data classification for data entities at scale2023
42DoordashDelivery and mobilityAccurately forecast demand during holidaysHow DoorDash Improves Holiday Predictions via Cascade ML Approach2023
43NetflixMedia and streamingPersonalize video clipsThe Next Step in Personalization: Dynamic Sizzles2023
44BlaBlaCarDelivery and mobilityPrevent phishing and payment fraudHow we used machine learning to fight fraud at BlaBlaCar — Part 12023
45InstacartE-commerce and retailPersonalize user experience by recommending relevant productsUsing Contextual Bandit models in large action spaces at Instacart2023
46PinterestSocial platformsRecommend similar visual contentTraining Foundation Improvements for Closeup Recommendation Ranker2023
47SpotifyMedia and streamingRecommend new complementary musicSpotify Track Neural Recommender System2023
48MetaSocial platformsGenerate code with LLMIntroducing Code Llama, a state-of-the-art large language model for coding2023
49GrammarlyTechSuggest gender-inclusive grammatical error correctionsImproving the Performance of NLP Systems on the Gender-Neutral “They”2023
50NetflixMedia and streamingDetect speech and music in audioDetecting Speech and Music in Audio Content2023
51SalesforceTechExtract relevant information from a knowledge articleResolve Cases Quickly with Interactive Einstein Search Answers2023
52EtsyE-commerce and retailShow relevant adsLeveraging Real-Time User Actions to Personalize Etsy Ads2023
53GitHubTechAI copilot for code generationHow to build an enterprise LLM application: Lessons from GitHub Copilot2023
54UberDelivery and mobilityDetect potential fraudulent entitiesRisk Entity Watch – Using Anomaly Detection to Fight Fraud2023
55ExpediaTravel,E-commerce and retailPredict Customer Lifetime Value (CLV)Expedia Group’s Customer Lifetime Value Prediction Model2023
56DailymotionMedia and streamingRecommend diversified video contentReinvent your recommender system using Vector Database and Opinion Mining2023
57SwiggyDelivery and mobilityPredict food delivery timeWhere is my order? — Part I2023
58SwiggyDelivery and mobilityСonversational and open-ended searchSwiggy’s Generative AI Journey: A Peek Into the Future2023
59New York TimesMedia and streamingRecommend recipes to readersHow The New York Times Cooking Team Makes Personalized Recipe Recommendations2023
60ExpediaTravel,E-commerce and retailSuggest diverse travel recommendationsGenerating Diverse Travel Recommendations2023
61Stitch FixE-commerce and retailPersonalize styling recommendationsAccelerating AI: Implementing Multi-GPU Distributed Training for Personalized Recommendations2023
62DoordashDelivery and mobilityAreas for using Generative AIDoorDash identifies Five big areas for using Generative AI2023
63EtsyE-commerce and retailSearch by imageFrom Image Classification to Multitask Modeling: Building Etsy’s Search by Image Feature2023
64SpotifyMedia and streamingGenerate audio podcast previewsLarge-Scale Generation of ML Podcast Previews at Spotify with Google Dataflow2023
65Delivery HeroDelivery and mobilityBetter understand user behaviorPersonalisation @ Delivery Hero: Understanding Customers2023
66SwiggyDelivery and mobilityPredict food delivery timePredicting Food Delivery Time at Cart2023
67NetflixMedia and streamingGenerate content recommendations for usersLessons Learnt From Consolidating ML Models in a Large Scale Recommendation System2023
68LinkedinSocial platformsShow relevant jobs in searchHow LinkedIn Is Using Embeddings to Up Its Match Game for Job Seekers2023
69ExpediaTravel,E-commerce and retailAlert users about optimal dealsIncreasing Travelers’ Engagement Through Price Alerts2023
70WalmartE-commerce and retailResolve entities and detect relationshipsExploring an Entity Resolution Framework Across Various Use Cases2023
71ThoughtworksTechAI copilot for product strategyBuilding Boba AI2023
72GrabDelivery and mobility,Banking and financeAutomatically detect new fraud typesUnsupervised graph anomaly detection - Catching new fraudulent behaviours2023
73DropboxTechIdentify date formats in file namesIs this a date? Using ML to identify date formats in file names2023
74GrabDelivery and mobility,Banking and financeСreate scalable lookalike audiencesStepping up marketing for advertisers: Scalable lookalike audience2023
75WayfairE-commerce and retailSend relevant communications to customersGriffin: How Wayfair Leverages Reinforcement Learning to Send Customers Relevant Communications2023
76WhatnotE-commerce and retailDetect marketplace spamHow Whatnot Utilizes Generative AI to Enhance Trust and Safety2023
77InstacartE-commerce and retailPredict grocery item availabilityHow Instacart’s Item Availability Evolved Over the Pandemic2023
78InstacartE-commerce and retailPredict availability of food itemsInstacart’s Item Availability Architecture: Solving for scale and consistency2023
79BlaBlaCarDelivery and mobilityPrevent phishing and payment fraudHow we built our machine learning pipeline to fight fraud at BlaBlaCar — Part 22023
80SalesforceTechSummarize Slack conversationsAI Summarist: Get Your Time Back on Slack, Boost Productivity & Focus, Personalize Information Consumption2023
81MetaSocial platformsShow users relevant content at scaleScaling the Instagram Explore recommendations system2023
82Delivery HeroDelivery and mobilityRecommend restaurants for new customersPersonalisation @ Delivery Hero: Ranking restaurants for new users2023
83SwiggyDelivery and mobilityPredict food delivery timeHow ML Powers — When is my order coming? — Part II2023
84SalesforceTechRecommend apps in the marketplaceOn the Diversity and Explainability of Enterprise App Recommendation Systems2023
85GrabDelivery and mobility,Banking and financeOptimize promotional campaignsScaling marketing for merchants with targeted and intelligent promos2023
86GitHubTechAutomated code reviews and PR taggingGenerative AI-enabled compliance for software development2023
87Delivery HeroDelivery and mobilityRecommend restaurantsDon’t Worry, We Got You: Personalised Model2023
88OLXE-commerce and retailPredict order delivery timeMachine Learning for Delivery Time Estimation2023
89SpotifyMedia and streamingTarget in-app messagingExperimenting with Machine Learning to Target In-App Messaging2023
90NubankFintech and bankingAutomatically route customer phone callsPresenting Precog, Nubank’s Real Time Event AI2023
91InstacartE-commerce and retailBuild an internal AI assistantScaling Productivity with Ava — Instacart’s Internal AI Assistant2023
92MetaSocial platformsTranslate and transcribe across speech and textBringing the world closer together with a foundational multimodal model for speech translation2023
93VimeoMedia and streamingCustomer support AI assistantFrom idea to reality: Elevating our customer support through generative AI2023
94EbayE-commerce and retailRecommend relevant e-commerce itemsBuilding a Deep Learning Based Retrieval System for Personalized Recommendations2022
95Mercado LibreDelivery and mobilityPredict product dimensions for deliveryPredicting package dimensions based on a similarity model at Mercado Libre2022
96DoordashDelivery and mobilityRecommend substitute itemsEvolving DoorDash’s Substitution Recommendations Algorithm2022
97PinterestSocial platformsPersonalize homepage contentsHow Pinterest Leverages Realtime User Actions in Recommendation to Boost Homefeed Engagement Volume2022
98InstacartDelivery and mobilitySearch food and grocery itemsHow Instacart Uses Embeddings to Improve Search Relevance2022
99WalmartE-commerce and retailAssist in e-commerce shoppingA Unified Multi-task Model for Supporting Multiple Virtual Assistants in Walmart2022
100SpotifyMedia and streamingSearch for podcastsIntroducing Natural Language Search for Podcast Episodes2022
101NextdoorSocial platformsPredict harmful commentsUsing predictive technology to foster constructive conversations2022
102WalmartE-commerce and retailFill shopping cart via voice dialogVoice Reorder Experience: add Multiple Product Items to your shopping cart2022
103ExpediaTravel,E-commerce and retailCategorize customer feedbackCategorising Customer Feedback Using Unsupervised Learning2022
104FoodpandaDelivery and mobilityClassify restaurants and cuisinesClassifying restaurant cuisines with subjective labels2022
105EbaySocial platformsRecommend products and contentMulti-Relevance Ranking Model for Similar Item Recommendation2022
106GoustoDelivery and mobilityPredict subscription churnUsing Data Science to Retain Customers2022
107GoogleTechGenerate summariesAuto-generated Summaries in Google Docs2022
108YelpSocial platformsPersonalize recommendationsBeyond Matrix Factorization: Using hybrid features for user-business recommendations2022
109PayPalFintech and bankingPrioritize sales leadsSales Pipeline Management with Machine Learning: A Lightweight Two-Layer Ensemble Classifier Framework2022
110GrubhubDelivery and mobilityForecast order volumeForecasting Grubhub Order Volume At Scale2022
111GithubTechDetect vulnerabilities in codeLeveraging machine learning to find security vulnerabilities2022
112UberDelivery and mobilityDetect payment fraudProject RADAR: Intelligent Early Fraud Detection System with Humans in the Loop2022
113GojekDelivery and mobilityPredict food delivery timesHow We Estimate Food Debarkation Time With 'Tensoba'2022
114UberDelivery and mobilityPredict estimated time of arrivalDeepETA: How Uber Predicts Arrival Times Using Deep Learning2022
115TrivagoTravel,E-commerce and retailOptimize accommodation rankingExplore-exploit dilemma in Ranking model2022
116GoustoDelivery and mobilityRecommend food items and recipesGousto R-series Vol 2: Tackling the Cold-Start Problem in Recipe Recommendation Engine2022
117SpotifyMedia and streamingForecast user activity metricsHow We Built Infrastructure to Run User Forecasts at Spotify2022
118GoogleTechSummarize conversationsConversation Summaries in Google Chat2022
119AirbnbTravel,E-commerce and retailImprove travel search experienceBuilding Airbnb Categories with ML and Human-in-the-Loop2022
120UberDelivery and mobilitySend timely push notificationsHow Uber Optimizes the Timing of Push Notifications using ML and Linear Programming2022
121MetaSocial platformsPersonalize daily digest notificationsImproving Instagram notification management with machine learning and causal inference2022
122InstacartDelivery and mobilityRecommend relevant food itemsPersonalizing Recommendations for a Learning User2022
123ExpediaTravel,E-commerce and retailRank relevant travel dealsHow to Optimise Rankings with Cascade Bandits2022
124DoordashDelivery and mobilityPersonalize recommendations on homepageHomepage Recommendation with Exploitation and Exploration2022
125LinkedinSocial platformsImprove post search functionalityImproving Post Search at LinkedIn2022
126ArtefactTechEvaluate success of past promotionsForecasting something that never happened: how we estimated past promotions profitability2022
127DoordashDelivery and mobilityFind high-value merchantsBuilding the Model Behind DoorDash’s Expansive Merchant Selection2022
128GrammarlyTechSuggest text editsUnder the Hood of the Grammarly Editor, Part Two: How Suggestions Work2022
129AmazonMedia and streamingSuggest music to listen toThe Amazon Music conversational recommender is hitting the right notes2022
130SnapSocial platformsRank relevant adsMachine Learning for Snapchat Ad Ranking2022
131InstacartE-commerce and retailAutocomplete user searches in e-commerceHow Instacart Uses Machine Learning-Driven Autocomplete to Help People Fill Their Carts2022
132ZillowE-commerce and retailSelect tags for product listingsHelping Home Shoppers Find a Home to Love Through Home Insights2022
133NetflixMedia and streamingDetect account or content fraudMachine Learning for Fraud Detection in Streaming Services2022
134AirbnbTravel,E-commerce and retailImprove customer supportHow AI Text Generation Models Are Reshaping Customer Support at Airbnb2022
135LinkedinSocial platformsPredict churn and upsell productsThe journey to build an explainable AI-driven recommendation system2022
136AutotraderE-commerce and retailPersonalize automotive search resultsReal-Time Personalisation of Search Results with Auto Trader's Customer Data Platform2022
137PelotonTechRecommend fitness training videosHow We Built: An Early-Stage Machine Learning Model for Recommendations2022
138WalmartE-commerce and retailCategorize e-commerce productsSemantic Label Representation with an Application on Multimodal Product Categorization2022
139DoordashDelivery and mobilitySearch food and grocery items3 Changes to Expand DoorDash’s Product Search Beyond Delivery2022
140FaireE-commerce and retailRank e-commerce items (feature store)Real-time ranking at Faire part 2: the feature store2022
141New York TimesMedia and streamingPersonalize paywall limitsHow The New York Times Uses Machine Learning To Make Its Paywall Smarter2022
142LinkedinSocial platformsPredict ad click-through rateChallenges and practical lessons from building a deep-learning-based ads CTR prediction model2022
143ZillowE-commerce and retailIdentify customers that are likely to convertIdentifying High-Intent Buyers2022
144NetflixMedia and streamingRecommend content to viewReinforcement Learning for Budget Constrained Recommendations2022
145WalmartE-commerce and retailForecast anomalies in refrigerationForecast Anomalies in Refrigeration with PySpark & Sensor-data2022
146Stitch FixE-commerce and retailRecommend e-commerce itemsClient Time Series Model: a Multi-Target Recommender System based on Temporally-Masked Encoders2022
147GojekDelivery and mobilityPredict estimated time of deliveryHow We Estimate Food Debarkation Time With ‘Tensoba’2022
148ZillowE-commerce and retailExtract text featuresIncorporating Listing Descriptions into the Zestimate2022
149EtsyE-commerce and retailRank marketplace search resultsDeep Learning for Search Ranking at Etsy2022
150WalmartE-commerce and retailCurate e-commerce product recommendationsScaling Product Recommendations using Basket Analysis- Part 12022
151LyftDelivery and mobilityOptimize trip pricePricing at Lyft2022
152GrammarlyTechCorrect grammatical errorsInnovating the Basics: Achieving Superior Precision and Recall in Grammatical Error Correction2022
153TwitterSocial platformsRecommend accounts to followModel-based candidate generation for account recommendations2022
154AirbnbTravel,E-commerce and retailImprove customer travel experienceIntelligent Automation Platform: Empowering Conversational AI and Beyond at Airbnb2022
155SwiggyDelivery and mobilityFlag incorrectly captured locationsUsing deep learning to detect dissonance between address text and location2022
156UberDelivery and mobilityVerify documentsUber’s Real-Time Document Check2022
157WayfairE-commerce and retailOptimize email sending time and frequencyNightingale: Scalable Daily Sales Email Sending Decision Model2022
158Didact AIFintech and bankingPredict stock pricesDidact AI: The anatomy of an ML-powered stock picking engine2022
159WayfairE-commerce and retailIdentify specific entities within a textWayfair’s New Approach to Aspect Based Sentiment Analysis Helps Customers Easily Find “Long Tail” Products2022
160OdaDelivery and mobilityPredict driver's non-driving timeHow we went from zero insight to predicting service time with a machine learning model — Part 2/22022
161WayfairE-commerce and retailPredict intent in customer support messagesBuilding Wayfair’s First Virtual Assistant: Automating Customer Service by Text Based Intent Prediction2022
162LinkedinSocial platformsEstimate the impact of product changesOcelot: Scaling observational causal inference at LinkedIn2022
163GrabDelivery and mobility,Banking and financeDetect fraud with graph modelsGraph for fraud detection2022
164LyftDelivery and mobilityMake causally valid forecastsCausal Forecasting at Lyft (Part 1)2022
165GlassdoorSocial platformsRecommend interesting posts to usersPersonalized Fishbowl Recommendations with Learned Embeddings: Part 22022
166NetflixMedia and streamingImprove video quality at scaleFor your eyes only: improving Netflix video quality with neural networks2022
167GlassdoorSocial platformsRecommend interesting posts to usersPersonalized Fishbowl Recommendations with Learned Embeddings: Part 12022
168DailymotionMedia and streamingRecommend diversified video contentOptimizing video feed recommendations with diversity: Machine Learning first steps2022
169Siemens HealthineersTechOptimize software testingUsing Machine Learning for Fast Test Feedback to Developers and Test Suite Optimization2022
170LyftDelivery and mobilityMake causally valid forecastsCausal Forecasting at Lyft (Part 2)2022
171LinkedinSocial platformsDeliver more relevant job recommendationsImproving job matching with machine-learned activity features2022
172CookidooE-commerce and retailPersonalize recipe recommendationsBuilding A Recipe Recommender System For the Thermomix on Cookidoo – Part 12022
173LinkedinSocial platformsImprove ML model performance with multitask learningApplying multitask learning to AI models at LinkedIn2022
174NetflixMedia and streamingApply causality in experiments and marketingA Survey of Causal Inference Applications at Netflix2022
175PinterestSocial platformsRecommend bids for advertizersAdvertiser Recommendation Systems at Pinterest2021
176GrubhubDelivery and mobilityForecast volume order“I See Tacos In Your Future”: Order Volume Forecasting at Grubhub2021
177SlackTechDetect spam invitesBlocking Slack Invite Spam With Machine Learning2021
178FaireE-commerce and retailSearch and navigate marketplace itemsBuilding Faire’s new marketplace ranking infrastructure2021
179DoordashE-commerce and retailPredict delivery supply and demandManaging Supply and Demand Balance Through Machine Learning2021
180OLXE-commerce and retailRecommend e-commerce itemsItem2Vec: Neural Item Embeddings to enhance recommendations2021
181DropboxTechSearch by image contentHow image search works at Dropbox2021
182ScribdMedia and streamingExtract metadata from documentsInformation Extraction at Scribd2021
183MicrosoftTechRank customer support casesML and customer support (Part 1): Using Machine Learning to enable world-class customer support2021
184Stitch FixE-commerce and retailRecommend e-commerce inventoryAlgorithm-Assisted Inventory Curation 2021
185TwitterSocial platformsForecast resource usage and costForecasting SQL query resource usage with machine learning2021
186GoogleTechSuggest past photos to look atA snapshot of AI-powered reminiscing in Google Photos2021
187UberDelivery and mobilityIdentify cash intermediariesApplying Machine Learning in Internal Audit with Sparsely Labeled Data2021
188MicrosoftTechCluster customer support issues by similarityML and customer support (Part 2): Leveraging topic modeling to identify the top investment areas in support cases2021
189GoustoDelivery and mobilityRecommend food items and recipesGousto R-series vol 1: Three tales of the Rouxcommender family2021
190AppleTechRecognize people in photosRecognizing People in Photos Through Private On-Device Machine Learning2021
191PinterestSocial platformsFind lookalike users for ad targetingThe machine learning behind delivering relevant ads2021
192PinterestSocial platformsDetect spam usersFighting Spam using Clustering and Automated Rule Creation2021
193PayPalFintech and bankingDetect payment fraudDeploying Large-scale Fraud Detection Machine Learning Models at PayPal2021
194DattoTechPredict hard drive failuresPredicting Hard Drive Failure with Machine Learning2021
195BumbleSocial platformsDetect rude messagesMultilingual message content moderation at scale (part 2)2021
196NextdoorSocial platformsSend relevant and timely updatesNextdoor Notifications: How we use ML to keep neighbors informed2021
197DropboxTechIdentify best time for renewal chargeOptimizing payments with machine learning2021
198SwiggyDelivery and mobilityRank restaurants in searchLearning To Rank Restaurants2021
199BrexFintech and bankingClassify bank transactionsHow We Built a (Mostly) Automated System to Solve Credit Card Merchant Classification2021
200GrammarlyTechCapture what readers pay attention toATTN: How Grammarly’s NLP/ML Team Figured Out Where Readers Focus in an Email2021
201DoordashDelivery and mobilityExtract information from imagesHow DoorDash Quickly Spins Up Multiple Image Recognition Use Cases2021
202AppleTechIdentify best user experienceInterpretable Adaptive Optimization2021
203AirbnbTravel,E-commerce and retailData privacy and securityAutomating Data Protection at Scale, Part 22021
204Capital OneFintech and bankingIdentify suspicious account activityHow Machine Learning Can Help Fight Money Laundering2021
205WayfairE-commerce and retailAssign color names to productsFrom RGB to Descriptive Color Names: Wayfair's in-house color algorithms to improve customer shopping experience.2021
206Capital OneFintech and bankingAutomate incident managementAutomated detection, diagnosis & remediation of app failure2021
207PinterestSocial platformsDetect policy-violating commentsHow Pinterest powers a healthy comment ecosystem with machine learning2021
208SpotifyMedia and streamingPersonalize homepage content (podcasts, playlist, music)The Rise (and Lessons Learned) of ML Models to Personalize Content on Home (Part I)2021
209Stitch FixE-commerce and retailRecommend looksStitching together spaces for query-based recommendations2021
210OcadoE-commerce and retailForecast e-commerce grocery demandFinding the sweet spot2021
211WalmartE-commerce and retailCategorize e-commerce productsDeep Learning: Product Categorization and Shelving2021
212WalmartE-commerce and retailRecommend learning contentMozrt, a Deep Learning Recommendation System Empowering Walmart Store Associates with a Personalized Learning Experience2021
213WalmartE-commerce and retailIdentify refrigeration defrostPredicting Defrost in Refrigeration Cases at Walmart using Fourier Transform2021
214New York TimesMedia and streamingRecommend content to readMachine Learning and Reader Input Help Us Recommend Articles2021
215Mercado LibreE-commerce and retailForecast demand for e-commerce itemsMarketplace Forecasting: Sales or Demand? Why not both? Let’s find out!2021
216SwiggyDelivery and mobilityRank food dishes in searchUsing Deep Learning for Ranking in Dish Search2021
217PayPalFintech and bankingRecommend financial productsCross-Selling Optimization Using Deep Learning2021
218WayfairE-commerce and retailAutomate ads placement and biddingEvolution of Ads Bidding at Wayfair2021
219Capital OneFintech and bankingImprove cardholder experienceImproving Virtual Card Numbers with Edge Machine Learning2021
220ShopifyE-commerce and retailCategorize e-commerce productsUsing Rich Image and Text Data to Categorize Products at Scale2021
221ScribdMedia and streamingRecommend content to readEmbedding-based Retrieval at Scribd2021
222SwiggyDelivery and mobilityDetect fraud in online food deliveryDeFraudNet: An End-to-End Weak Supervision Framework to Detect Fraud in Online Food Delivery2021
223AmazonE-commerce and retailPredict coordinates of delivery locationUsing learning-to-rank to precisely locate where to deliver packages2021
224PayPalFintech and bankingPredict declined transactionsUsing Machine Learning to Improve Payment Authorization Rate2021
225StripeFintech and bankingDetect fraud in online paymentsA primer on machine learning for fraud detection2021
226SlackTechPredict Slack connect invitesEmail Classification2021
227WayfairE-commerce and retailRecommend furniture itemsMARS: Transformer Networks for Sequential Recommendation2021
228GrammarlyTechDetect grammatical errorsGrammatical Error Correction: Tag, Not Rewrite2021
229NordstromE-commerce and retailGenerate outfit combinationsAI-Created Outfits2021
230DoordashDelivery and mobilityDeliver orders on timeUsing ML and Optimization to Solve DoorDash’s Dispatch Problem2021
231ZillowE-commerce and retailRecommend similar homesImproving Recommendation Quality by Tapping into Listing Text2021
232LifenTechRecognize PDF layoutFast graph-based layout detection2021
233PayPalFintech and bankingPrevent repeated payment fraudHow PayPal Uses Real-time Graph Database and Graph Analysis to Fight Fraud2021
234BumbleSocial platformsDetect rude messagesMultilingual message content moderation at scale (part 1)2021
235SpotifyMedia and streamingPersonalize homepage content (podcasts, playlist, music)The Rise (and Lessons Learned) of ML Models to Personalize Content on Home (Part II)2021
236SwiggyDelivery and mobilityEstimate travel distanceLearning to Predict Two-Wheeler Travel Distance2021
237ExpediaTravel,E-commerce and retailPersonalize travel search resultsPersonalized Ranking Model for Lodging2021
238ScribdMedia and streamingClassify documentsCategorizing user-uploaded documents2021
239MetaSocial platformsPersonalize the newsfeed contentHow machine learning powers Facebook’s News Feed ranking2021
240GoogleTechCorrect grammatical errorsGrammar Correction as You Type, on Pixel 62021
241NubankFintech and bankingPredict conversions and attract new customersBeyond prediction machines2021
242GrammarlyTechCorrect grammatical errorsAdversarial Grammatical Error Correction2021
243ScribdMedia and streamingClassify user-uploaded documentsIdentifying Document Types at Scribd2021
244OdaDelivery and mobilityPredict driver's non-driving timeHow we went from zero insight to predicting service time with a machine learning model — Part 12021
245Mercado LibreE-commerce and retailPredict customer engagement and LTVCausal Inference — Estimating Long-term Engagement2021
246DailymotionMedia and streamingTarget contextual advertisingHow Deep Learning can boost Contextual Advertising Capabilities2021
247WayfairE-commerce and retailOptimize digital adsBuilding Scalable and Performant Marketing ML Systems at Wayfair2021
248WayfairE-commerce and retailShow relevant content to new customersShare of Voice Optimization Engine2021
249WayfairE-commerce and retailOptimize paid media marketingContextual Bandit for Marketing Treatment Optimization2021
250MicrosoftTechClassify cloud workload typesHow we used ML — and heuristic data labeling — to help customers with their cloud migration2021
251GithubTechHelp users find contribution opportunitiesHow we built the good first issues feature2020
252LinkedinSocial platformsServe personalized learning recommendationsA closer look at the AI behind course recommendations on LinkedIn Learning, Part 12020
253BumbleSocial platformsDerive information from imagesImage detection as a service2020
254GojekDelivery and mobilityGenerate names for pickup pointsHow Gojek Uses NLP to Name Pickup Locations at Scale2020
255MozillaTechPredict the outcome of software testsTesting Firefox more efficiently with machine learning2020
256AdyenFintech and bankingPredict probability of transaction successOptimizing payment conversion rates with contextual multi-armed bandits2020
257WayfairE-commerce and retailDetect payment fraudExplainable Fraud Detection2020
258LyftDelivery and mobilityProvide location suggestionsHow Lyft predicts a rider’s destination for better in-app experience2020
259ZillowE-commerce and retailGenerate floor plans from photosZillow Floor Plan: Training Models to Detect Windows, Doors and Openings in Panoramas2020
260LinkedinSocial platformsServe personalized learning recommendationsA closer look at the AI behind course recommendations on LinkedIn Learning, Part 22020
261DoordashDelivery and mobilityOptimize marketing spendingOptimizing DoorDash’s Marketing Spend with Machine Learning2020
262EtsyE-commerce and retailPersonalize e-commerce searchBringing Personalized Search to Etsy2020
263AirbnbTravel,E-commerce and retailRank travel search resultsImproving Deep Learning for Ranking Stays at Airbnb2020
264WayfairE-commerce and retailImprove search experience for new customersBayesian Product Ranking at Wayfair 2020
265TwitterSocial platformsPredict value of ad requestsUsing machine learning to predict the value of ad requests2020
266ZyngaGamingPersonalize push notification timingDeep Reinforcement Learning in Production Part 2: Personalizing User Notifications2020
267ZillowE-commerce and retailRank homes to buyGuided Search — Personalized Search Refinements to Help Customers Find their Dream Home2020
268PicnicDelivery and mobilityPredict delivery drop timesOptimal drop times using machine learning2020
269ShopifyE-commerce and retailCategorize e-commerce productsCategorizing Products at Scale2020
270GojekDelivery and mobilityTarget cross-sell to existing usersHow We Built a Matchmaking Algorithm to Cross-Sell Products2020
271PayPalFintech and bankingDetect payment fraudMulti-Domain Fraud Detection While Reducing Good User Declines2020
272OLXE-commerce and retailDetect stolen photosFighting fraud with Triplet Loss2020
273StripeFintech and bankingDetect fraud in online paymentsSimilarity clustering to catch fraud rings2020
274DoordashDelivery and mobilitySearch for restaurants and dishesThings Not Strings: Understanding Search Intent with Better Recall2020
275SpotifyMedia and streamingRecommend shortcuts for homepageReach for the Top: How Spotify Built Shortcuts in Just Six Months2020
276WayfairE-commerce and retailRecommend complementary productsThe Visual Complements Model (ViCs): Complementary Product Recommendations From Visual Cues2020
277DailymotionMedia and streamingAutomatically categorize videosHow we used Cross-Lingual Transfer Learning to categorize our content2020
278DuolingoTechTeaching foreign languagesHow Duolingo uses AI in every part of its app2020
279FirefoxTechAutomatically assign new untriaged bugsTeaching machines to triage Firefox bugs2019
280DropboxTechPredict files users search forUsing machine learning to predict what file you need next2019
281ZoominfoTechPredict data accuracyUsing Machine Learning to Determine Contact Accuracy Scores2019
282AirbnbTravel,E-commerce and retailRecommend marketplace itemsMachine Learning-Powered Search Ranking of Airbnb Experiences2019
283LyftDelivery and mobilityPredict location of traffic control elementsDetecting Stop Signs and Traffic Signals: Deep Learning at Lyft Mapping2019
284GojekDelivery and mobilityPersonalize search resultsThe Secret Sauce Behind Search Personalisation2019
285InstacartDelivery and mobilitySpot lost demandModeling the unseen2019
286AppleTechIdentify text languageLanguage Identification from Very Short Strings2019
287Stitch FixE-commerce and retailExtract information from customer notesGive Me Jeans not Shoes: How BERT Helps Us Deliver What Clients Want2019
288LyftDelivery and mobilityDetect errors in mapsHow Lyft Creates Hyper-Accurate Maps from Open-Source Maps and Real-Time Data2019
289KingGamingAutomate playtesting pipelineHuman-Like Playtesting with Deep Learning2019
290GojekDelivery and mobilityAnalyse the relevance of search resultsIs This What You Were Looking For?2019
291LyftDelivery and mobilityBuild a marketing automation platformBuilding Lyft’s Marketing Automation Platform2019
292WayfairE-commerce and retailModel upliftModeling Uplift Directly: Uplift Decision Tree with KL Divergence and Euclidean Distance as Splitting Criteria2019
293GojekDelivery and mobilityAccurately forecast demandUnder the Hood of Gojek’s Automated Forecasting Tool2019
294LyftDelivery and mobilityPredict rides and driver hoursMaking cohort-based long-term forecasts at Lyft2019
295LyftDelivery and mobilityPredict fraudulent activityFingerprinting fraudulent behavior2018
296NetflixMedia and streamingImprove streaming qualityUsing Machine Learning to Improve Streaming Quality at Netflix2018
297LyftDelivery and mobilityIdentify user fraudFrom shallow to deep learning in fraud2018
298InstacartE-commerce and retailPredict grocery item availabilityPredicting the real-time availability of 200 million grocery items2018
299LyftDelivery and mobilityPersonalize marketing offersEmpowering personalized marketing with machine learning2018
300InstacartE-commerce and retailOptimize food delivery logisticsSpace, Time and Groceries2017
301AirbnbTravel,E-commerce and retailPredict Value of HomesUsing Machine Learning to Predict Value of Homes On Airbnb2017
302NetflixMedia and streamingImprove Streamning QualityUsing Machine Learning to Improve Streaming Quality at Netflix2018
303Booking.comTravel,E-commerce and retail150 Successful Machine Learning Models150 Successful Machine Learning Models: 6 Lessons Learned at Booking.com2019
304ChicisimoFashion and retailGrow User base using vertical ML approchHow we grew from 0 to 4 million women on our fashion app, with a vertical machine learning approach2019
305AirbnbTravel,E-commerce and retailML Powered search rankingMachine Learning-Powered Search Ranking of Airbnb Experiences2019
306LyftDelivery and mobilityShallow to deep learning in fraudFrom shallow to deep learning in fraud2018
307UberDelivery and mobility100+ Petabytes with Minute LatencyUber's Big Data Platform: 100+ Petabytes with Minute Latency2018
308DropboxTechModern OCR with CV and DLCreating a Modern OCR Pipeline Using Computer Vision and Deep Learning2017
309UberTechScaling ML with MichelangeloScaling Machine Learning at Uber with Michelangelo2019

For more information, visit Evidently AI - ML System Design and ML Systems Design

Star History

Star History Chart