System Design for Recommendations and Search - Eugene Yan

2022/01/24


  • Typically
    • Batch
      • Pre-computed
      • Decouple compute from serving
      • Lower operational load
      • Make prediction for everyone
    • Real-time
      • Responsive to time-sensitive context
      • Reduced cost on non-visiting users
      • Make prediction per request
      • Higher cost on operation
  • In this talk, we will focus on real-time aka on-demand

Overview

System Design for Recommendations and Search

Taobao

Taobao Architecture

Facebook

facebook embedding system overview

Taobao - ATBRG

taobao-atbrg-system-overview

Doordash

doordash-understanding-search-intent-with-better-recall


Retrieval

The Youtube Recommendations (DNN)

deep-neural-networks-for-youtube-recommendations

Instagram (word2vec)

ig2vec

Ranker

The Youtube Recommendations (via DNN)

youtube-recommendation-ranker

Alibaba (via Transformer)

bst-ranker

Offline-to-online transitions

offline-to-online-transitions


Reference:

#recommender-system #recommender-system/archtecture #search