Skip to content

This project to implement different way to scale up AI system like using kafka, improve performance using batch prediction, ....

Notifications You must be signed in to change notification settings

sonhm3029/Scalable-AI-Systems-for-Real-World-Applications

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scalable-AI-Systems-for-Real-World-Applications

Batch prediction

This help improve performance of system. Because normaly, each model can predict in batch.

The main idea is store incomming request and wait for at least one of these following condition is satisfied:

  • The number of request reach threshold (batch size)

  • The timeout exceed

To response to exact client, there are 2 ways:

  • Using socket

  • Client send 2 request (1 for send request for making prediction, 1 is a long poll request send immediately after the first one to get the prediction result)

About

This project to implement different way to scale up AI system like using kafka, improve performance using batch prediction, ....

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages