I've read Burkov's *The Hundred-Page Machine Learning Book* and it was great, so on that basis I'd also recommend his more recent book on ML engineering, although I haven't read that one myself.
Hands-On ML is quite famous, and it has some chapters on scalability and TFRecords.
I've personally deployed a simple pre-baked sklearn matrix factorization model with FastAPI and Docker, with the documentation open in another browser tab the whole time. It was quite manageable, but probably not robust enough for anything beyond a hobbyist project.
If I were working on something that needed serious uptime and scalability, I'd start by looking at TF Serving. Leveraging everything inside TF Serving is likely overkill, though; you may be satisfied with a pipeline where your research team exports the models (API Doc Link) and you deploy them with Cortex.
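The export step is small either way. A rough sketch, assuming a Keras model (the toy model and output path here are placeholders for whatever your research team actually produces):

```python
# Hedged sketch: exporting a trained model in the SavedModel format
# that TF Serving (or Cortex) consumes. The toy model and path are
# placeholders, not a real pipeline.
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(4,)),
    tf.keras.layers.Dense(1),
])

# TF Serving expects versioned subdirectories: <model_name>/<version>/
export_path = "/tmp/my_model/1"
tf.saved_model.save(model, export_path)
```

TF Serving then watches the model directory and picks up new version subdirectories as they appear, which is what makes the "research team exports, ops deploys" split workable.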