How To Build a Concurrent Web Scraper with Puppeteer, Node.js, Docker, and Kubernetes
In this tutorial, you will use Node.JS, Puppeteer, Docker, and Kubernetes to build a highly scalable scraper that can simultaneously extract data from multiple pages. You will test your scraper on the dummy bookstore, books.toscrape.com, where you will scrape data from 400 pages in ~30 seconds.