Skip to content

GallardoEstelle/web_scraping_python

 
 

Repository files navigation

Serverless Web Scraping in Python for AI, Fun and Profit

(Using Step Functions and Lambdas)

This material is also covered in Chapter 7 of Pragmatic AI

Web Scraping for AI/ML consists of three phases:

A. Doing the Work
B. Scheduling the Work
C. Modeling the Work

A. Web Scraping Techniques (Doing the Work)

B. Orchestrating Retrieval (Scheduling the Work)

Step Function Workflow

C. Wiring Results into A Machine Learning Pipeline (Modeling the Work)

  • TBD

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.9%
  • Other 0.1%