Skip to content
This repository has been archived by the owner on Mar 16, 2024. It is now read-only.
/ amazon-scraper Public archive

🏙️ (Live Link) (2022) Navigates to amazon, searches for samsung phones and pulls the title and price data. I highly recommend working with Linux (including virtual machines) or MacOs.

License

Notifications You must be signed in to change notification settings

kkamara/amazon-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This project is now deprecated. Please use PHP Scraper.

selenium-py.png

Amazon Scraper API

(2022) Navigates to amazon, searches for samsung phones and pulls the title and price data. I highly recommend working with Linux (including virtual machines) or MacOs.

Important note:

Before you try to scrape any website, go through its robots.txt file. You can access it via domainname/robots.txt. There, you will see a list of pages allowed and disallowed for scraping. You should not violate any terms of service of any website you scrape.

Proven in a production environment

Getting up and running on amazon ec2.

Requirements

Installation

cp .env.example .env
touch database/database.sqlite
composer i
make dev
# optional
# make backend-migrate
# (optional)
# npm install
# npm run dev

Using Docker?

docker build -t laravel-docker-aws .
docker run -it -p 8001:80 laravel-docker-aws

Usage

Update the command at ./app/Console/Commands/BrowseAmazon.php

php artisan browse:amazon

BrowserInvoker.php

Browser.php

Adding a new command

php artisan make:crawler crawler_test

Browser Testing

  alias sail='vendor/bin/sail'
  sail dusk

Mail Server

docker-mailhog3.png

Mail environment credentials are at .env.

The mailhog docker image runs at http://localhost:8025.

Misc

See php scraper.

See php reactjs boilerplate.

See python amazon scraper 2.

Using Laravel dusk outside of tests.

Running ChromeDriver and Selenium in Python on an AWS EC2 Instance.

The Makefile for this project contains useful commands for a Laravel application and can be found at laravel-makefile.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

BSD

About

🏙️ (Live Link) (2022) Navigates to amazon, searches for samsung phones and pulls the title and price data. I highly recommend working with Linux (including virtual machines) or MacOs.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published