Wasilp/challenge-collecting-data

challenge-collecting-data

Real estate scraping script for zimmo.be

This project was carried out as the first group project of the Thomas promotion.

Technologies

Python 3, Selenium, BeautifulSoup, threading, concurrent.futures.

Installation

pip install -r requirements.txt

How to use it

To use the script, first uncomment the lines in main that collect the URLs of the houses and apartments for sale.
Collect the houses first, then the apartments.
The script goes through every subtype URL to get more data; otherwise a search is limited to 100 pages.
Once you have all the URLs, comment those lines out again and uncomment everything below 'COLLECTING DATA FROM HERE'.
You can set max workers to any limit you want, but for now I would not recommend going beyond 4-5, as the code still needs to be improved for that.
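
As an illustration of the max workers setting, here is a minimal sketch of a bounded thread pool built on `concurrent.futures`. It is not the code from main: `collect_all`, `fake_scrape`, `MAX_WORKERS` and the example.com URLs are hypothetical names used only for this example, and `fake_scrape` stands in for the real page scraper.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

MAX_WORKERS = 4  # assumption: stays within the 4-5 workers suggested above


def collect_all(urls, scrape_one, max_workers=MAX_WORKERS):
    """Run scrape_one(url) for every URL on a small thread pool.

    Returns (results, failures): results maps url -> scraped value,
    failures maps url -> the exception raised for that url.
    """
    results, failures = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        future_to_url = {pool.submit(scrape_one, url): url for url in urls}
        for future in as_completed(future_to_url):
            url = future_to_url[future]
            try:
                results[url] = future.result()
            except Exception as exc:  # one failed page should not stop the run
                failures[url] = exc
    return results, failures


def fake_scrape(url):
    # Hypothetical stand-in for the real Selenium/BeautifulSoup scraper.
    if not url.startswith("https://"):
        raise ValueError(f"unexpected URL: {url}")
    return {"url": url}


if __name__ == "__main__":
    urls = ["https://example.com/listing/1", "https://example.com/listing/2"]
    results, failures = collect_all(urls, fake_scrape)
    print(len(results), "scraped,", len(failures), "failed")
```

Keeping failures in a separate dictionary makes it possible to retry only the pages that failed, for instance with fewer workers.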

Pending things to do

A script still needs to be integrated to detect captchas and handle them automatically. The scraper also needs to run more than one worker without being detected.
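
A possible starting point for the detection part is sketched below. It only checks whether a page looks like a captcha challenge and does not solve anything. The marker strings are assumptions, since the actual challenge page served by the site may look different, and `looks_like_captcha` is a hypothetical helper name.

```python
import re

# Assumed markers; adjust them to whatever the real challenge page contains.
CAPTCHA_PATTERN = re.compile(r"captcha|are you a robot", re.IGNORECASE)


def looks_like_captcha(page_source: str) -> bool:
    """Return True if the HTML appears to be a captcha challenge page."""
    return bool(CAPTCHA_PATTERN.search(page_source))
```

With Selenium, this could be called on `driver.page_source` after each page load, so that a worker pauses or stops instead of storing an empty result.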

Copyright (C) 11/11/2020 Pierre Wasilewski, Nooreyni Ousmane DIOP, hajrashidimad
