Web scraping for social scientists

Introduction

There is an unprecedented amount of information on the internet that could be harvested to build social science research datasets.

This half-day course will showcase suitable techniques for web scraping.

The value, logic and process of capturing data stored on websites will be described in detail, and practical examples and exercises will be demonstrated using the Python programming language.
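
As a rough illustration of the process described above, the sketch below extracts structured records from a page's HTML using only the Python standard library. It is not taken from the course notebooks: the sample HTML, the `LinkCollector` class and the `scrape_links` function are hypothetical names invented for this example, and the page is an inline string rather than a live website so that the code runs without a network connection.

```python
from html.parser import HTMLParser

# Hypothetical HTML standing in for a downloaded web page (an assumption
# made for illustration; it is not drawn from the course materials).
SAMPLE_PAGE = """
<html><body>
  <h1>Registered charities</h1>
  <ul>
    <li><a href="/charity/1">Example Trust</a></li>
    <li><a href="/charity/2">Sample Foundation</a></li>
  </ul>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collect the text and href of every <a> element that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []    # finished records: {"name": ..., "url": ...}
        self._href = None  # href of the <a> currently open, if any
        self._text = []    # text fragments seen inside the open <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            name = "".join(self._text).strip()
            self.links.append({"name": name, "url": self._href})
            self._href = None


def scrape_links(html):
    """Return one record per link found in the given HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


records = scrape_links(SAMPLE_PAGE)
for record in records:
    print(record["name"], record["url"])
```

With a real website, the HTML string would first be downloaded (for example with `urllib.request.urlopen`), and it is worth checking the site's terms of use and `robots.txt` before collecting data.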

It is most suited to empirical social science researchers but will be of value to researchers from a wide range of disciplines (e.g., digital humanities).

Course materials

This repository houses the materials underpinning a half-day SGSSS course on web scraping run by Dr Diarmuid McDonnell, University of the West of Scotland. The course was first run on 2024-06-05.

Programme

The course programme can be viewed here.

Materials

The training materials can be found in the following folders:

code - Jupyter Notebooks containing executable Python code for the web scraping lessons.
installation - Guidance on installing Python and Jupyter Notebooks.
presentations - PDF versions of the course lectures.
reading - Lists of interesting and relevant online articles about web scraping.

Acknowledgements

I am grateful to the Scottish Graduate School of Social Sciences (SGSSS) for funding this course and for its continued commitment to high-quality methods training for social scientists.

Further information

Please do not hesitate to get in contact if you have queries, criticisms or ideas regarding these materials: Dr Diarmuid McDonnell

