Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No login parse #885

Merged
merged 15 commits into from
Nov 5, 2024
Merged

No login parse #885

merged 15 commits into from
Nov 5, 2024

Conversation

martig7
Copy link
Contributor

@martig7 martig7 commented Aug 23, 2024

Issue

closes #861. Scrapes SIS and catalog.rpi.edu using beautiful soup and selenium. The full scrape should take ~15 mins depending on how many browser instances you give it. It should be extremely accurate with prerequisites, corequisites, and descriptions now too. Also gets extra information from professor Goldschmidt's website.

Test Procedure

Run no_login.py. Change the parameters in name == "main" to change the term.

@martig7 martig7 self-assigned this Sep 16, 2024
@martig7 martig7 added Review Ready! Inform team that PR is ready for review python Pull requests that update Python code Priority 2 Important Issue Priority 1 Critical Issue and removed Priority 2 Important Issue labels Sep 16, 2024
@becausej
Copy link
Collaborator

becausej commented Nov 1, 2024

Reviewed.

Copy link
Contributor

@dorian451 dorian451 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

seems to work

@dorian451 dorian451 merged commit 5153051 into YACS-RCOS:master Nov 5, 2024
1 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Priority 1 Critical Issue python Pull requests that update Python code Review Ready! Inform team that PR is ready for review
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Feature Request — Webscrape without Logging In
3 participants