Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TX: votes: catch about 1200 vote events that were missing #5152

Merged
merged 2 commits into from
Dec 26, 2024

Conversation

jessemortenson
Copy link
Contributor

Problems that were causing vote events to be missed:

  • House journals sometimes have a "C" variant ("continuing") that was not getting scraped
  • HJR/SJR bill numbers were being missed
  • variants of committee substitute bills (which start with CS or CH) were being missed. Previously only CSSB and CSHB were being caught

Also did some mild refactor to make debugging easier. For example, you can now specify a url_match=88RSJ03-15-F.HTM to scrape one particular URL for testing. I think this whole scraper would benefit from a more procedural rewrite, as the abstractions used in here make it significantly more difficult to understand what's happening and make improvements.

@jessemortenson jessemortenson merged commit d28034d into main Dec 26, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant