Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ARCH text files derivatives. #541

Merged
merged 7 commits into from
Jun 17, 2022
Merged

Add ARCH text files derivatives. #541

merged 7 commits into from
Jun 17, 2022

Conversation

ruebot
Copy link
Member

@ruebot ruebot commented Jun 13, 2022

What does this Pull Request do?

Add ARCH text files derivatives.

  • Add css, html, js, json, plain text, and xml information extraction
    methods
  • Add app extractors
  • Add Python implementation of extractors
  • Add tests
  • Resolves Add ARCH text files derivatives #540

How should this be tested?

- Add css, html, js, json, plain text, and xml information extraction
  methods
- Add app extractors
- Add Python implementation of extractors
- Add tests
- Resolves #540
@codecov
Copy link

codecov bot commented Jun 13, 2022

Codecov Report

Merging #541 (905161d) into main (2b8b717) will increase coverage by 0.82%.
The diff coverage is 98.10%.

@@             Coverage Diff              @@
##               main     #541      +/-   ##
============================================
+ Coverage     93.05%   93.87%   +0.82%     
- Complexity       42       48       +6     
============================================
  Files            39       44       +5     
  Lines           835      980     +145     
  Branches         52       52              
============================================
+ Hits            777      920     +143     
- Misses           35       36       +1     
- Partials         23       24       +1     

ruebot added a commit to archivesunleashed/notebooks that referenced this pull request Jun 14, 2022
ruebot added a commit to archivesunleashed/aut-docs that referenced this pull request Jun 14, 2022
@ruebot ruebot requested a review from ianmilligan1 June 14, 2022 17:11
@ruebot ruebot marked this pull request as ready for review June 14, 2022 17:11
ruebot added a commit to archivesunleashed/aut-docs that referenced this pull request Jun 14, 2022
ruebot added a commit to archivesunleashed/aut-docs that referenced this pull request Jun 17, 2022
#119)

* Documentation updates for archivesunleashed/aut#541
* Missed crawl date, update binary information, remove redundant image information (looks like it was never fully removed).
@ruebot ruebot merged commit 8172855 into main Jun 17, 2022
@ruebot ruebot deleted the issue-540 branch June 17, 2022 14:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add ARCH text files derivatives
2 participants