Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Development of PDBManager Class (WIP) #272

Merged
merged 131 commits into from
Mar 31, 2023
Merged
Show file tree
Hide file tree
Changes from 15 commits
Commits
Show all changes
131 commits
Select commit Hold shift + click to select a range
5578157
add PDB manager #270
a-r-j Feb 24, 2023
8c177e1
add download method
a-r-j Feb 24, 2023
1d2bb0b
add clustering utilities
a-r-j Feb 24, 2023
6a10c9d
Add dataset splits functionality and add new documentation
amorehead Feb 25, 2023
3012ffe
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 25, 2023
d304a06
Resolve merge conflicts with remote
amorehead Feb 25, 2023
228946f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 25, 2023
a2f1424
Remove unused test
amorehead Feb 25, 2023
a6d2137
Address lingering SonarCloud concerns
amorehead Feb 25, 2023
052801b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 25, 2023
685a2e6
add deposition date parsing
a-r-j Feb 26, 2023
abeef32
remove pdb.py
a-r-j Feb 26, 2023
5412622
add chain extraction util
a-r-j Feb 26, 2023
e4ffe3d
add chain writing method
a-r-j Feb 26, 2023
ae8a246
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 26, 2023
f520f98
After fixing merge conflicts, add more filters and add time-based splits
amorehead Feb 26, 2023
22122b3
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 26, 2023
c4d8de4
Fix up SonarCloud concerns
amorehead Feb 26, 2023
67bc788
Improve verbiage surrounding PDB resolutions
amorehead Feb 27, 2023
3b42dab
Simplify code and improve variable names
amorehead Feb 27, 2023
9ed7171
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
1878f78
Track names of splits in df_splits
amorehead Feb 27, 2023
67524a8
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
d252d2e
Fix column naming during merging of DataFrame splits
amorehead Feb 27, 2023
81e4c23
add additional properties
a-r-j Feb 27, 2023
279ad3e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
1baf309
refactor clustering to allow file caching and overwriting
a-r-j Feb 27, 2023
e27c11c
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Feb 27, 2023
408ab6a
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
f9e8d8d
add description to assert statements
a-r-j Feb 27, 2023
bf36dc2
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Feb 27, 2023
af2818b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
b6c52d8
Add extra documentation around clustering function, and address small…
amorehead Feb 27, 2023
407a80c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
f6e4e40
add method to write selection to CSV
a-r-j Feb 27, 2023
7821d9f
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Feb 27, 2023
59b95b2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
3b7d4c9
improve from_fasta documentation
a-r-j Feb 27, 2023
7797e29
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Feb 27, 2023
7a66e91
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
7a6ff72
Enable code reuse for length filters
amorehead Feb 27, 2023
49b594d
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
624ed1b
Minor documentation changes to FASTA write-out function
amorehead Feb 27, 2023
a36c960
Add ability to perform most API calls for a subset of splits
amorehead Feb 27, 2023
88f5f14
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
45c5a6f
Update .gitignore
amorehead Feb 27, 2023
44663bb
Fix missing download call, and add more documentation to download fun…
amorehead Feb 27, 2023
ed6c3e4
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
fd56f47
Fix small bug when merging different splits together
amorehead Feb 27, 2023
8df967c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 27, 2023
540662b
Fix bug in length filtering functions, fix print bugs in utils, and a…
amorehead Feb 28, 2023
0c4c39e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 28, 2023
9eac3e9
Fix string formatting
amorehead Feb 28, 2023
8462fa5
Update PDB write-out logic and documentation
amorehead Feb 28, 2023
0fb19cc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 28, 2023
2ba6c99
Add PDB download workaround for PDBs that can no longer be downloaded
amorehead Feb 28, 2023
587f900
Make exception more specific
amorehead Mar 1, 2023
055a192
Add TQDM for data split exporting
amorehead Mar 2, 2023
35d1064
Enable PDBManager root to be set to an arbitrary location
amorehead Mar 21, 2023
ff675bd
Merge branch 'a-r-j:pdb_manager' into pdb_manager
amorehead Mar 21, 2023
9ae9375
add initial tests
a-r-j Mar 21, 2023
054ff60
update changelog
a-r-j Mar 21, 2023
8bb6e63
add tutorial notebook
a-r-j Mar 21, 2023
3a316ba
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Mar 21, 2023
e1ac7b0
Allow all chains in a complex to be exported together
amorehead Mar 21, 2023
a4969d6
add module-level import
a-r-j Mar 22, 2023
efb3bc4
Remove old, unused PDBManager prototype file
amorehead Mar 22, 2023
7335baa
add parsing & checks for unavailable PDB structures
a-r-j Mar 25, 2023
f8ad185
fix download checker
a-r-j Mar 25, 2023
f8ae78e
actually fix download checker
a-r-j Mar 25, 2023
1436907
add availability filter
a-r-j Mar 25, 2023
c0cdaf0
Default to export model 1's chains only in PDBManager, and clean-up n…
amorehead Mar 27, 2023
b7226a0
Merge branch 'master' into pdb_manager
a-r-j Mar 27, 2023
5ff520e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 27, 2023
e65935d
add tutorial nblink
a-r-j Mar 27, 2023
0c96f38
add tutorial to datasets sections
a-r-j Mar 27, 2023
4d1007c
mv pdb data to ml API
a-r-j Mar 27, 2023
64ec09f
rm pyg dataset import
a-r-j Mar 27, 2023
96f3ac5
rm unused code
a-r-j Mar 27, 2023
618bcee
fix annotation
a-r-j Mar 27, 2023
8d5ca7a
add MMTF download format
a-r-j Mar 29, 2023
d8e5c62
refactor dependency utils
a-r-j Mar 29, 2023
142c8d9
refactor graphein.utils.utils.import_message
a-r-j Mar 29, 2023
25f576c
refactor graphein.protein.utils.is_tool
a-r-j Mar 29, 2023
bd4c60c
update .gitignore
a-r-j Mar 29, 2023
746ad26
ignore cif too
a-r-j Mar 29, 2023
98f0075
ignore cif too
a-r-j Mar 29, 2023
0a09b00
ignore foldcomp files
a-r-j Mar 29, 2023
5ad764c
catch straggling erroneous imports
a-r-j Mar 29, 2023
fe84d40
ignore mol2
a-r-j Mar 29, 2023
be63c6d
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 29, 2023
dd7ce39
update folding utils
a-r-j Mar 29, 2023
9737fd6
add max batch option
a-r-j Mar 29, 2023
9b6c832
add foldcomp utils
a-r-j Mar 29, 2023
a899352
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Mar 29, 2023
2bf820d
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 29, 2023
ea0812d
add notebook updates [WIP]
a-r-j Mar 29, 2023
c82a344
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Mar 29, 2023
4a3bf5c
move manager class into graphein.ml
a-r-j Mar 29, 2023
cdcce18
remove datasets init
a-r-j Mar 29, 2023
88c6d91
fix import util refactor I didn't catch
a-r-j Mar 29, 2023
9e3315e
add PDBmanager to __init__
a-r-j Mar 30, 2023
9b11370
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 30, 2023
4f47c6b
fix oligomeric filtering
a-r-j Mar 30, 2023
81d4359
update notebook
a-r-j Mar 30, 2023
86d5118
fix dataset init
a-r-j Mar 30, 2023
5f1c525
fix protein.coord renaming in tensor module
a-r-j Mar 30, 2023
e572f40
add try/except to pyg-related datasets
a-r-j Mar 30, 2023
eacff28
add try/except to pyg-related datasets
a-r-j Mar 30, 2023
8417a0e
add mmseqs to CI build
a-r-j Mar 30, 2023
d1d713d
rollback dssp install to conda
a-r-j Mar 30, 2023
d727656
ignore pdb manager notebook in minimal tests
a-r-j Mar 30, 2023
4115bd5
fix code smell
a-r-j Mar 30, 2023
51ca8f6
fix metrics
a-r-j Mar 30, 2023
33926b6
shorten line lengths
a-r-j Mar 31, 2023
6f7f0c3
add minimum scipy version
a-r-j Mar 31, 2023
4f25b5c
remove python 3.7 from CI
a-r-j Mar 31, 2023
dc6112d
Add Torch 2.0.0 to CI
a-r-j Mar 31, 2023
78379f9
add note about multiple split strategies
a-r-j Mar 31, 2023
52efc31
add torch cluster install to CI
a-r-j Mar 31, 2023
cdbfef3
Merge branch 'pdb_manager' of https://github.com/amorehead/graphein i…
a-r-j Mar 31, 2023
9aae6e9
update dockerfile to torch 2.0
a-r-j Mar 31, 2023
418669c
switch docker pytorch 1.13 for VMD python version conflict
a-r-j Mar 31, 2023
6a9a234
switch out torchtyping for jaxtyping
a-r-j Mar 31, 2023
37ab1ec
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 31, 2023
0a8affb
update tensor shape syntax for jaxtyping
a-r-j Mar 31, 2023
ef5b811
update tensor shape syntax for jaxtyping
a-r-j Mar 31, 2023
ae1a524
remove torch-dependent tests from minimal install testing
a-r-j Mar 31, 2023
160409b
update test ignores
a-r-j Mar 31, 2023
b19df0d
install dssp from apt, rather than conda in docker
a-r-j Mar 31, 2023
cd4db9e
update typing extensions version
a-r-j Mar 31, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
14 changes: 14 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -148,3 +148,17 @@ datasets/examples/*
*.zip
datasets/regnetwork/human
notebooks/lightning_logs
cc-to-pdb.tdd
entries.idx
pdb_cluster_all_seqs.fasta
pdb_cluster_cluster.tsv
pdb_cluster_rep_seq.fasta
pdb_entry_type.txt
pdb_seqres.txt
pdb_seqres.txt.gz
pdb.fasta
resolu.idx
source.idx

# Local test directories
tmp/
Empty file added graphein/datasets/__init__.py
Empty file.
Loading