Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some entries have huge number of interfaces and thus need a lot of storage #205

Closed
josemduarte opened this issue Jan 23, 2018 · 4 comments
Closed
Milestone

Comments

@josemduarte
Copy link
Contributor

E.g. 4aqq, 933 interfaces: 274 MB needed to store all files for it. The problem here is the way we expand operators treats many identical symmetric copies as different (NCS-like) interfaces.

It'd be good to find a solution that can reduce the number of interfaces we store and display for these cases.

@josemduarte josemduarte added this to the 3.0.5 milestone Jan 23, 2018
@josemduarte
Copy link
Contributor Author

josemduarte commented Jan 23, 2018

See also #141, another issue affecting entries with NCS operators

And also #210

@josemduarte
Copy link
Contributor Author

4aqq happens to have many clashes in some interfaces, perhaps that's also the issue here. The calculation would abort with a "too many clashes" error in 3.0.3 and before. 3.0.4-SNAPSHOT changes the aborting on too many clashes behaviour.

@josemduarte
Copy link
Contributor Author

Another additional problem: pymol gets stuck in producing the assembly files for some of these entries, probably because the script that we pass to it is too long.

An example where I saw this issue: 4cdq, it has 487 interfaces in total. It produces all of the interfaces files but then it freezes when running pymol to produce the assembly file for assembly 1. The pymol process is still alive but zombie.

@josemduarte
Copy link
Contributor Author

josemduarte commented Feb 1, 2018

And an additional issue: interface calculation takes a very long time for these entries. For instance 15 minutes for 4cdq. That's because we consider symmetric copies to be different, so there's a lot of redundant ASA calculation.

A better example: 5lwg takes 45 minutes (1652 interfaces).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant