-
Notifications
You must be signed in to change notification settings - Fork 529
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MSA format for "unpairedMsa" in fold_input.json #47
Comments
Here is my suggesttion:
Example: {
"protein": {
"id": "A",
"sequence": "MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF",
"unpairedMsa": ">seq1\\nMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF\\n>seq2\\nMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTFFPHF",
"pairedMsa": "",
"templates": []
}
} Considerations:
|
Thanks @Hanziwww . Does that input.json work for you? For me, it does not recognize the first sequence of the MSA (looks like it reads an empty sequence):
|
Hi @smg3d, You're absolutely right—I made a mistake in my previous response. The newline character in JSON strings should be represented as Here's the corrected JSON input: {
"name": "My AlphaFold Job",
"modelSeeds": [1],
"sequences": [
{
"protein": {
"id": "A",
"sequence": "MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF",
"unpairedMsa": ">seq1\nMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF\n>seq2\nMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTFFPHF",
"pairedMsa": "",
"templates": []
}
}
],
"dialect": "alphafold3",
"version": 1
} Here's how you can run AlphaFold using Docker with the corrected JSON: docker run -it \
--volume /home/mars/disk3/af3input:/root/af_input \
--volume /home/mars/disk3/af3output:/root/af_output \
--volume /home/mars/disk3/af3md:/root/models \
--volume /home/mars/disk3/af3db:/root/public_databases \
--gpus all alphafold3 \
python run_alphafold.py \
--json_path=/root/af_input/fold_input.json \
--model_dir=/root/models \
--output_dir=/root/af_output output cif: my_alphafold_job_model.zip Sorry for misleading. |
Thanks @Hanziwww . It works now. I think it might be a good idea to show such an example in the input doc:
|
I'm glad to hear that the input is working now. By the way, I'd like to introduce a user-friendly graphical interface that I developed to solve the JSON generation issue and running AlphaFold 3 predictions. Feel free to check out GUI repository. |
It works. Thanks for sharing. May I ask if it's ok to skip pairedMsa and templates in terms of model performance? I didn't see much differences on my end though. |
Certainly. According to the guidelines, you can skip pairedMsa and templates, but when using unpairedMsa, these parameters still need to be present (even if they're left empty). |
Thanks for providing the AF3 source. it is really appreciated.
I could not find the format to use in order to provide our own MSA in the input json file.
The input documentation mentions "If the unpairedMsa field is set to a custom A3M string, AlphaFold 3 will use the provided MSA instead of building one as part of the data pipeline. This is considered an expert option.". But what is the format of the "custom A3M string"
The doc provides the two following examples, but does not show the string or list format for
unpairedMsa
and
For
"unpairedMsa":
I tried filename and various list formats, but none are working.The text was updated successfully, but these errors were encountered: