Machine Learning for Complete Intersection Calabi-Yau 3-folds

We use machine learning to predict the Hodge numbers of Complete Intersection Calabi-Yau (CICY) 3-folds in the framework of String Theory. We work with two datasets: the original dataset, which contains the configuration matrices of 7890 CICY manifolds, and the favourable dataset, which contains a favourable embedding for most of them.
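To make the input data concrete, the sketch below shows one way to represent a configuration matrix and to check the two standard conditions for a CICY 3-fold: the complete intersection has dimension 3, and the degrees in each row sum to n_i + 1 for the corresponding projective space P^{n_i}. The representation (a list of projective dimensions plus a list of rows of degrees) and the function name `is_cicy_threefold` are assumptions made for illustration, not the storage format of the datasets.

```python
from typing import List


def is_cicy_threefold(dims: List[int], degrees: List[List[int]]) -> bool:
    """Check the defining conditions of a CICY 3-fold configuration.

    dims[i] is the dimension n_i of the i-th projective space factor;
    degrees[i][j] is the degree of the j-th polynomial in that factor.
    """
    if not degrees or len(dims) != len(degrees):
        return False
    n_polys = len(degrees[0])
    if any(len(row) != n_polys for row in degrees):
        return False
    # Dimension of the complete intersection: ambient dimension minus
    # the number of defining polynomials.
    if sum(dims) - n_polys != 3:
        return False
    # Calabi-Yau condition: each row of degrees sums to n_i + 1.
    return all(sum(row) == n + 1 for n, row in zip(dims, degrees))


# The quintic hypersurface in P^4.
quintic = is_cicy_threefold([4], [[5]])
# The bicubic hypersurface in P^2 x P^2.
bicubic = is_cicy_threefold([2, 2], [[3], [3]])
# A sextic in P^4 violates the Calabi-Yau condition.
sextic = is_cicy_threefold([4], [[6]])
print(quintic, bicubic, sextic)
```

The projective dimensions are passed explicitly here for readability; when only the matrix of degrees is stored, they can be recovered from the row sums.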

The repository can be found on GitHub as thesfinox/ml-cicy.

Methodology

We start with a full exploratory data analysis to study the distribution of the data, its patterns and correlations, and to prepare the dataset for the analysis. In this section we do not perform statistical inference, but we do note reproducible patterns in the distribution of the data.

We then perform a full machine learning prediction analysis using several algorithms: we point out the pros and cons of each, and we discuss the underlying theory and how it can help improve the results. We use Bayesian statistics for hyperparameter optimisation.

Among the algorithms presented, we use several linear algorithms and support vector machines, as well as decision tree algorithms. We then implement our own neural networks, which improve the final accuracy by more than 25 percentage points over the best result obtained with the previous algorithms (we obtain 72% accuracy using the Gaussian kernel trick in the SVM regressor and 99%+ with the neural network).
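Since Hodge numbers are integers while regressors return real values, an accuracy figure requires a convention for comparing the two. The following is a minimal sketch, assuming that predictions are rounded to the nearest integer and counted as correct only on an exact match. The function name `rounded_accuracy` and the sample values are hypothetical, and the notebooks may use a different convention.

```python
import math
from typing import Sequence


def rounded_accuracy(y_true: Sequence[int], y_pred: Sequence[float]) -> float:
    """Fraction of predictions equal to the target after rounding."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute the accuracy of an empty sample")
    # math.floor(p + 0.5) rounds halves up; Python's round() would
    # round them to the nearest even integer instead.
    hits = sum(1 for t, p in zip(y_true, y_pred) if math.floor(p + 0.5) == t)
    return hits / len(y_true)


# Made-up targets and regressor outputs, for illustration only.
targets = [1, 4, 7, 19]
outputs = [1.2, 3.6, 7.5, 18.9]
print(rounded_accuracy(targets, outputs))
```

With this convention a prediction of 7.5 for a target of 7 counts as a miss, so three of the four sample predictions are correct.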

The choice of the architecture of the neural networks was driven by considerations typical of computer vision and object recognition. We use convolutional neural networks to "automatically" build the features necessary for inference of the Hodge numbers: we took inspiration both from the classical LeNet, originally developed by Y. LeCun in 1998, for which we changed the kernel and used a larger version, and from the more modern Inception network developed by Google. In the latter case we considered different convolutions (over the rows and over the columns of the configuration matrix of the CICY manifolds) and concatenated the results of two concurrent networks: the results reached 100% test accuracy.
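The idea of convolving separately over rows and over columns and then concatenating the results can be sketched without any deep learning library. The example below only illustrates the operation: it uses fixed all-ones kernels on a small matrix, whereas in a trained network the kernel weights are learned and several kernels and layers are stacked. All names (`conv_valid`, `row_and_column_features`) are hypothetical and do not come from the notebooks.

```python
from typing import List

Matrix = List[List[float]]


def conv_valid(x: Matrix, kernel: Matrix) -> Matrix:
    """2D cross-correlation without padding (the 'valid' convolution of CNNs)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(x) - kh + 1
    out_w = len(x[0]) - kw + 1
    return [
        [
            sum(x[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def row_and_column_features(x: Matrix) -> List[float]:
    """Apply one kernel spanning a row and one spanning a column, then concatenate."""
    n_rows, n_cols = len(x), len(x[0])
    row_kernel = [[1.0] * n_cols]                 # shape (1, n_cols)
    col_kernel = [[1.0] for _ in range(n_rows)]   # shape (n_rows, 1)
    row_branch = conv_valid(x, row_kernel)        # shape (n_rows, 1)
    col_branch = conv_valid(x, col_kernel)        # shape (1, n_cols)
    # Flatten each branch and join them into a single feature vector.
    flat_rows = [v for row in row_branch for v in row]
    flat_cols = [v for row in col_branch for v in row]
    return flat_rows + flat_cols


# A small 2 x 3 matrix of degrees, for illustration only.
example = [[3.0, 0.0, 1.0],
           [0.0, 3.0, 1.0]]
features = row_and_column_features(example)
print(features)
```

With all-ones kernels the two branches reduce to row sums and column sums, which makes the shapes of the two outputs and of their concatenation easy to follow.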

Description of the Files

The analysis is divided into several Jupyter notebooks (in this list, hyperlinks point to the version for the original dataset):

the preanalysis contains a detailed visual analysis of the dataset with outlier detection, clustering and PCA performance comparison, feature engineering and feature selection,
the classical analysis deals with the more "classical" approach of machine learning using linear regression models, support vector machines and decision trees,
the ConvNet analysis uses convolutional neural networks to build the appropriate architecture to predict the Hodge numbers starting from the configuration matrix of the manifold,
the transfer learning analysis applies a more refined architecture to the feature-engineered set using transfer learning from the previous convolutional models,
the stacking analysis is an attempt at stacking ensemble learning to improve the results of the previous analysis.
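As an illustration of the stacking idea mentioned in the last item, the sketch below combines the predictions of two base models with a linear meta-learner whose weights are fitted by least squares on held-out predictions. This is a minimal sketch under stated assumptions: two base models, no intercept, and made-up numbers. The function name `fit_stacking_weights` is hypothetical, and the stacking notebook may use different base models and a different meta-learner.

```python
from typing import Sequence, Tuple


def fit_stacking_weights(
    pred_a: Sequence[float], pred_b: Sequence[float], target: Sequence[float]
) -> Tuple[float, float]:
    """Least-squares weights (w_a, w_b) so that w_a*pred_a + w_b*pred_b ~ target."""
    # Entries of the 2 x 2 normal equations (P^T P) w = P^T y.
    s_aa = sum(a * a for a in pred_a)
    s_ab = sum(a * b for a, b in zip(pred_a, pred_b))
    s_bb = sum(b * b for b in pred_b)
    s_ay = sum(a * y for a, y in zip(pred_a, target))
    s_by = sum(b * y for b, y in zip(pred_b, target))
    det = s_aa * s_bb - s_ab * s_ab
    if det == 0:
        raise ValueError("base predictions are collinear; weights are not unique")
    # Cramer's rule for the 2 x 2 system.
    w_a = (s_ay * s_bb - s_by * s_ab) / det
    w_b = (s_by * s_aa - s_ay * s_ab) / det
    return w_a, w_b


# Made-up held-out predictions of two base models and the true targets.
base_a = [1.0, 5.0, 5.0, 9.0]
base_b = [3.0, 3.0, 7.0, 7.0]
truth = [2.0, 4.0, 6.0, 8.0]

w_a, w_b = fit_stacking_weights(base_a, base_b, truth)
stacked = [w_a * a + w_b * b for a, b in zip(base_a, base_b)]
print(w_a, w_b, stacked)
```

The weights must be fitted on predictions for samples the base models did not see during training; otherwise the meta-learner simply rewards the base model that overfits the most.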

Each IPython notebook is entirely independent and can be run separately. The only real requirement is to first run (at least once) the preanalysis notebook to generate the "analysis-ready" dataset.

Note that several notebooks appear in more than one version: those named cicy3o... refer to the original dataset, while those named cicy3f... refer to the dataset with favourable embeddings. For the original dataset we also provide an analysis using only half of the training set and a prediction of the logarithm of the second Hodge number using neural networks.

Installation Prerequisites

To run the analysis you will need a Jupyter installation (we used an Anaconda environment) with Python 3.6 or later. You will also need to install the following packages:
