This is the group project from the "Advances in QA" class, winter semester 2019/2020, Uni Saarland. "We" are Yu-Wen Chen, Siyana Pavlova, and Simon Preissner. Our project report is contained in the repository as well.
The project aims to re-implement the system published by Xiao et al. (2019), "Dynamically Fused Graph Network for Multi-hop Reasoning", with as little assistance from their openly available code as possible.
Multi-hop question answering (QA) requires a system to derive the answer to a question from multiple text resources which, each on their own, don't contain the full answer.
In short, the workflow of dynamically fused graph networks (DFGN) is as follows: select several relevant paragraphs, construct an entity graph from them, and then look at parts of this graph to compute which entities contribute the most to answering the question at hand. Perform this step multiple times (each time looking at different parts of the entity graph) and take into account the entities' contribution from the previous iteration. This way, the graph network converges to the final answer.
The paper describes an architecture which is split into 5 modules:
- paragraph selector returns only the most relevant paragraphs (= "the context")
- graph constructor returns an entity graph from the context
- encoder uses BERT and BiDAF to encode the context and the question
- fusion block – the heart of DFGN – looks at parts of the entity graph for multiple iterations and exchanges information between the graph's nodes
- predictor takes the fusion block's output and passes it through a stacked LSTM architecture to output the final answers
The intuition is that with multiple iterations, relevant entities propagate their importance to other, directly connected entities. The entity graph has no document boundaries, enabling a free flow of information (= reasoning) across paragraphs.
Naming conventions: the system is trained in two parts, the paragraph selector (module 1) and what we call the DFGN (modules 3-5). We refer to the combination of all modules (1-5) as the full network.
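To make the control flow concrete, here is a minimal sketch of how the five modules fit together. The callables and their signatures are placeholders for illustration, not the actual API of the modules in this repository.

```python
# Hypothetical sketch of the DFGN workflow; the callables and their signatures
# are placeholders for illustration, not this repository's actual API.
def answer(question, paragraphs, select, build_graph, encode, fuse, predict, num_hops=2):
    context = select(question, paragraphs)       # module 1: paragraph selector
    graph = build_graph(context)                 # module 2: graph constructor (entity graph)
    q_emb, ctx_emb = encode(question, context)   # module 3: BERT + BiDAF encoder
    for _ in range(num_hops):                    # module 4: fusion block, run for several hops
        # each hop looks at a part of the entity graph and lets directly
        # connected entities exchange information across paragraph boundaries
        ctx_emb, q_emb = fuse(graph, ctx_emb, q_emb)
    return predict(ctx_emb)                      # module 5: stacked-LSTM prediction layer
```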
- `utils.py` (local) provides helper functions and classes.
- `torch` (get it here)
- `tqdm` for progress bars (get it here)
- `flair` for named entity recognition (NER) (get it here); an example call is sketched after this list. Python 3.8 and above cause issues with `flair`, so use a lower version; we have run this with Python 3.6.
- `pycorenlp.StanfordCoreNLP` for NER (not necessary) (get it here)
- `transformers` by Huggingface, supplying BERT (get it here)
- `sklearn`, mainly for evaluation (get it here)
- `ujson` for running the official HotPotQA evaluation script (get it here)
- `pandas` (get it here)
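As an illustration of how `flair`'s pre-trained NER tagger is typically invoked (not necessarily the exact calls used in this repository):

```python
from flair.data import Sentence
from flair.models import SequenceTagger

# load flair's pre-trained English NER model (downloaded on first use)
tagger = SequenceTagger.load("ner")

sentence = Sentence("Barack Obama was born in Hawaii.")
tagger.predict(sentence)  # annotates the sentence in place

# print the recognized entity spans and their labels
for span in sentence.get_spans("ner"):
    print(span)
```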
You can install each of these modules individually or use the `requirements.txt` file:
pip install -r requirements.txt
Execute `train_ps.py` and pass it a configuration file and a model name. The model name will be used to create a directory containing all outputs (model config, model parameters, losses, times, and scores during training). Example:
python3 train_ps.py config/train_ps_final.cfg my_ps_model
Training a DFGN with `train_dfgn.py` means that the Encoder, FusionBlock, and Predictor modules are trained jointly, using a previously trained ParagraphSelector model and the EntityGraph module to process a question before it is encoded. This script runs similarly to `train_ps.py`:
python3 train_dfgn.py config/train_dfgn.cfg my_dfgn_model
Have a look at the config file used in this example to get an idea of the required (and optional) parameters for training. If you run into issues with your GPU, try setting the device-related parameters to "False" or decrease the batch size; for training the DFGN, it might have to be below 4.
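If you are unsure whether a GPU is visible to PyTorch at all, a quick standalone check with standard torch calls (independent of this repository) can save some debugging:

```python
import torch

# report whether CUDA is available and which device would be used
if torch.cuda.is_available():
    print("CUDA device:", torch.cuda.get_device_name(0))
else:
    print("No GPU visible to torch; consider setting device-related parameters to False.")
```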
This is just as straightforward as training: upon execution, pass a configuration file and the name of the model you want to test to `eval_ps.py`, and the script will compute precision, recall, F1 score, and accuracy and log them:
python3 eval_ps.py config/eval_ps.cfg my_ParagraphSelector_model
The predictions made during evaluation are also logged in a directory named after the model.
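For reference, the four scores mentioned above can be computed with `sklearn` (listed among the requirements "mainly for evaluation"); the snippet below is an illustration with toy labels, not the exact code in `eval_ps.py`:

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# toy gold and predicted paragraph-relevance labels (1 = relevant, 0 = not)
gold = [1, 0, 1, 1, 0, 0]
pred = [1, 0, 0, 1, 0, 1]

precision, recall, f1, _ = precision_recall_fscore_support(gold, pred, average="binary")
accuracy = accuracy_score(gold, pred)
print(f"P={precision:.3f}  R={recall:.3f}  F1={f1:.3f}  Acc={accuracy:.3f}")
```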
As with the evaluation script for the Paragraph Selector, pass a configuration file and the name of the directory containing the model:
python3 eval_dfgn.py config/eval_dfgn.cfg my_DFGN_model
You can download pre-trained models for the ParagraphSelector and the subsequent DFGN from this Google Drive.
The class `ConfigReader` in the `utils` module can parse files in raw text format into a number of data types. The syntax of configuration files (preferably indicated by the extension '.cfg') follows Python syntax in most parts. Here are the important details:
- one parameter per line, containing a name and a value
- name and value are separated by at least one white space or tab
- names should only contain alphanumeric symbols and '_' (no '-', please!)
- list-like values are allowed (use Python list syntax)
- strings within value lists don't need to be quoted
- value lists must be either fully quoted or fully unquoted (no `["foo", 3, "bar"]`); mixed lists will exclude non-quoted elements
- multi-word strings are marked with single or double quotation marks
- strings containing quotation marks are not tested yet. Be careful!
- lines starting with `#` are ignored; no in-line comments!
- config files should have the extension 'cfg' (to indicate their purpose)
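Putting these rules together, a config file could look like the following. The parameter names here are made up for illustration; see the files in config/ for the parameters each script actually expects.

```
# lines starting with '#' are ignored
batch_size      42
learning_rate   0.0001
model_name      "my first model"
data_files      ["train.json", "dev.json"]
use_gpu         False
```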
Suppose there is a file called 'my_config.cfg' which contains the line `batch_size 42`. ConfigReader can be used to access the value 42 like this:
from utils import ConfigReader
file_path = 'my_config.cfg'
cfg = ConfigReader(file_path)       # parses the config file on construction
my_batch_size = cfg("batch_size")   # returns 42
ConfigReader objects hold the parsed parameters as a dictionary, which makes it possible to access all (or subsets of) parameters at once.
Note that there is no check of whether all parameters required for the execution of a program are actually specified in the config file, and that ConfigReader returns `None` for parameters it doesn't hold.
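Since missing parameters silently come back as `None`, it can be worth guarding reads with an explicit fallback. This sketch only uses the `cfg("...")` accessor shown above; the default value is a hypothetical choice.

```python
from utils import ConfigReader

cfg = ConfigReader("my_config.cfg")

# ConfigReader returns None for parameters that are not in the file,
# so fall back to a default of your choice instead of failing later
batch_size = cfg("batch_size")
if batch_size is None:
    batch_size = 8  # hypothetical default
```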
- `modules/` - the main modules of the architecture
  - `ParagraphSelector.py` - implements the Paragraph Selector from the paper (section 3.1)
  - `EntityGraph.py` - implements the Graph Constructor from the paper (section 3.2) and builds the binary matrix used in section 3.4
  - `Encoder.py` - implements the Encoder from the paper (section 3.3)
  - `FusionBlock.py` - implements the Fusion Block from the paper (section 3.4)
  - `Predictor.py` - implements the LSTM Prediction Layer from the paper (section 3.5)
- `config/` - configuration files; input to ConfigReader objects
- `models/` - results of performance tests of ParagraphSelector and DFGN models
- `playground/` - code snippets and little scripts; unimportant for running the code