-
Notifications
You must be signed in to change notification settings - Fork 0
/
publications.html
181 lines (175 loc) · 16.5 KB
/
publications.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
<!DOCTYPE html>
<html lang="en">
<head>
<title>Annika Stein</title>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- Stylesheets, standards and custom style for this page -->
<link rel="stylesheet" href="https://annikastein.github.io/SkewbPage/SkewbSkills/normalize.css">
<link rel="stylesheet" href="https://www.w3schools.com/w3css/4/w3.css">
<link rel='stylesheet' href='https://fonts.googleapis.com/css?family=Roboto'>
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/all.min.css">
<link rel="stylesheet" href="https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css">
<link rel="stylesheet" href="css/main.css">
</head>
<body>
<div class="header">
<h1>Annika Stein</h1>
<p>Physicist @ ETAP, Johannes Gutenberg-Universität Mainz</p>
</div>
<nav class="navbar" id="myTopnav">
<a href="index.html" id="homebutton">Home</a>
<a href="javascript:void(0);" class="icon" onclick="respNav()">☰</a>
<a href="education.html">Education and Qualifications</a>
<a href="work.html">Experience (Teaching / Work)</a>
<a href="#" class="active">Publications</a>
<a href="talks_theses.html">Talks, Theses and more</a>
<a href="activities.html">Other Activities</a>
<a href="contact.html" class="right">Contact</a>
</nav>
<div class="row">
<div class="main">
<div class="w3-container">
<h2 class="w3-text-black w3-padding-16">Publications</h2>
<div class="card">
<h3>Novel jet flavour tagging algorithms exploiting adversarial deep learning techniques with efficient computing methods and preparation of open data for robustness studies</h3>
<h4 id="phd_thesis"><b><u>Annika Stein</u></b></h4>
<div class="note">
<p><em>PhD Thesis</em><br>
<a href="https://doi.org/10.18154/RWTH-2024-07840" target="_blank">https://doi.org/10.18154/RWTH-2024-07840</a><br>
2024
</p>
</div>
</div>
<div class="card">
<h3>Jet Flavour Tagging OpenData Sample</h3>
<h4 id="open_data"><b><u>Annika Stein, Taylor C. Briggs</u></b></h4>
<div class="note">
<p><em>kaggle dataset</em><br>
<a href="https://www.kaggle.com/datasets/annikastein/jet-tagging-opendata-mockup" target="_blank">https://www.kaggle.com/datasets/annikastein/jet-tagging-opendata-mockup</a><br>
2023
</p>
</div>
<button type="button" class="collapsible">Abstract</button>
<div class="content">
<p>A collection of low-level information useful for classifying a particle jet's flavour, i.e. related to the jet's constituents like charged or neutral particle flow candidates, or secondary vertex information. This is the first openly available dataset of that kind that has been derived with experiment-specific software run on top of CERN Open Data and then stored in experiment-independent format, making ROOT or similar frameworks obsolete. Get ready to jump into this fruitful particle physics task without worrying about containers, special tools and physics terminology.</p>
<p><em>Contribution: Prepared the dataset with a framework to extract and calculate low-level information based on jet constituent properties. A set of plugins and scripts have been written to facilitate the production with high-performance computing infrastructures, for experts with CMSSW experience. To facilitate exchange with other (academic) communities, I derived a novel file structure that is fully independent of experiment-specific software and which does not require any experience with ROOT. I then executed production and conversion of the samples. The file structure has been exchanged with the co-author, whose feedback was included for the public version of the dataset. I modified an exemplary notebook that shows how to use the files directly on the kaggle platform, increasing the datasets's usability score.
</em></p>
</div>
</div>
<div class="card">
<h3>A modern framework for jet tagger development</h3>
<h4>From training to first data-to-simulation agreement results in one go<h4>
<h4 id="software_framework_jet_tagging_data"><b><u>Annika Stein</u></b></h4>
<div class="note">
<p><em>arXiv preprint</em><br>
<a href="https://arxiv.org/abs/2306.17846" target="_blank">arXiv:2306.17846</a><br>
July 2023
</p>
</div>
<button type="button" class="collapsible">Abstract</button>
<div class="content">
<p>This paper presents a new tool to perform various steps in jet tagger development in an efficient and comprehensive way. A common data structure is used for training, as well as for performance evaluation in data. The introduction of this new framework reduces the amount of data to be stored while accomplishing the same tasks, and shortens waiting times between algorithm development and data-to-simulation results becoming available from months to days, taking typical CMS experiment pipelines as a reference. Proper utilization of high-throughput systems enables first data-to-simulation studies with a recent neural network architecture, Particle Transformer, adapted to jet flavour tagging. Unlike official implementations of the collaboration, the new framework allows investigating different variants, like different training paradigms, and their impact on data/simulation agreement, without producing any new large files on disk, and within the same run of the analysis framework. Besides being more time- and storage-efficient and thus enabling the first results of that kind to be available just few hours after finishing neural network training, the framework is currently the only realization capable of studying how adversarial techniques affect data/simulation agreement for tagger algorithm outputs as well as inputs.</p>
<p><em>Contribution: Came up with the entire restructured framework design, developed it to be production-ready and showed first application to a new algorithm.
</em></p>
</div>
</div>
<div class="card">
<h3>A first look at early 2022 proton-proton collisions at √s = 13.6
TeV for heavy-flavor jet tagging</h3>
<h4 id="comm_earlyrun3_btv_cms">CMS Collaboration</h4>
<div class="note">
<p><em>Detector Performance Summary, CMS, CERN</em><br>
<a href="https://cds.cern.ch/record/2854698?ln=en" target="_blank">CMS DP-2023/012</a><br>
March 2023
</p>
</div>
<button type="button" class="collapsible">Abstract</button>
<div class="content">
<p> Identification of jets stemming from heavy-flavor (bottom or charm) hadrons relies primarily on the inputs due to reconstruction of charged particle tracks and secondary vertices contained in these jets. Thus, it is crucial to study the data vs. simulation distributions of the kinematic variables that are used as inputs for the heavy-flavor tagging algorithms. With the start of LHC Run 3, new track reconstruction methods and changes of the general data-taking conditions have been introduced in the CMS experiment. In this note, a comparison of early Run 3 data vs. simulation distributions for several input variables, tagging discriminants, and other relevant kinematic observables in four different phase space regions that are enriched in b, c, and light (udsg) jets, is presented. The proton-proton collision data recorded by the CMS detector during the early part of the 2022 run, corresponding to an integrated luminosity of 7.65 fb−1, has been used for this study.</p>
<p><em>Contribution: Wrote code to produce samples, modified code to switch from Run 2 to Run 3 setup, involving new jet collection (PUPPI), and extended input collections (DeepJet).
Advised the entire commissioning team on how to efficiently split and produce the samples with CMS tool "CRAB", supported in case of questions or problems with the CMS software + grid resources, and produced samples containing 2022 data myself.
Contributed configuration to run the framework at different computing clusters with various priority queues, leading to a net increase of throughput for individual analysers getting the required amount of cores + core hours.
Investigated newly added features in a pre-commissioning study to determine their best representation with histograms and checked their validity in an independent cross-check.
Parts of my master thesis code are re-used to simplify access and obtain reproducible visualization of tagger input features which requires understanding of their definitions and physical meaning.
I kept up with developments of other POGs to include latest recommendations in object selections and ensured the code is in line with official experiment- and era-specific guidelines.
I identified a vast amount of (edge) cases and difficult-to-catch technicalities during extensive code review to ensure usability and consistent results across phase spaces and calibrations.
</em></p>
</div>
</div>
<div class="card">
<h3>Improving robustness of jet tagging algorithms with adversarial training: exploring the loss surface</h3>
<h4 id="adversarial_training_acat"><b><u>Annika Stein</u></b></h4>
<div class="note">
<p><em>Proceedings of 21th International Workshop on Advanced Computing and Analysis Techniques in Physics Research : AI meets Reality (ACAT 2022), to be published in the Journal Of Physics: Conference Series</em><br>
<a href="https://arxiv.org/abs/2303.14511" target="_blank">arXiv:2303.14511</a> [hep-ex] | <a href="https://indico.cern.ch/event/1106990/contributions/5096950/" target="_blank">Contribution</a><br>
2023
</p>
</div>
<button type="button" class="collapsible">Abstract (<em>Under review.</em>)</button>
<div class="content">
<p>In the field of high-energy physics, deep learning algorithms continue to gain in
relevance and provide performance improvements over traditional methods, for example when
identifying rare signals or finding complex patterns. From an analyst’s perspective, obtaining
highest possible performance is desirable, but recently, some attention has been shifted towards
studying robustness of models to investigate how well these perform under slight distortions
of input features. Especially for tasks that involve many (low-level) inputs, the application of
deep neural networks brings new challenges. In the context of jet flavor tagging, adversarial
attacks are used to probe a typical classifier‘s vulnerability and can be understood as a model
for systematic uncertainties. A corresponding defense strategy, adversarial training, improves
robustness, while maintaining high performance. Investigating the loss surface corresponding
to the inputs and models in question reveals geometric interpretations of robustness, taking
correlations into account.</p>
<p><em>Contribution: Conducted the entire study, wrote the code, interpreted the results and derived new ideas on top of that, designed the poster (see the original <a href="https://indico.desy.de/event/31214/contributions/120862/" target="_blank">here</a>) and then wrote the proceedings. Follow-up of <a href="#robustness_jet_tagging">this publication</a>.</em></p>
</div>
</div>
<div class="card">
<h3>Adversarial training for b-tagging algorithms in CMS</h3>
<h4 id="adversarial_training_btv_cms">CMS Collaboration (<b><u>Annika Stein</u></b>)</h4>
<div class="note">
<p><em>Detector Performance Summary, CMS, CERN</em><br>
<a href="https://cds.cern.ch/record/2839919?ln=en" target="_blank">CMS DP-2022/049</a><br>
October 2022
</p>
</div>
<button type="button" class="collapsible">Abstract</button>
<div class="content">
<p>Modern neural networks bring considerable performance improvements in various areas of high-energy physics, such as object identification. Flavour-tagging is one example that profits from complex architectures, leveraging information from large numbers of low-level inputs. While such tagging algorithms are evaluated on data and simulation for analysis purposes, training is usually performed with simulated samples only. Differences in performance between these two domains are observed which need to be calibrated against. With a new strategy, called adversarial training, we reduce the observed differences prior to any calibration, and improve robustness of the classifier against injected mis-modelings that mimic systematic uncertainties. In this note, studies on adversarial robustness and agreement between data and simulation are carried out with the DeepJet algorithm, evaluated on Run2 samples. The addition of an adversarial module is envisaged to be included in newly developed tagging algorithms for Run3.</p>
<p><em>Contribution: All steps starting from custom dataset production to the presentation of final results were performed by me, which I then summarized in the note.</em></p>
</div>
</div>
<div class="card">
<h3>Improving Robustness of Jet Tagging Algorithms with Adversarial Training</h3>
<h4 id="robustness_jet_tagging"><b><u>Annika Stein</u></b>, Xavier Coubez, Spandan Mondal, Andrzej Novak, Alexander Schmidt</h4>
<div class="note">
<p><em>Computing and Software for Big Science, 6 (2022) 15</em><br>
<a href="https://arxiv.org/abs/2203.13890" target="_blank">arXiv:2203.13890</a> [physics.data-an] | <a href="https://doi.org/10.1007/s41781-022-00087-1" target="_blank">https://doi.org/10.1007/s41781-022-00087-1</a><br>
<a href="https://github.com/AnnikaStein/Adversarial-Training-for-Jet-Tagging" target="_blank">Code</a> | March 2022 (preprint v1) | published 10 Sep 2022 (Comput Softw Big Sci).
</p>
</div>
<button type="button" class="collapsible">Abstract</button>
<div class="content">
<p>Deep learning is a standard tool in the field of high-energy physics, facilitating considerable sensitivity enhancements for numerous analysis strategies. In particular, in identification of physics objects, such as jet flavor tagging, complex neural network architectures play a major role. However, these methods are reliant on accurate simulations. Mismodeling can lead to non-negligible differences in performance in data that need to be measured and calibrated against. We investigate the classifier response to input data with injected mismodelings and probe the vulnerability of flavor tagging algorithms via application of adversarial attacks. Subsequently, we present an adversarial training strategy that mitigates the impact of such simulated attacks and improves the classifier robustness. We examine the relationship between performance and vulnerability and show that this method constitutes a promising approach to reduce the vulnerability to poor modeling.</p>
<p><em>Contribution: Conducted the entire study form start to finish myself, main author. First I performed a similar study for my master thesis, then I rewrote the framework for a new data structure, redid the analysis introducing more validations, interpreted the results and acted as corresponding author. Presented first <a href="https://indico.desy.de/event/31214/contributions/120862/" target="_blank">here</a>.</em></p>
</div>
</div>
</div>
</div>
<div class="side">
<div class="w3-container">
<div class="card">
<h3>Links</h3>
<a href="https://orcid.org/0000-0003-0713-811X" target="_blank"><i class="fab fa-orcid fa-fw w3-margin-right w3-xxlarge jgu-red"></i></a>
<a href="https://arxiv.org/a/stein_a_4.html" target="_blank"><i class="ai ai-arxiv fa-fw w3-margin-right w3-xxlarge jgu-red"></i></a>
<a href="https://inspirehep.net/authors/2060569" target="_blank"><i class="ai ai-inspire fa-fw w3-margin-right w3-xxlarge jgu-red"></i></a>
<a href="https://www.researchgate.net/profile/Annika-Stein-4" target="_blank"><i class="fab fa-researchgate fa-fw w3-margin-right w3-xxlarge jgu-red"></i></a>
</div>
</div>
</div>
</div>
<div class="footer">
<p>Last updated: 27.09.2024</p>
</div>
<script src="js/script.js"></script>
</body>
</html>