-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.html
220 lines (220 loc) · 12.6 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8" />
<title>Barbados Workshop 2024</title>
<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="theme-color" content="#388cd5" />
<link rel="stylesheet" href="assets/style.css" />
<link rel="shortcut icon" type="image/x-icon" href="favicon.ico?v=9046d6025ad442bfe95151e736f675c26ff5b123" />
<script src="jquery-3.7.1.min.js"></script>
<script>
$(function () {
$("#includedContent").load("participant.html");
});
</script>
</head>
<body>
<header>
<h1>Bellairs Invitational Workshop on Contemporary, Foreseeable and Catastrophic Risks of Large Language Models</h1>
<h2>29 March 2024 -- 5 April 2024 </h2>
<a href="#program">Program</a>
<a href="#reading">Reading</a>
<a href="#participants">Participants</a>
<a href="#venue">Venue</a>
<a href="#travel">Travel</a>
</header>
<main>
<p>
Artificial intelligence in the form of Large Language Models (LLMs) has advanced more rapidly than most researchers expected. As a result, estimates of how long it will take to develop human-level AI have shortened,
accelerating speculation about the long-term dangers of AI. Far future predictions are valuable, but it is at least as important to engage with the concrete risks AI poses now. Especially since our capacity to solve
long-term dangers will be determined by immediate risks: effective alignment algorithms must at least work for the easy cases we encounter at present. And, to make good collective decisions for the future, we must ensure
that short-term proliferation of AI technologies does not irrevocably harm our economies and societies.
</p>
<p>
The objective of this workshop is to make progress on foreseeable and quantifiable risks of AI on the five year time scale. We will convene experts from diverse domains such as natural language processing, machine learning,
safety, privacy, and law within a singular forum to understand the concrete, demonstrable risks of LLMs. Together, will examine strategies for mitigating these risks and pinpoint areas that need further exploration and
development. Foremost, we seek to ground the discourse around LLM risks in a pragmatic and actionable framework.
</p>
<h2 id="program">Scientific Program</h2>
<p>The institute will be open from March 29 to April 5.</p>
<p>The scientific program will take place from March 31 to April 4.</p>
<p>Each day of the workshop will consist of:</p>
<ul>
<li>A morning session (9:30am-noon).</li>
<li>An evening session (7:30pm-9pm).</li>
<li>The rest of the day will be left open for discussions and collaborations.</li>
</ul>
<google-sheets-html-origin>
<table dir="ltr" style="table-layout: fixed; font-size: 10pt; width: 0px; border-collapse: collapse; border: none;" border="1" cellspacing="0" cellpadding="0" data-sheets-root="1">
<thead>
<tr style="height: 20px;">
<th style="height: 20px;">Day</th>
<th style="height: 20px;">Time</th>
<th style="height: 20px;">Topic</th>
<th style="height: 20px;">Leads</th>
</tr>
</thead>
<colgroup> <col width="77" /> <col width="132" /> <col width="200" /> <col width="200" /> </colgroup>
<tbody>
<tr style="height: 49px;">
<td style="height: 69px;" colspan="1" rowspan="2">
<div>1</div>
</td>
<td style="height: 49px;">9:30am−noon</td>
<td style="height: 49px;">
<p>Overview, Social biases in LLMs, Bias mitigation, Jailbreaking</p>
</td>
<td style="height: 49px;">
<p>Siva Reddy</p>
</td>
</tr>
<tr style="height: 20px;">
<td style="height: 20px;">7:30pm−9pm</td>
<td style="height: 20px;">Alignment</td>
<td style="height: 20px;">Maja Trębacz</td>
</tr>
<tr style="height: 39px;">
<td style="height: 78px;" colspan="1" rowspan="2">
<div>2</div>
</td>
<td style="height: 39px;">9:30am−noon</td>
<td style="height: 39px;">Uncertainty Communication</td>
<td style="height: 39px;">Sylvie Delacroix, Neil Lawrence<br /><br /></td>
</tr>
<tr style="height: 39px;">
<td style="height: 39px;">7:30pm−9pm</td>
<td style="height: 39px;">Explainability, Interpretability, Trust</td>
<td style="height: 39px;">Ana Marasović</td>
</tr>
<tr style="height: 39px;">
<td style="height: 59px;" colspan="1" rowspan="2">
<div>3</div>
</td>
<td style="height: 39px;">9:30am−noon</td>
<td style="height: 39px;">Watermarking, Responsible Deployment</td>
<td style="height: 39px;">Boaz Barak, Rich Zemel</td>
</tr>
<tr style="height: 20px;">
<td style="height: 20px;">7:30pm−9pm</td>
<td style="height: 20px;">Privacy, Security</td>
<td style="height: 20px;">Nicolas Papernot</td>
</tr>
<tr style="height: 39px;">
<td style="height: 78px;" colspan="1" rowspan="2">
<div>4</div>
</td>
<td style="height: 39px;">9:30am−noon</td>
<td style="height: 39px;">Normativity, Human Evolution and LLMs</td>
<td style="height: 39px;">Gillian Hadfield</td>
</tr>
<tr style="height: 39px;">
<td style="height: 39px;">7:30pm−9pm</td>
<td style="height: 39px;">Data Governance</td>
<td style="height: 39px;">Leandro Von Werra, Harm de Vries</td>
</tr>
<tr style="height: 20px;">
<td style="height: 40.8438px;" colspan="1" rowspan="2">
<div>5</div>
</td>
<td style="height: 20px;">9:30am−noon</td>
<td style="height: 20px;">Policy, Miscellaneous</td>
<td style="height: 20px;">Jess Montgomery</td>
</tr>
<tr style="height: 20.8438px;">
<td style="height: 20.8438px;">7:30pm−9pm</td>
<td style="height: 20.8438px;">Conclusion, Outcomes</td>
<td style="height: 20.8438px;">Siva Reddy</td>
</tr>
</tbody>
</table></google-sheets-html-origin>
<h2 id="reading">Background Reading</h2>
<p>Participants are encouraged to consult the following references in advance of the workshop:</p>
<ul>
<li><a href="https://arxiv.org/abs/2109.13916">Unsolved Problems in ML Safety</a></li>
<li><a href="https://arxiv.org/abs/2307.15043">Universal and Transferable Adversarial Attacks on Aligned Language Models</a></li>
<li><a href="https://aclanthology.org/2021.acl-long.416">StereoSet: Measuring stereotypical bias in pretrained language models</a></li>
<li><a href="https://arxiv.org/abs/2212.08073">Constitutional AI: Harmlessness from AI Feedback</a></li>
<li><a href="https://arxiv.org/abs/2204.05862">Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback</a></li>
<li><a href="https://arxiv.org/abs/2112.00861">A General Language Assistant as a Laboratory for Alignment</a></li>
<li><a href="https://arxiv.org/abs/2307.02483">Jailbroken: How Does LLM Safety Training Fail?</a></li>
<li><a href="https://aclanthology.org/2022.acl-long.132">An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models</a></li>
<li><a href="https://arxiv.org/abs/2310.03693">Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!</a></li>
<li><a href="https://arxiv.org/abs/1606.06565">Concrete Problems in AI Safety</a></li>
<li><a href="https://arxiv.org/abs/2203.02155">Training language models to follow instructions with human feedback</a></li>
<li><a href="https://aclanthology.org/2022.emnlp-main.225">Red Teaming Language Models with Language Models</a></li>
<li><a href="https://arxiv.org/abs/2401.05566">Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training</a></li>
</ul>
</ul>
<h2 id="participants">Participants</h2>
<div id="includedContent"></div>
<h2 id="venue">Venue</h2>
<p>The workshop will be held at the <a href="https://www.mcgill.ca/bellairs/">Bellairs Research Institute</a> of McGill University, Holetown, St. James, Barbados.</p>
<p>For accommodation pricing, see <a href="https://www.mcgill.ca/bellairs/facilities/accommodation">the official page</a>.</p>
<p>Contact</p>
<ul>
<li>Email: <a href="mailto:[email protected]">[email protected]</a></li>
</ul>
<h3>The Most Important House Rules</h3>
<p>Kitchen and Food</p>
<ul>
<li>Breakfast is eaten together Saturday-Friday at Bellairs.</li>
<li>Lunch may be purchased from a grocery store or nearby restaurants.</li>
<li>Dinner is eaten together Sunday-Thursday at Bellairs.</li>
<li>We can make coffee and tea in the kitchen any time we want.</li>
<li>Please leave the kitchen clean.</li>
<li>There is a guest fridge in the kitchen where we can keep our own private food. Please label your food and remove any left over when you depart.</li>
</ul>
<p>Showers and Sand</p>
<ul>
<li>
Sand in the shower drains can cause enormous blockage problems. Please be sure to rinse off the sand from your feet before entering your rooms. There are water taps outside both blocks of rooms for this purpose.
</li>
</ul>
<p>Locked Doors and Valuables</p>
<ul>
<li>Barbados is a rather safe country in general but normal precautions when travelling should be taken for your money and valuables.</li>
</ul>
<p>Telephone</p>
<ul>
<li>Telephones and computers are available in the main office (sort of).</li>
</ul>
<h3>Bellairs Survival Hints</h3>
<p>Food and Snacks</p>
<ul>
<li>We will have a cook and the food is great but if you need anything special please bring it along. There will be a fridge where we can keep our private food items.</li>
<li>The coffee there is of the instant variety. If you wish to bring your own coffee you may do so.</li>
<li>
Vegetarians may want to bring their favorite non-perishables, however it is not necessary since there is already a diverse selection at the local supermarket. There is also good vegetarian roti in several places near
Bellairs.
</li>
</ul>
<p>Beach, Sun, Snorkeling, and SCUBA diving</p>
<ul>
<li>Bellairs is situated on one of the best beaches in Barbados, so don't forget your bathing suit (and skin protection) for swims before breakfast and in between work sessions.</li>
<li>
There is also good snorkeling right in front of Bellairs so if you have a mask and fins bring them along too. In fact, if you SCUBA dive bring your gear. There is diving right there as well and air tanks at Bellairs cost
only about US$12.00 per tank!
</li>
</ul>
<p>Mosquitos</p>
<ul>
<li>Depending on the weather conditions and other factors, we may get some mosquitoes. You should bring some bug repellant just in case.</li>
</ul>
<h2 id="travel">Travel</h2>
<h3>Flying in</h3>
<p>Please see the <a href="https://www.visitbarbados.org/covid-19-travel-guidelines-2022">Barbados Official Travel Protocols</a> for the rules that are currently in place on the island.</p>
<p>
As of January 10, 2023, that site said "Effective midnight, Thursday September 22, 2022, Barbados will discontinue all COVID-19 related travel protocols. Therefore, there will be no testing requirements for entering Barbados
whether you are vaccinated or unvaccinated."
</p>
<p>Details for travel from the airport will be provided by email.</p>
<iframe src="https://maps.google.com/maps?q=mcgill+bellairs+holetown&t=&z=13&ie=UTF8&iwloc=&output=embed" frameborder="0" style="border: 0; width: 100%; height: 50vh;" allowfullscreen></iframe>
<h3>Map of Bellairs</h3>
<center>
<img src="assets/bellairs-map.png" style="max-height: 90vh;" />
</center>
<p style="margin: 3rem 0 1rem; text-align: center;">For questions please contact <a href="mailto:[email protected]">[email protected]</a></p>
</main>
</body>
</html>