PhD Defence • Machine Learning | Security and Privacy • Analyzing Risks of Large-Scale Machine Learning Systems

Monday, January 29, 2024 — 10:00 AM to 1:00 PM EST

Please note: This PhD defence will take place in DC 3317 and online.

Nils Lukas, PhD candidate
David R. Cheriton School of Computer Science

Supervisor: Professor Florian Kerschbaum

Large-scale machine learning models such as ChatGPT are rapidly transforming how we interact with and trust digital media. However, the emergence of such a powerful technology poses a dual-use dilemma. While it can have many positive societal impacts, such as providing equitable access to information, deploying ML systems can also cause harm. For example, a model can put the privacy of individuals in its training data at risk if it has inadvertently memorized sensitive information. It can also erode trust in digital media if it enables untrustworthy users to generate deepfakes and misinformation or to enhance the deceptiveness of online scams.

This thesis presents five projects that assess these risks to the privacy and security of ML systems and evaluate the reliability of known countermeasures. To do so, I assess the privacy risks of extracting Personally Identifiable Information from language models trained with (record-level) differential privacy. As a method of controlling misuse, I study the effectiveness and reliability of fingerprinting and watermarking methods to (i) detect the provenance of a model and (ii) detect any image generated by a watermarked ML system.


To attend this PhD defence in person, please go to DC 3317. You can also attend using Zoom at https://uwaterloo.zoom.us/j/99478041947.

Location 
DC - William G. Davis Computer Research Centre
Hybrid: DC 3317 | Online PhD defence
200 University Avenue West
Waterloo, ON N2L 3G1
Canada