Thursday, March 12, 2020 — 4:00 PM EDT

Bayesian Additive Regression Trees for Statistical Learning

Regression trees are flexible nonparametric models that are well suited to many modern statistical learning problems. Many such tree models have been proposed, from the simple single-tree model (e.g., Classification and Regression Trees, or CART) to more complex tree ensembles (e.g., Random Forests). Their nonparametric formulation allows one to model datasets exhibiting complex non-linear relationships between the predictors and the response. A recent innovation in the statistical literature is the development of a Bayesian analogue to these classical regression tree models. The benefit of the Bayesian approach is the ability to quantify uncertainties within a holistic Bayesian framework. We introduce the most popular variant, the Bayesian Additive Regression Trees (BART) model, and describe recent innovations to this framework. We conclude with some of the exciting research directions currently being explored.
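
As a toy illustration of the sum-of-trees idea behind BART (not the speaker's implementation), the following Python sketch runs Bayesian backfitting over an additive ensemble of fixed stumps, resampling each stump's leaf means from their conjugate normal posterior given the partial residual. A full BART sampler would also propose changes to the tree structures themselves; the split points, prior variance tau2, and noise variance sigma2 below are illustrative choices.

import numpy as np

rng = np.random.default_rng(0)

# Toy data: a nonlinear response observed with Gaussian noise.
n = 200
x = rng.uniform(-2.0, 2.0, size=n)
y = np.sin(2.0 * x) + 0.3 * rng.standard_normal(n)

# Each "tree" is a fixed stump: a split point c with leaf means (mu_left, mu_right).
# Real BART also samples the tree structures (grow/prune/change moves);
# this sketch holds the structures fixed and only resamples the leaf means.
splits = np.linspace(-1.5, 1.5, 10)       # one stump per split point (illustrative)
leaf_mu = np.zeros((len(splits), 2))      # current leaf values for each stump

sigma2 = 0.3 ** 2                         # error variance (held fixed for simplicity)
tau2 = 0.5 ** 2                           # prior variance of each leaf value

def stump_pred(c, mu, x):
    """One stump's prediction: mu[0] left of the split c, mu[1] to the right."""
    return np.where(x < c, mu[0], mu[1])

def ensemble_pred(x):
    """Sum-of-trees prediction: the additive ensemble of all stumps."""
    return sum(stump_pred(c, mu, x) for c, mu in zip(splits, leaf_mu))

# Bayesian backfitting: cycle through the stumps; for each, form the partial
# residual (data minus all other stumps) and draw new leaf means from their
# conjugate normal posteriors.
for sweep in range(200):
    total = ensemble_pred(x)
    for j, c in enumerate(splits):
        resid = y - (total - stump_pred(c, leaf_mu[j], x))
        for side, mask in enumerate((x < c, x >= c)):
            m = mask.sum()
            post_var = 1.0 / (m / sigma2 + 1.0 / tau2)
            post_mean = post_var * resid[mask].sum() / sigma2
            new_val = rng.normal(post_mean, np.sqrt(post_var))
            total = total + np.where(mask, new_val - leaf_mu[j, side], 0.0)
            leaf_mu[j, side] = new_val

print("in-sample RMSE of one posterior draw:",
      np.sqrt(np.mean((ensemble_pred(x) - y) ** 2)))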

Thursday, March 12, 2020 — 2:45 PM EDT

Please note: This seminar has been cancelled.

Friday, March 6, 2020 — 10:30 AM EST

Please note: This seminar has been cancelled.

Thursday, March 5, 2020 — 4:00 PM EST

Concentration of Maxima: Fundamental Limits of Exact Support Recovery in High Dimensions

We study the estimation of the support (the set of non-zero components) of a sparse high-dimensional signal observed with additive and dependent noise. Under the usual parameterization of the size of the support set and the signal magnitude, we characterize a phase-transition phenomenon akin to Ingster's signal detection boundary. We show that when the signal is above the so-called strong classification boundary, thresholding estimators achieve asymptotically perfect support recovery. This holds under arbitrary error dependence, provided that the marginal error distribution has rapidly varying tails. Conversely, under mild dependence conditions on the noise, we show that no thresholding estimator can achieve perfect support recovery if the signal is below the boundary. For log-concave error densities, thresholding estimators are shown to be optimal, and hence the strong classification boundary is universal in this setting.

The proofs exploit a concentration-of-maxima phenomenon known as relative stability. We obtain a complete characterization of the relative stability phenomenon for dependent Gaussian noise via the Slepian and Sudakov-Fernique bounds and some Ramsey theory.
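
As a minimal illustration of a thresholding support estimator (a toy with i.i.d. Gaussian errors, standing in for the dependent-noise settings of the talk), the Python sketch below simulates a sparse signal of magnitude sqrt(2 r log p) and declares a coordinate to be in the support when its observation exceeds a level near sqrt(2 log p), the typical size of the largest of p independent standard normals. The constants r and s and the threshold margin are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(1)

# Simulated sparse high-dimensional signal observed with additive noise.
# (i.i.d. Gaussian errors here; the talk's results cover dependent noise.)
p = 10_000                      # dimension
s = 20                          # support size (number of non-zero components)
support = rng.choice(p, size=s, replace=False)

# Signal magnitude parameterized as sqrt(2 r log p); r = 4 is an
# illustrative choice placing the signal well above the threshold below.
r = 4.0
mu = np.zeros(p)
mu[support] = np.sqrt(2.0 * r * np.log(p))

x = mu + rng.standard_normal(p)

# Thresholding estimator: declare coordinate i in the support when |x_i|
# exceeds a level slightly above sqrt(2 log p), the typical magnitude of
# the largest of p independent standard normals (the 1.1 margin is ad hoc).
t = 1.1 * np.sqrt(2.0 * np.log(p))
estimated = set(np.flatnonzero(np.abs(x) > t))

true_support = set(support)
print("exact support recovery:", estimated == true_support)
print("false positives:", len(estimated - true_support),
      "| misses:", len(true_support - estimated))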
