About

The UBC/SFU Joint Statistics Seminar is jointly hosted by the graduate students of the UBC Department of Statistics and the SFU Department of Statistics and Actuarial Science. The Spring 2022 event is the second of two events taking place in the 2021/2022 academic year. The Fall 2021 event was organized by graduate students from SFU, and the Spring 2022 event is organized by graduate students from UBC. Over its 17-year history, the event has offered Statistics and Actuarial Science graduate and undergraduate students at both schools an opportunity to network with their peers and to attend accessible talks about the research work of their fellow students and faculty.

The Spring 2022 event includes talks given by six students (three from UBC and three from SFU) and one faculty member from UBC.

Check out more events hosted by the UBC Statistics Graduate Student Association.

Registration

This term’s event will be hosted virtually through zoom on March 12, 2022 starting at 9:30am. Register now through the registration form.

Schedule


Welcome Message

9:30am - 9:35am


Xiaomeng Ju (UBC)

9:35am - 10:00am

Tree-based boosting with functional data

We present a robust regression estimator for models with a functional explanatory variable and a scalar response. The algorithm uses decision trees constructed with multiple projections as the "base-learners", which we call "functional multi-index trees”. Compared with widely studied single-index estimators, our multi-index estimator enables modelling possible interactions between indices, allowing us to better approximate complex regression functions. The effectiveness of our method is illustrated using a numerical experiment that compares it with several linear and nonlinear regression estimators, including recently proposed nonparametric and semiparametric functional additive estimators.


Elijah Cavan (SFU)

10:00am - 10:25am

Optimal Roster Construction In Sports

In Major League Sports (MLB, NBA, NHL, NFL), it is the Front Office Staff (FOS) who make decisions about who plays for their respective team. The FOS bear the brunt of the responsibility for acquiring players through drafting, trading and signing players in free agency while typically contesting with maximum roster salary constraints. The players themselves are volatile assets of these teams- their value fluctuates with age and performance. A simple comparison can be made when viewing players as assets. The problem here is similar to that of optimizing your investment portfolio. The goal is ultimately to maximize your periodic returns while tolerating a fixed risk (degree of uncertainty/ potential loss). Each franchise may value assets differently, and some may only tolerate lower risk levels- these are examples of factors that introduce additional constraints into the model. In this talk we will detail the mathematical formulation of this problem as a constrained optimization problem- which can be solved with classical machine learning methods, but is also well posed as a problem to be solved on quantum computers.


Nikola Surjanovic (UBC)

10:25am - 10:50am

Music Theory: Mathematical Results and Curiosities

Note: I have decided to present a fun, math-related talk instead of a formal “research” talk at this year’s event. I hope there will be something new for everyone to enjoy.

What does the fundamental theorem of arithmetic have to do with music? What does it mean to play something “in tune” and can this notion be formalized to a certain extent? As mathematicians and statisticians, we are in a fortunate position to be able to understand some of these concepts. I plan to present a brief overview of music theory and some curious mathematical results that will (hopefully) surprise most people. Some possible topics include: the fundamental theorem of arithmetic and Pythagorean tuning, the twelfth root of two and equal temperament, and differential equations and overtones.


Break

10:50am - 11:00am


Wendy Wang (SFU)

11:00am - 11:25am

Selection Between Variance-Optimal and Bias-Optimal Designs When Some Two-Factor Interactions Are Important

Fractional factorial designs are widely used in many fields of studies because they allow us to study the effects of many factors on the response. However, as the primary interest of most experiments is for screening important factors, interactions are generally assumed to be negligible. When some two-factor interactions are important, we consider two types of designs which are namely, variance-optimal designs and bias-optimal designs. In our study, we compare these two types of designs by using a mean squared error criterion that takes effect sparsity into consideration. We obtain a close-form expression of this mean squared error criterion for the two types of designs. Under different levels of sparsity, comparison results are obtained for designs of 10, 12, 14, 20, 26, 28 runs, which will help practitioners to choose between the two types of designs.


Quanhan (Johnny) Xi (UBC)

11:25am - 11:50am

Identification of Latent Variables in Generative Modeling

Generative models aim to fit data by passing simple latent factors through complex generator functions. Leveraging modern functional parametrizations such as deep neural networks, generative modeling has excelled on tasks such as generating synthetic images and reconstruction with relatively simple prior latent distributions. In image data, for example, the latent factors are of much lower dimension than the pixel space, and we also wish to perform inference on them as a form of dimension reduction in hopes of recovering higher-level factors of variation. Recently, there has been an effort to examine the latent variable identifiability of these models equipped with the flexibility of arbitrary, non-linear functions. In this talk, we give a characterization of the indeterminacies of a general class of generative models, from which many existing results may be readily deduced. Armed with insights from our results, we propose a strongly identifiable generative model with a fully flexible generator by leveraging data arising from multiple environments. We show a proof-of-concept implementation of a linear generator, in a modern take on factor analysis.


Mandy Yao (SFU)

11:50am - 12:15pm

Effect of numerical errors on the convergence and accuracy of Markov Chains

Markov Chain Monte Carlo (MCMC) algorithms are a class of methods that is widely used to draw samples from a probability distribution and draw inferences. In particular, it constructs a stochastic process whose limiting distribution is the unknown distribution of interest in a given problem. For example, it can be used for Bayesian Calibration, which is the problem of determining a distribution for parameters in a physical model from noisy observations on the output of the model using a Bayesian approach. In practice, there could be numerical errors in the computer simulations that affect the MCMC samples, and this could have a significant effect on convergence and accuracy. To study this, we use some existing theory on the convergence of perturbed stable Markov Chains, which works with a “roundoff error” that arises from the finite precision of computer computation. We adapt this theory to derive some conditions that guarantee the convergence of MCMC algorithms when a more general model for computational error is used. We also quantify the effect of error on the accuracy of the MCMC samples using a time series model.


Break

12:15pm - 12:25pm


Faculty Speaker: Keegan Korthauer

12:25pm - 1:25pm

Plugging the leaky pipeline in Statistics

The “leaky pipeline” is a metaphor often used to describe the lack of women making career advancements in STEM fields. The field of statistics is not immune to this phenomenon. Although women currently earn graduate degrees in statistics at similar rates to men, they are underrepresented in senior roles and leadership positions in academia. In this talk, I will describe the visible and invisible barriers that were overcome by pioneering women in statistics, outline some of the major factors that continue to perpetuate the leaky pipeline, and discuss its outlook for the statistics community. My goal is for you to come away with (1) awareness and recognition of the challenge of retaining women in STEM fields, including statistics, and (2) concrete strategies to help repair the leaky pipeline in our field.

Link to the slides can be found here.


Networking

1:25pm


Sponsors

Past Seminars

Fall 2021 Spring 2021 Fall 2020 Spring 2020