Unlocking sports medicine research data while maintaining participant privacy via synthetic datasets

Working papers
Authors

John Warmenhoven, Andrew Harrison, Daniel Quintana, Giles Hooker, Edward Gunning, Norma Bargary

Published

August 20, 2020

Publication details

Working paper

Links

DOI

 

Happ and Greven (2018) developed a methodology for principal components analysis of multivariate functional data for data observed on different dimensional domains. Their approach relies on an estimation of univariate functional principal components for each univariate functional feature. In this paper, we present extensive simulations to investigate choosing the number of principal components to retain. We show empirically that the conventional approach of using a percentage of variance explained threshold for each univariate functional feature may be unreliable when aiming to explain an overall percentage of variance in the multivariate functional data, and thus we advise practitioners to be careful when using it.