Development of Fast Machine Learning Algorithms for False Discovery Rate Control in Large-Scale High-Dimensional Data (2024)
Abstract / truncated to 115 words
This dissertation develops false discovery rate (FDR) controlling machine learning algorithms for large-scale high-dimensional data. Ensuring the reproducibility of discoveries based on high-dimensional data is pivotal in numerous applications. The developed algorithms perform fast variable selection tasks in large-scale high-dimensional settings where the number of variables may be much larger than the number of samples. This includes large-scale data with up to millions of variables such as genome-wide association studies (GWAS). Theoretical finite sample FDR-control guarantees based on martingale theory have been established proving the trustworthiness of the developed methods. The practical open-source R software packages TRexSelector and tlars, which implement the proposed algorithms, have been published on the Comprehensive R Archive Network (CRAN). Extensive ...
machine learning – false discovery rate (fdr) control – high-dimensional data – sparsity – biomedical engineering – financial engineering
Information
- Author
- Machkour, Jasin
- Institution
- Technische Universität Darmstadt
- Supervisors
- Publication Year
- 2024
- Upload Date
- Nov. 27, 2024
The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.
The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.