AUEB Stats Seminars 25/2/2022: Subset selection for big data regression: an improved approach
Thu 24 Feb 2022 - 16:06
AUEB STATISTICS SEMINAR SERIES FEBRUARY 2022
Vasilis Chasiotis
Department of Statistics, AUEB, GR
Subset selection for big data regression: an improved approach
FRIDAY 25/2/2022
13:00
Room T102, AUEB New Building, 2 Troias str.
Εναλλακτικά συνδεθείτε μέσω TEAMS εδώ.
ABSTRACT
In the big data era researchers face a series of problems. Such big data occur in several cases. Even standard approaches/methodologies like linear regression can be difficult or problematic with huge volumes of data. For example, traditional approaches for regression in big datasets may suffer due to the large sample size, since they involve inverting huge data matrices or even because the data cannot fit to the memory. Among others, a simple approach may be based on selecting subdata to run the regression. Some approaches for big data regression, already existing in the current literature, are based on selecting data points using information criteria, providing algorithms as well. Some of these approaches are based on the combinatorial properties of an orthogonal array. In the present paper we wish to improve the algorithms proposed in these approaches. We describe an approach, providing a new algorithm whose gain is shown through simulation experiments and analysis of real data. A discussion about the parameters of the proposed algorithm is also provided in order to clarify the trade-offs between execution time and information gain.
- AUEB Stats Seminars 27/6/2022: Statistical Foundation of Deep Learning: Application to Big Data by Taps Maiti (Michigan State University)
- AUEB Stats Seminars 6/5/2022: Detection of two-way outliers in multivariate data and application to cheating detection in educational tests by Irini Moustaki (LSE)
- AUEB Stats Seminars 18/3/2022: Hypothesis Testing for the Covariance Matrix in High-Dimensional Transposable Data with Kronecker Product Dependence Structure by Anestis Touloumis (University of Brighton)
- AUEB Stats Seminars 14/12/2022: A new non-parametric Cross-Spectrum Estimator by Evanggelos Ioannidis (Department of Statistics, AUEB)
- AUEB Stats Seminars 24/3/2021: Improved estimation of partially-specified models by I. Kosmidis (Univ. of Warwick)
Permissions in this forum:
You cannot reply to topics in this forum