TY - JOUR
T1 - Comparison of Selection Optimal Knot using Cross Validation and Generalized Cross Validation for Nonparametric Regression Truncated Spline Longitudinal Data
AU - Pramudita, Ditia Tahta
AU - Nyoman Budiantara, I.
AU - Ratnasari, Vita
N1 - Publisher Copyright:
© 2024 American Institute of Physics Inc.. All rights reserved.
PY - 2024/6/7
Y1 - 2024/6/7
N2 - Research on nonparametric regression is mostly done nowadays, it is more flexible and does not require assumptions like parametric regression. One of the well-known nonparametric regression modeling is the truncated spline. Truncated spline regression has advantages such as being able to model data patterns that vary in behavior patterns in different sub-intervals, in addition, truncated spline regression has statistical interpretation and is easily accessible to researchers. In truncated spline regression, there are nodes that are very flexible in estimating the behavior of the data. However, in practice knot points must be selected using various methods such as Cross Validation (CV) and Generalized Cross Validation (GCV). Several researchers have developed CV and GCV methods to select optimal knot points in nonparametric cross-section regression. That research using cross-section data turns out to still contain many weaknesses. The cross-section model can only be used for one subject (one region), it cannot be used to model each subject. In fact, real data requires different modeling for each region. The cross-section model also cannot describe the behavior of the data from time to time (series). Therefore, in this study, we will examine the comparison of CV and GCV methods in choosing the optimal knot in nonparametric regression for longitudinal data using unemployment rate in Central Java with one and two knots and with the criteria of the coefficient of determination value model. The result of this research is the formulation of the CV, GCV and UBR methods which are carried out on longitudinal data. In its application to the 2012-2021 unemployment rate data in the province of Central Java, the results show that the best model by looking at the minimum knot points and the coefficient of determination is found in the CV model with two knot points with a value of 96% and a minimum knot point of 2.301
AB - Research on nonparametric regression is mostly done nowadays, it is more flexible and does not require assumptions like parametric regression. One of the well-known nonparametric regression modeling is the truncated spline. Truncated spline regression has advantages such as being able to model data patterns that vary in behavior patterns in different sub-intervals, in addition, truncated spline regression has statistical interpretation and is easily accessible to researchers. In truncated spline regression, there are nodes that are very flexible in estimating the behavior of the data. However, in practice knot points must be selected using various methods such as Cross Validation (CV) and Generalized Cross Validation (GCV). Several researchers have developed CV and GCV methods to select optimal knot points in nonparametric cross-section regression. That research using cross-section data turns out to still contain many weaknesses. The cross-section model can only be used for one subject (one region), it cannot be used to model each subject. In fact, real data requires different modeling for each region. The cross-section model also cannot describe the behavior of the data from time to time (series). Therefore, in this study, we will examine the comparison of CV and GCV methods in choosing the optimal knot in nonparametric regression for longitudinal data using unemployment rate in Central Java with one and two knots and with the criteria of the coefficient of determination value model. The result of this research is the formulation of the CV, GCV and UBR methods which are carried out on longitudinal data. In its application to the 2012-2021 unemployment rate data in the province of Central Java, the results show that the best model by looking at the minimum knot points and the coefficient of determination is found in the CV model with two knot points with a value of 96% and a minimum knot point of 2.301
UR - http://www.scopus.com/inward/record.url?scp=85196087135&partnerID=8YFLogxK
U2 - 10.1063/5.0211356
DO - 10.1063/5.0211356
M3 - Conference article
AN - SCOPUS:85196087135
SN - 0094-243X
VL - 3132
JO - AIP Conference Proceedings
JF - AIP Conference Proceedings
IS - 1
M1 - 020011
T2 - 3rd International Conference on Natural Sciences, Mathematics, Applications, Research, and Technology, ICON-SMART 2022
Y2 - 3 June 2022 through 4 June 2022
ER -