Principal Components and Weights
The information presented here is sufficient to build your own predictor that gives identical predictions to those given on the PCLR web page.
The stepwise variable selection process selected the 6 out of 20 principal components given below, where t(vector) indicates a transpose of the vector. We retain the original component number below.
C1 = t(0.6235765994, 0.1903392166, -0.1097760634, 0.0017843960, -0.0137391414,
-0.0374235944, 0.0547921825, 0.1743636019, -0.0466588092, -0.0378288784,
-0.2239923733, -0.0404317208, 0.0539077313, -0.1870887219, 0.0499313840,
-0.6158596982, -0.0887434335, 0.0184131253, 0.0248026406, 0.2096315570,
-0.0004239778)
C2 = t(-0.56028258, -0.13657457, 0.13093194, 0.19503135, 0.09444821,
0.01942719, 0.29956552, 0.22695880, 0.07011352, 0.19174166,
0.03902809, 0.16894649, -0.02829899, 0.01237345, -0.17997055,
-0.58134825, -0.09493956, 0.01450135, 0.09291877, 0.02542817,
-0.01193456)
C3 = t(0.1010486134, 0.0460028169, -0.0339251170, 0.0968264713, 0.1311255843,
0.0134880321, -0.1327912652, 0.3495371320, 0.0124851112, -0.0872309443,
0.6770606493, -0.4731140177, -0.0870984644, -0.0770638924, -0.2485609348,
-0.0001066069, -0.2232048965, 0.0067561059, -0.0021940592, -0.0690403180,
0.0017229925)
C9 = t(-0.227160322, 0.376020594, -0.255651656, -0.271567366, 0.016440412,
-0.202007959, -0.201694378, 0.218374166, -0.030829450, -0.176967859,
0.302703973, 0.506021106, 0.028348005, -0.289240140, 0.164197747,
-0.031960294, 0.142275970, -0.057567127, -0.110783986, 0.101048564,
0.003661198)
C10 = t(-0.0640423292, 0.0790151627, 0.6668058394, -0.4479385313, 0.1113721713,
0.0550292876, -0.3066414907, 0.1483251665, 0.0701514864, 0.0186540502,
-0.1270414487, -0.0209636049, 0.0409272023, 0.0544716168, 0.1381974451,
-0.0294962668, -0.3850550093, 0.0055203273, -0.0988590882, 0.0915680135,
-0.0005185123)
In the vectors, the order of the features are:
Alanine, Arginine, Asparagine, Aspartic Acid, Cystein,
Glutamine, Glutamic acid, Glycine, Histidine, Isoleucine,
Leucine, Methionine, Phenylalanine, Proline, Serine,
Threonine, Tryptophan, Tyrosine, Valine, and the Variance Term.
The weights for the logistic regression are:
Intercept | C1 | C2 | C3 | C9 | C10 |
-4.314 | -8.389 | -35.990 | -15.167 | 27.199 | 29.262
|
Back to main page.