Appendix A.1. Tables
Table A1.
Software details of the four computers and hive-specific and model transfer NLR grid search programs; column 1 contains the computer IDs from the CSV files with complete experimental results in the
Supplemental Materials; Foral Fossa is Ubuntu 20.04 LTS; Bionic Beaver is Ubuntu 18.04 LTS; GCC is Gnu C/C++ compiler.
Table A1.
Software details of the four computers and hive-specific and model transfer NLR grid search programs; column 1 contains the computer IDs from the CSV files with complete experimental results in the
Supplemental Materials; Foral Fossa is Ubuntu 20.04 LTS; Bionic Beaver is Ubuntu 18.04 LTS; GCC is Gnu C/C++ compiler.
Computer ID | Operating System | Python | Numpy | Sklearn | Perl |
---|
OGP | Foral Fossa | 3.8.13 (GCC 7.5.0) | 1.23.1 | 1.1.2 | 5.30.0 |
PWE | Bionic Beaver | 3.6.9 (GCC 8.4.0) | 1.19.5 | 0.24.2 | 5.26.1 |
OPC | Foral Fossa | 3.8.10 (GCC 9.4.0) | 1.23.4 | 1.1.3 | 5.30.0 |
EDW | Bionic Beaver | 3.6.9 (GCC 8.4.0) | 1.19.5 | 0.24.2 | 5.26.1 |
Table A2.
Hardware details of the four computers on which hive-specific and model transfer NLR grid search programs were executed; column 1 contains the computer IDs in the CSV files with complete experimental results in
Supplemental Materials.
Table A2.
Hardware details of the four computers on which hive-specific and model transfer NLR grid search programs were executed; column 1 contains the computer IDs in the CSV files with complete experimental results in
Supplemental Materials.
Computer ID | Model | Arc | CPU | Num CPUs |
---|
OGP | GEFORCE RTX 2080 Ti | x86_64 | i7-9700K@3.60 GHz | 8 |
PWE | Dell PowerEdge T130 | x86_64 | E3-1230 v6@3.50 GHz | 8 |
OPC | Dell Optiplex 9020 | x86_64 | i7-4770 CPU@3.40 GHz | 8 |
EDW | HP Z240 | x86_64 | i7-6700 CPU@3.40 GHz | 8 |
Table A3.
Independent variables (INV) of the LR and NLR models; TIME models have 2 INV (variables 1,2); WEATHER models—8 (variables 3–10); EMR models—6 (variables 11–16; ALL models—16; W/m—watts per square meter; mG—milligauss.
Table A3.
Independent variables (INV) of the LR and NLR models; TIME models have 2 INV (variables 1,2); WEATHER models—8 (variables 3–10); EMR models—6 (variables 11–16; ALL models—16; W/m—watts per square meter; mG—milligauss.
Num | INV Name | INV Category | INV Description |
---|
1 | timept | TIME | time point, integer in [1, 53] (time of day) |
2 | timept2 | TIME | timept |
3 | pressure | WEATHER | atmospheric pressure (millibars) |
4 | pressure2 | WEATHER | pressure |
5 | humidity | WEATHER | relative humidity (percent) |
6 | humidity2 | WEATHER | humidity |
7 | windspeed | WEATHER | wind speed (miles per hour) |
8 | windspeed2 | WEATHER | windspeed |
9 | temp | WEATHER | ambient temperature (Fahrenheit) |
10 | temp2 | WEATHER | temp |
11 | swradusu | EMR | USU Climate Center short wave radiation (W/m) |
12 | swradusu2 | EMR | swradusu |
13 | avgemf | EMR | mean strength of detected EMF (mG) |
14 | avgemf2 | EMR | avgemf |
15 | avgtotden | EMR | average total RF power flow per unit area (Watts/m) |
16 | avgtotden2 | EMR | avgtotdens |
Table A4.
Dependent variables (DNV) of the LR and NLR models; IN—number (non-negative integer) of incoming bee motions; OUT—number (non-negative integer) of outgoing bee motions; (IN − OUT)—difference of IN and OUT; (OUT − IN)—difference of OUT and IN; (OUT + IN)—sum of OUT and IN.
Table A4.
Dependent variables (DNV) of the LR and NLR models; IN—number (non-negative integer) of incoming bee motions; OUT—number (non-negative integer) of outgoing bee motions; (IN − OUT)—difference of IN and OUT; (OUT − IN)—difference of OUT and IN; (OUT + IN)—sum of OUT and IN.
Num | DNV Name | DNV Description |
---|
1 | CIN | IN |
2 | COUT | OUT |
3 | COUTMIN | (OUT − IN) if OUT ≥ IN; −|OUT − IN| otherwise |
4 | CINMOUT | (IN − OUT) if IN ≥ OUT; −|IN − OUT| otherwise |
5 | COUTPIN | (OUT + IN) |
Table A5.
Structural components of regression model types; RGR abbreviates “regressor”; LR—linear regressor; RFR—random forest regressor; SVMR—support vector machine regressor; total number of LR model types is , where 1 denotes LR; total number of NLR model types is , where 2 denotes RFR and SVMR.
Table A5.
Structural components of regression model types; RGR abbreviates “regressor”; LR—linear regressor; RFR—random forest regressor; SVMR—support vector machine regressor; total number of LR model types is , where 1 denotes LR; total number of NLR model types is , where 2 denotes RFR and SVMR.
Hive | Month | INV | RGR | DNV |
---|
R45 | 5 | TIME | LR | CIN |
R411 | 6 | WEATHER | RFR | COUT |
| 7 | EMR | SVMR | CINMOUT |
| 8 | ALL | | COUTMIN |
| 9 | | | COUTPIN |
Table A6.
RFR hyperparameters; see online documentation of the scikitlearn library for details at
www.scikitlearn.org; the total number of models in the hive-specific grid search is the number of hives × the number of months × the number of INV categories × the number of DNV × the number of values in the NT range × the number of values in the MTD range =
=
(see
Table A5 for the number of structural components in RFRs).
Table A6.
RFR hyperparameters; see online documentation of the scikitlearn library for details at
www.scikitlearn.org; the total number of models in the hive-specific grid search is the number of hives × the number of months × the number of INV categories × the number of DNV × the number of values in the NT range × the number of values in the MTD range =
=
(see
Table A5 for the number of structural components in RFRs).
Num | Name | Range | Description |
---|
1 | NT | | number of trees in RFR; 101 values |
2 | MTD | | maximum tree depth in RFR; 16 values |
Table A7.
SVMR hyperparameters for the non-polynomial kernel models; the hyperparameter C controls the softness of the margin (the larger it is, the fewer points lie in the margin); the hyperparameter
specifies the width of the tube within which no penalty is associated in the training loss function with points predicted within the distance of
from the actual value; see online documentation at
www.scikitlearn.org for more details; the total number of non-polynomial kernel SVMR models is the number of hives × the number of months × the number of INV categories × the number of DNV × the number of values in the C range × the number of values in the
range × the number of values in the
range × the number of kernels =
= 151,200. (See
Table A5 for the number of structural components in SVMRs).
Table A7.
SVMR hyperparameters for the non-polynomial kernel models; the hyperparameter C controls the softness of the margin (the larger it is, the fewer points lie in the margin); the hyperparameter
specifies the width of the tube within which no penalty is associated in the training loss function with points predicted within the distance of
from the actual value; see online documentation at
www.scikitlearn.org for more details; the total number of non-polynomial kernel SVMR models is the number of hives × the number of months × the number of INV categories × the number of DNV × the number of values in the C range × the number of values in the
range × the number of values in the
range × the number of kernels =
= 151,200. (See
Table A5 for the number of structural components in SVMRs).
Num | Name | Range/Value |
---|
1 | C | ; 14 values |
2 | | {scale, auto} |
3 | | ; 9 values |
4 | kernel | {linear, rbf, sigmoid} |
5 | cache_size | 1000; this is just one value common to all models |
Table A8.
SVMR hyperparameters for the polynomial (poly) kernel models; the C,
,
ranges and cache_size value are the same as for the non-poly kernel models in
Table A7; see online documentation at
www.scikitlearn.org for more details; total number of polynomial kernel SVMR models is number of hives × number of months × number of INV categories × number of DNV categories × number of values in the C range × number of values in the
range × number of values in the
range × number of kernels × number of degree values =
= 201,600 (see
Table A5 for the numbers of structural components in SVMRs).
Table A8.
SVMR hyperparameters for the polynomial (poly) kernel models; the C,
,
ranges and cache_size value are the same as for the non-poly kernel models in
Table A7; see online documentation at
www.scikitlearn.org for more details; total number of polynomial kernel SVMR models is number of hives × number of months × number of INV categories × number of DNV categories × number of values in the C range × number of values in the
range × number of values in the
range × number of kernels × number of degree values =
= 201,600 (see
Table A5 for the numbers of structural components in SVMRs).
Num | Name | Range/Value |
---|
1 | kernel | poly; this is just one value common to all models |
2 | degree | [2, 3, 4, 5]; 4 values |
Table A9.
(Pearson,
p value) between AVGEMF(A), TEMP(T), and AVGEMF(A), HUMID(H) for combined R45 and R411 data; Pearson’s,
p values are rounded to two and four decimals, respectively; months are in columns; monthly number of observations (N) are: 774 (5); 1361 (6); 1578 (7); 1422 (8); 1575 (9); the
Supplementary Materials include the spreadsheet ModelCVFits.xlsx which contains, under the tab “Correlations”, the tables of all Pearson’s of (EMR,INV) and (WEATHER,INV) and the corresponding
p values.
Table A9.
(Pearson,
p value) between AVGEMF(A), TEMP(T), and AVGEMF(A), HUMID(H) for combined R45 and R411 data; Pearson’s,
p values are rounded to two and four decimals, respectively; months are in columns; monthly number of observations (N) are: 774 (5); 1361 (6); 1578 (7); 1422 (8); 1575 (9); the
Supplementary Materials include the spreadsheet ModelCVFits.xlsx which contains, under the tab “Correlations”, the tables of all Pearson’s of (EMR,INV) and (WEATHER,INV) and the corresponding
p values.
| 5 | 6 | 7 | 8 | 9 |
---|
A,T | (0.97, <0.0001) | (0.96, <0.0001) | (0.93, <0.0001) | (0.97, <0.0001) | (0.93, <0.0001) |
A,H | (−0.85, 0.0001) | (−0.89, <0.0001) | (−0.90, <0.0001) | (−0.89, <0.0001) | (−0.76, <0.0001) |
Table A10.
LR results with 70/30 train/test split; INV and DNV are in rows, months in columns; in entry , for R45, for R411 for column month; lowest and highest values are bolded; all reals are rounded two decimals.
Table A10.
LR results with 70/30 train/test split; INV and DNV are in rows, months in columns; in entry , for R45, for R411 for column month; lowest and highest values are bolded; all reals are rounded two decimals.
INV,DNV/Month | 5 | 6 | 7 | 8 | 9 |
---|
TIME,CIN | 0.16; 0.31 | 0.10; 0.31 | 0.33; 0.50 | 0.22; 0.30 | 0.51; 0.40 |
TIME,COUT | 0.19; 0.41 | 0.11; 0.28 | 0.29; 0.46 | 0.17; 0.26 | 0.50; 0.42 |
TIME,COUTPIN | 0.20; 0.38 | 0.14; 0.36 | 0.29; 0.46 | 0.21; 0.22 | 0.50; 0.38 |
WEATHER,CIN | 0.18; 0.42 | 0.16; 0.55 | 0.34; 0.43 | 0.23; 0.40 | 0.49; 0.46 |
WEATHER,COUT | 0.19; 0.53 | 0.19; 0.51 | 0.32; 0.47 | 0.18; 0.34 | 0.45; 0.44 |
WEATHER,COUTPIN | 0.27; 0.58 | 0.21; 0.52 | 0.29; 0.40 | 0.29; 0.32 | 0.48; 0.42 |
EMR,CIN | 0.15; 0.42 | 0.16; 0.47 | 0.31; 0.42 | 0.28; 0.43 | 0.49; 0.39 |
EMR,COUT | 0.23; 0.56 | 0.18; 0.46 | 0.29; 0.47 | 0.22; 0.40 | 0.49; 0.47 |
EMR,COUTPIN | 0.22; 0.51 | 0.17; 0.45 | 0.29; 0.41 | 0.32; 0.34 | 0.51; 0.39 |
ALL,CIN | 0.28; 0.48 | 0.32; 0.60 | 0.37; 0.56 | 0.41; 0.48 | 0.57; 0.50 |
ALL,COUT | 0.30; 0.64 | 0.23; 0.55 | 0.35; 0.57 | 0.23; 0.44 | 0.57; 0.54 |
ALL,COUTPIN | 0.36; 0.66 | 0.28; 0.59 | 0.34; 0.49 | 0.37; 0.37 | 0.60; 0.48 |
Table A11.
(MEAN,STD) of maximum in top 30% of INV→SVMR models; minimum and maximum means are bolded.
Table A11.
(MEAN,STD) of maximum in top 30% of INV→SVMR models; minimum and maximum means are bolded.
MONTH | RBF | SIGMOID | LINEAR | POLY | AUTO | SCALE |
---|
5 | (0.537, 0.115) | (0.514, 0.120) | (0.542, 0.103) | nan | (0.542, 0.112) | (0.519, 0.112) |
6 | (0.531, 0.102) | (0.505, 0.104) | (0.544, 0.081) | nan | (0.551, 0.090) | (0.486, 0.094) |
7 | (0.536, 0.054) | (0.540, 0.019) | (0.555, 0.028) | nan | (0.547, 0.031) | (0.538, 0.046) |
8 | (0.428, 0.028) | (0.431, 0.038) | (0.415, 0.039) | nan | (0.426, 0.038) | (0.426, 0.033) |
9 | (0.594, 0.044) | (0.609, 0.054) | (0.630, 0.046) | nan | (0.605, 0.047) | (0.614, 0.052) |
Table A12.
ALL→RGR→COUTPIN, RGR is specified in columns; in entry , x is the mean for hive R45 and y is the mean for R411; highest values are bolded for each month and hive.
Table A12.
ALL→RGR→COUTPIN, RGR is specified in columns; in entry , x is the mean for hive R45 and y is the mean for R411; highest values are bolded for each month and hive.
MONTH | LR R45; R411 | RFR R45; R411 | SVMR R45; R411 |
---|
5 | 0.36; 0.66 | 0.44; 0.72 | 0.39; 0.68 |
6 | 0.28; 0.59 | 0.46; 0.71 | 0.42; 0.67 |
7 | 0.34; 0.49 | 0.44; 0.63 | 0.40; 0.61 |
8 | 0.37; 0.37 | 0.52; 0.60 | 0.43; 0.49 |
9 | 0.60; 0.48 | 0.74; 0.70 | 0.69; 0.66 |
Table A13.
(MEAN, STD) of maximum of ALL→RFR→COUTPIN in the hive-specific grid search on computers OPC and PWE; computer IDs are names of columns 1,2; in cell is , x, y are the mean max and its STD for hive R45, z, w are the mean max and its STD for hive R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
Table A13.
(MEAN, STD) of maximum of ALL→RFR→COUTPIN in the hive-specific grid search on computers OPC and PWE; computer IDs are names of columns 1,2; in cell is , x, y are the mean max and its STD for hive R45, z, w are the mean max and its STD for hive R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
MONTH | OPC | PWE |
---|
5 | 0.43; 0.04 0.71; 0.03 | 0.42; 0.05 0.71; 0.03 |
6 | 0.46; 0.03 0.70; 0.02 | 0.46; 0.03 0.70; 0.02 |
7 | 0.43; 0.03 0.62; 0.02 | 0.43; 0.02 0.61; 0.02 |
8 | 0.51; 0.03 0.59; 0.04 | 0.51; 0.04 0.59; 0.03 |
9 | 0.73; 0.01 0.70; 0.02 | 0.73; 0.01 0.70; 0.02 |
Table A14.
(MEAN, STD) of maximum of ALL→RFR→COUTPIN in the hive-specific grid search on computers EDW and OGP; computer IDs are names of columns 1,2; in cell is , x, y are the mean max and itsSTD for hive R45, z, w are the mean max and its STD for hive R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
Table A14.
(MEAN, STD) of maximum of ALL→RFR→COUTPIN in the hive-specific grid search on computers EDW and OGP; computer IDs are names of columns 1,2; in cell is , x, y are the mean max and itsSTD for hive R45, z, w are the mean max and its STD for hive R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
MONTH | EDW | OGP |
---|
5 | 0.42; 0.04 0.71; 0.02 | 0.42; 0.05 0.71; 0.03 |
6 | 0.46; 0.03 0.70; 0.02 | 0.46; 0.03 0.70; 0.02 |
7 | 0.43; 0.03 0.62; 0.02 | 0.43; 0.03 0.61; 0.02 |
8 | 0.51; 0.01 0.60; 0.04 | 0.51; 0.03 0.60; 0.04 |
9 | 0.73; 0.01 0.70; 0.02 | 0.73; 0.01 0.70; 0.02 |
Table A15.
(MEAN, STD) of maximum of ALL→SVMR→COUTPIN in the hive-specific grid search on computers OPC and PWE; in cell is , x, y are the mean max and its STD for R45, z, w are the mean max and its STD for R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
Table A15.
(MEAN, STD) of maximum of ALL→SVMR→COUTPIN in the hive-specific grid search on computers OPC and PWE; in cell is , x, y are the mean max and its STD for R45, z, w are the mean max and its STD for R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
MONTH | OPC | PWE |
---|
5 | 0.38; 0.05 0.66; 0.03 | 0.38; 0.05 0.66; 0.03 |
6 | 0.40; 0.05 0.40; 0.04 | 0.40; 0.04 0.40; 0.04 |
7 | 0.39; 0.02 0.60; 0.02 | 0.39; 0.03 0.60; 0.02 |
8 | 0.42; 0.03 0.49; 0.04 | 0.42; 0.04 0.49; 0.02 |
9 | 0.68; 0.02 0.66; 0.03 | 0.68; 0.02 0.65; 0.02 |
Table A16.
Hive-specific grid search run times for INV→RGR→DNV, each INV category on each computer; RGR is RFR or SVMR; DNV, H (Hive) and M (Month) take on all possible values; all reals are rounded to two decimal places; minimum and maximum run times are bolded for each model and computer.
Table A16.
Hive-specific grid search run times for INV→RGR→DNV, each INV category on each computer; RGR is RFR or SVMR; DNV, H (Hive) and M (Month) take on all possible values; all reals are rounded to two decimal places; minimum and maximum run times are bolded for each model and computer.
INV,DNV | Computer | Time (RFR; SVMR) (h) |
---|
TIME,DNV | OPC | 23.19; 25.17 |
TIME,DNV | OGP | 10.65; 22.31 |
TIME,DNV | PWE | 23.31; 25.88 |
TIME,DNV | EDW | 23.57; 26.67 |
WEATHER,DNV | OPC | 70.20; 192.41 |
WEATHER,DNV | OGP | 56.39; 158.38 |
WEATHER,DNV | PWE | 73.17; 178.38 |
WEATHER,DNV | EDW | 75.97; 182.41 |
EMR,DNV | OPC | 55.54; 97.80 |
EMR,DNV | OGP | 44.54; 63.00 |
EMR,DNV | PWE | 56.75; 89.59 |
EMR,DNV | EDW | 58.57; 91.12 |
ALL,DNV | OPC | 113.50; 58.40 |
ALL,DNV | OGP | 93.64; 47.58 |
ALL,DNV | PWE | 121.59; 56.10 |
ALL,DNV | EDW | 121.59; 51.12 |
Table A17.
Hive-specific grid search run time for all RFR and SVMR models for all INV, DNV, hives, months and computers; total model run times are the sums of the appropriate run times from the columns in
Table A16; for example, 80.72 = 23.19 + 10.65 + 23.31 + 23.57; reals are rounded to two decimal places; minimum and maximum run times are bolded in each column.
Table A17.
Hive-specific grid search run time for all RFR and SVMR models for all INV, DNV, hives, months and computers; total model run times are the sums of the appropriate run times from the columns in
Table A16; for example, 80.72 = 23.19 + 10.65 + 23.31 + 23.57; reals are rounded to two decimal places; minimum and maximum run times are bolded in each column.
INV,DNV | RFR TIME (h) | SVMR TIME (h) |
---|
TIME,DNV | 80.72 | 100.03 |
WEATHER,DNV | 275.73 | 711.58 |
EMR,DNV | 215.40 | 341.51 |
ALL,DNV | 450.32 | 213.20 |
TOTAL | 1022.17 | 1366.32 |
Table A18.
Model transfer grid search run time of INV→RGR→DNV models on PWE and EDW computers; RGR is RFR or SVMR; RFR runs were executed on PWE and EDW; SVMR model transfer runs on PWE; INV is specified in rows; DNV, H(Hive) and M(Month) take on all possible values; each model was trained on R45 data and tested on R411 data and then trained on R411 data and tested on R45; all reals are rounded to two decimals.
Table A18.
Model transfer grid search run time of INV→RGR→DNV models on PWE and EDW computers; RGR is RFR or SVMR; RFR runs were executed on PWE and EDW; SVMR model transfer runs on PWE; INV is specified in rows; DNV, H(Hive) and M(Month) take on all possible values; each model was trained on R45 data and tested on R411 data and then trained on R411 data and tested on R45; all reals are rounded to two decimals.
INV,DNV | Computer | Time (RFR; SVMR) (h) |
---|
TIME,DNV | PWE | 2.47; 5.47 |
TIME,DNV | EDW | 2.51; nan |
WEATHER,DNV | PWE | 10.23; 47.10 |
WEATHER,DNV | EDW | 10.39; nan |
EMR,DNV | PWE | 7.52; 15.82 |
EMR,DNV | EDW | 7.71; nan |
ALL,DNV | PWE | 17.22; 15.41 |
ALL,DNV | EDW | 17.63; nan |
Table A19.
Total model transfer grid search run time of RFR and SVMR models for all INV, DNV, hives and months; RFR time in each row is the mean of the corresponding RFR times on PWE and EDW in
Table A18; thus, 2.49 = (2.47 + 2.51)/2, 10.31 = (10.23 + 10.39)/2, etc.; TOTAL row is the sum of the run time means in the columns; reals are rounded to two decimal places.
Table A19.
Total model transfer grid search run time of RFR and SVMR models for all INV, DNV, hives and months; RFR time in each row is the mean of the corresponding RFR times on PWE and EDW in
Table A18; thus, 2.49 = (2.47 + 2.51)/2, 10.31 = (10.23 + 10.39)/2, etc.; TOTAL row is the sum of the run time means in the columns; reals are rounded to two decimal places.
INV,DNV | RFR TIME (h) | SVMR TIME (h) |
---|
TIME,DNV | 2.49 | 5.47 |
WEATHER,DNV | 10.31 | 47.10 |
EMR,DNV | 7.62 | 15.82 |
ALL,DNV | 17.43 | 15.41 |
TOTAL | 37.85 | 83.80 |
Table A20.
Power use of the computers OPC, PWE and EDW running hive-specific RFR and SVMR grid searches with 10-fold cross validation and a 70/30 train/test split; reals in columns 2, 3 and 4 are in kilowatt-hours (kW-h) from the Gardner Bender(TM) Power Meter PM3000 (no rounding); column COMP is the cumulative power amount (CPA) of the computer by itself for 24 h; column COMP+RFR is the CPA of the computer running our RFR software for 24 h; column COMP+SVMR contains the CPA of the computer running our SVMR software for 24 h; reals in columns (COMP+RFR)/24, (COMP+SVMR)/24 are the row values in (COMP + RFR) and (COMP+SVMR) divided by 24 and rounded to two decimals; these power use rates are in kW-h/h; the CPAs and rates of only RFR and SVMR can be estimated from the table as (COMP + RFR) − COMP, (COMP + SVMR) − COMP, ((COMP + RFR) − COMP)/24, ((COMP + SVMR) − COMP)/24; row MEAN contains the means of the column values.
Table A20.
Power use of the computers OPC, PWE and EDW running hive-specific RFR and SVMR grid searches with 10-fold cross validation and a 70/30 train/test split; reals in columns 2, 3 and 4 are in kilowatt-hours (kW-h) from the Gardner Bender(TM) Power Meter PM3000 (no rounding); column COMP is the cumulative power amount (CPA) of the computer by itself for 24 h; column COMP+RFR is the CPA of the computer running our RFR software for 24 h; column COMP+SVMR contains the CPA of the computer running our SVMR software for 24 h; reals in columns (COMP+RFR)/24, (COMP+SVMR)/24 are the row values in (COMP + RFR) and (COMP+SVMR) divided by 24 and rounded to two decimals; these power use rates are in kW-h/h; the CPAs and rates of only RFR and SVMR can be estimated from the table as (COMP + RFR) − COMP, (COMP + SVMR) − COMP, ((COMP + RFR) − COMP)/24, ((COMP + SVMR) − COMP)/24; row MEAN contains the means of the column values.
ID | COMP | COMP + RFR | COMP + SVMR | (COMP + RFR)/24 | (COMP + SVMR)/24 |
---|
OPC | 0.833 | 1.549 | 1.491 | 0.06 | 0.06 |
PWE | 1.690 | 1.992 | 2.227 | 0.08 | 0.09 |
EDW | 0.949 | 2.306 | 1.769 | 0.10 | 0.07 |
MEAN | 1.157 | 1.949 | 1.829 | 0.08 | 0.07 |
Table A21.
(MEAN, STD) of the maximum of ALL→SVMR→COUTPIN in hive-specific grid search on computers EDW and OGP; in cell , x, y are the mean max and its STD for R45, z, w are the mean max and its STD for R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
Table A21.
(MEAN, STD) of the maximum of ALL→SVMR→COUTPIN in hive-specific grid search on computers EDW and OGP; in cell , x, y are the mean max and its STD for R45, z, w are the mean max and its STD for R411; reals are rounded to two decimals; maximum and minimum are bolded; ties broken arbitrarily.
MONTH | EDW | OGP |
---|
5 | 0.39; 0.05 0.66; 0.03 | 0.38; 0.04 0.66; 0.02 |
6 | 0.66; 0.03 0.65; 0.02 | 0.65; 0.03 0.66; 0.03 |
7 | 0.38; 0.02 0.60; 0.02 | 0.38; 0.02 0.59; 0.01 |
8 | 0.41; 0.04 0.50; 0.05 | 0.42; 0.03 0.49; 0.03 |
9 | 0.68; 0.02 0.65; 0.02 | 0.68; 0.02 0.66; 0.02 |
Appendix A.2. Figures
Figure A1.
INV bar plots of maximum scores of INV→RGR→DNV models; RGR is RFR (top row) and SVMR (bottom row); Hive (H) is hive R45 (left column) and hive R411 (right column); Month (M) is the x-axis; INV and DNV take on all possible values.
Figure A1.
INV bar plots of maximum scores of INV→RGR→DNV models; RGR is RFR (top row) and SVMR (bottom row); Hive (H) is hive R45 (left column) and hive R411 (right column); Month (M) is the x-axis; INV and DNV take on all possible values.
Figure A2.
DNV bar plots of the maximum scores of INV→RGR→DNV models; RGR is RFR (top row) and SVMR (bottom row); Hive (H) is hive R45 (left column) and R411 (right column); Month (M) is the x-axis; INV and DNV take on all possible values.
Figure A2.
DNV bar plots of the maximum scores of INV→RGR→DNV models; RGR is RFR (top row) and SVMR (bottom row); Hive (H) is hive R45 (left column) and R411 (right column); Month (M) is the x-axis; INV and DNV take on all possible values.
Figure A3.
Mean value (dots) and corresponding standard deviation (vertical lines) of RFR hyperparameters (i.e., number of trees (NT)—blue and maximum tree depth—orange), of the top 30% (i.e., ranked by ) of RFR models found with 10-fold cross validation and 70/30 train/test split; upper and lower bounds of the number of trees—light red; upper and lower bounds of the maximum tree depth (MTD)—dark red; ≈ denotes removal of unused sub-ranges on the y-axis for compactness.
Figure A3.
Mean value (dots) and corresponding standard deviation (vertical lines) of RFR hyperparameters (i.e., number of trees (NT)—blue and maximum tree depth—orange), of the top 30% (i.e., ranked by ) of RFR models found with 10-fold cross validation and 70/30 train/test split; upper and lower bounds of the number of trees—light red; upper and lower bounds of the maximum tree depth (MTD)—dark red; ≈ denotes removal of unused sub-ranges on the y-axis for compactness.
Figure A4.
(a) Mean value (dots) and corresponding standard deviations (vertical lines) of SVMR hyperparameters (i.e., C—blue, epsilon—orange) of the top 30% (ranked by ) of SVMR models; upper and lower bounds of C—light red; bounds of epsilon—dark red; (b) frequency of rbf, sigmoid, linear, poly kernels (top); auto and scale for gamma hyperparameter (bottom).
Figure A4.
(a) Mean value (dots) and corresponding standard deviations (vertical lines) of SVMR hyperparameters (i.e., C—blue, epsilon—orange) of the top 30% (ranked by ) of SVMR models; upper and lower bounds of C—light red; bounds of epsilon—dark red; (b) frequency of rbf, sigmoid, linear, poly kernels (top); auto and scale for gamma hyperparameter (bottom).
Figure A5.
Mean of the maximum of INV→RFR→ DNV models (a,b) and INV→SVMR→DNV models (c,d) trained on R45 data and tested on R411 (a,c) and trained on R411 data and tested on R45 data (b,d).
Figure A5.
Mean of the maximum of INV→RFR→ DNV models (a,b) and INV→SVMR→DNV models (c,d) trained on R45 data and tested on R411 (a,c) and trained on R411 data and tested on R45 data (b,d).
Figure A6.
Scatter plot of CIN (cubic root of incoming bee motions) and quadratic regression for hive R411 for July.
Figure A6.
Scatter plot of CIN (cubic root of incoming bee motions) and quadratic regression for hive R411 for July.
Figure A7.
RFR and SVMR model transfer maximum score curves; RFR—(top row); SVMR—(bottom row); (left column): training on R45 data, testing on R411 data; (right column): training on R411 data, testing on R45 data.
Figure A7.
RFR and SVMR model transfer maximum score curves; RFR—(top row); SVMR—(bottom row); (left column): training on R45 data, testing on R411 data; (right column): training on R411 data, testing on R45 data.