1. Introduction
As [
1] points out, the significance of the management and administration of higher education institutions is so relevant that it conditions aspects of life, such as lifestyle and parenting patterns, worldviews, mindsets, aspirations, and motivation, and it is a challenge to meet the expectations of the new generation of students and teachers [
2]. On the other hand, the role of universities in the precarization of professional work and the precarization of academics is discussed by Reference [
3]. Within the space of management and administration of higher education institutions, there is no one-size-fits-all approach, as there are countless factors that could change the governance model and, thus, the processes that shape the faculty [
4]. In this paper, we center on the analysis of the Spanish academic staff workload in order to shine light on their situation and help policymakers make important decisions.
Under Spanish law, the working time of academic staff is restricted to 37.5 h per week for all full-time staff, since many of them are government employees. The number of hours worked by university staff in the U.S. is much higher than the hours stipulated in the agreement [
5]. Do Spanish academic staff work more hours than those stipulated, as happens in the U.S., or is the myth that Spanish government employees work few hours true? Spanish academic staff do not record their working hours on a daily basis as other government employees do, so there are no official statistics. No previous study has addressed this issue in Spain.
There is no previous detailed time allocation workload knowledge study in Spanish universities. How academic staff spend their time has major implications for the faculty, students and their institutions. The distribution of the number of hours worked depends on many factors, such as gender [
6], staff category, type of institution, etc. [
7]. O’Meara et al. [
7] conclude that women spend more time on teaching-related activities, and men spend more time on research in research universities. This is also the conclusion reached by Reference [
8] for Spanish scientists, but no data about this are shown, i.e., we do not know how many hours are spent on each activity, neither do we know the difference between men and women in this respect.
Housework is an academic issue [
9,
10]. The traditional lack of co-responsibility of men in domestic tasks and caring for other people facilitates the possibility of success in this environment, as more time can be dedicated to other activities. In Spain, there are some laws on public institutions’ obligation to implement egalitarian policies. However, in practice, there is a lack of compliance with many points of the law [
11]. Universities maintain inequalities in the field of university work [
12,
13]. The tendency to divide and focus on different tasks maintains the roles and routines of interaction within universities [
14,
15,
16,
17], which are gendered bureaucratic structures [
18]. According to References [
19,
20,
21,
22], the greater involvement of women in teaching and university services could be a way of intervening in gender policy programs. All these gender inequalities give rise to a gender wage gap in Spanish universities [
23]. Ref. [
23] conclude that the gender pay gap is more intense in the case of mothers, and the gap is not the result of a difference in merits between academic women and men, but rather of the operating structure of the universities and the general social structure.
Today, legislative changes are planned for Spanish universities, among other things in terms of professional categories. In order to make important decisions, policymakers must know the present workload and how it is distributed according to different factors, such as gender, age, or the current professional categories. One of the papers that is most closely related to our work is that of Allen [
24], which studied workload differences by institution and ethnicity in the United States. However, the Spanish university system is completely different; for example, there are not different types of public universities in Spain or clear ethnic groups among Spanish university staff, except very few exceptions. Allen [
25] instead studied mainly workload differences by gender, which is more in line with our objective.
Our objective is to carry out an exploratory study of the time distribution of academic work, i.e., paid work and unpaid work (domestic and caring tasks), among Spanish academic staff. Having a snapshot of Spanish academic staff may be very useful when making the mandatory gender impact assessment for changing laws, policies, or programs. In this way, legislation and policies can be better designed or transformed to prevent a negative impact on gender equality and to proactively promote gender equality.
However, we find differences between our proposal and the work presented by Allen [
25] and others in studies of faculty workload and time use. Unlike Allen [
25], our work refers to Spanish universities rather than universities in the United States. Note also that the data in Allen [
25] are from more than 25 years ago (from the 1990s), so it is possible that things may have changed over time. Furthermore, many studies of faculty workload use cross-sectional national surveys that ask participants to estimate their time use in broad categories over the last year or semester. This is not an easy task since it requires an extensive memory exercise. We limited ourselves to asking about the activities of the last week, and we propose more detailed categories.
On the other hand, other works use different approaches to study gender differences in academics in various aspects, such as semi-structured interviews [
26] for time use, or even pictures [
27] for representing academic careers. Instead, we use data from a statistical survey with more detailed activities than those asked about in Allen [
25], as mentioned above. Although finer details can be found using a time-diary method, as in O’Meara et al. [
7], who record the activities in 5-min increments over 4 weeks, so many details can be excessive. In fact, O’Meara et al. [
7] only show the results of a few activities, but we do not see a complete picture of the situation. Furthermore, a time-diary method is very time-consuming for the respondents, which can make the results more biased, as may only be completed by the more involved participants. This is the reason why we decided to carry out a statistical survey that can be answered in 10 min.
Furthermore, our work is also original because of the statistical analysis carried out. We do not just show the mean, as is usual, but we also show the five-number summary, which is better for skewed distributions or those containing outliers, as is the case with this kind of data. In this way, we discover a better picture, which would remain hidden if only the mean was shown. Not only that, but we also carry out a multivariate statistical analysis beyond the typical disaggregation of the descriptive statistics by different factors. We use innovative statistical learning techniques (clustering techniques and archetypal analysis), which give us the academic staff workload profiles and to the best of our knowledge is the first time they have been applied to this topic.
2. Materials and Methods
We used the questionnaire created with Google forms that is available at
https://forms.gle/XCFHZqzYao19KdPz5 (accessed on 24 June 2021) and translated into English at
http://www3.uji.es/~epifanio/RESEARCH/Questions.pdf (accessed on 24 June 2021). It was administered as a web-based instrument. Before conducting the main survey, we did a small pilot study in the department of one of the authors in order to detect whether there are any possible difficulties with the meaning of some questions or completion of the form, etc. We asked them to inform us of any problem, but no difficulty was reported.
The questions follow a similar formulation to some of the questions included in the National Study of Post-secondary Faculty (
https://nces.ed.gov/surveys/nsopf/ (accessed on 24 June 2021)) used by Allen [
25], but questions are asked about time devoted to more activities. On the one hand, we are aware that raw measures, such as weekly time devoted to teaching, research, etc., cannot capture the qualitative aspects of academic work, but on the other hand, numbers can also give us a picture of what is happening and can be more clearly understood by policymakers.
We are also aware that the weekly schedule can vary a lot depending on the semester and even on the week. This is why we specified at the beginning of the questionnaire that people should refer to what they did in a standard week. We suggest considering what they did the previous week if that week was a standard week, i.e., without any unusual event. We use the whole week from Monday to Sunday as a timescale and not just weekdays, since it is well-known that many academics also work during weekends. We also consider weeks in the middle of a semester for launching the survey.
The first part of the questionnaire contains personal questions, while the second part, i.e., the numbered questions, covers questions devoted to estimating the time spent. The survey required the Professional Category; Academic disciplines; Gender; Age; and Number of children.
In Spain, there are different types of staff, including part-time staff and full-time staff. The part-time staff are called Personal Asociado (AS) and they also work outside the university. Their main tasks should only be teaching-related. Regarding the full-time staff, there are temporary and permanent staff. The temporary staff may or may not be doctors (the majority are), and they are called Personal Ayudante Doctor (AYD) or Personal Ayudante (AY), respectively. There is also a temporary figure called a Visitante (VI), which is not very common in the majority of universities. Some of the permanent staff have an employment contract, while others are government employees. Those who are permanent with a contract are called Personal Contratado Doctor (CD) (the majority), although there is a minority belonging to Personal Colaborador (CO), who are not usually doctors and whose contract results from a previous law. There are mainly two categories of government employees: Profesorado Titular de Universidad (TU) and Profesorado Catedrático de Universidad (CU), which would be equivalent to Associate Professor and Full Professor. CD could be seen as a similar category to TU. Furthermore, there are a few government employees, who are not usually doctors, called Profesorado Titular de Escuela Universitaria (TEU), a category that results from a previous law. Finally, some categories are mainly research figures: predoctoral fellows (PRE) and postdoctoral fellows (POST), or mainly teaching figures: staff who cover substitutions when academic staff are on leave, called Personal Sustituto (SU).
2.1. Data Collection
We would have liked to use a stratified sampling technique for randomly selecting academic staff, but we did not have access to the database of Spanish academic staff. In early 2019, we asked the Vice-Chancellors of several Spanish public universities for permission to send an email with the link to the questionnaire to all the academic staff of their universities. Only Jaume I University accepted it, as long as we followed its protocol. Therefore, we presented the request to the Research Ethics Committee at Jaume I University, and we received its approval in June 2019.
Given the refusal of most universities to allow us to send the questionnaire by email to all their academic staff, we looked for the public email contact of each Department of 10 Spanish universities. Therefore, we sent an email to the administrative staff of each department and kindly asked them to forward the questionnaire to the academic staff in their departments. This data collection occurred in March and April 2019, in the middle of the second semester of the 2018/2019 academic year. We cannot know the percentage of non-responses, since we do not know which administrative staff decided to forward the email to the academic staff of their department and which did not. Therefore, we do not know if non-responses are because they did not want to respond to the questionnaire or because they simply did not receive our invitation to answer the questionnaire.
The second data collection occurred in October 2019, in the middle of the first semester of the 2019/2020 academic year, when an email to all the academic staff of
Jaume I University (UJI) was sent by its Communication Service. In this email, the office of the Vice-Chancellor of Research and Transfer invited the academic staff to complete the questionnaire, which was approved by the Ethics Committee. Only one email was sent to the academic staff and no reminder was sent to answer. Two hundred and twenty-seven academics responded from a total of 1542 academic staff at
Jaume I University in the 2019/2020 academic year (
https://www.uji.es/institucional/uji/presentacio/xifres/ (accessed on 24 June 2021)), i.e., 15% replied. Note that our response rate of 15% is much higher than the 2% response rate reported by O’Meara et al. [
7], where they invited 6438 faculty staff, but only 111 participated. In fact, our response rate is 22% if we only consider the full-time teaching staff of UJI (AY, AYD, CD, CU, TU, TEU).
We joined the data from the first and second collection and the total number of responses obtained was 711. However, from the initial 711 responses, we discarded 8 because of missing or impossible answers. Therefore, the final number of participants was 703.
The demographics of the participants are generally comparable to nationwide statistics of Spanish academic staff [
28]. The participants consisted of 356 (51%) women and 347 (49%) men, while the Spanish female faculty represents 41%. Therefore, more women replied to our invitation.
Table 1 compares the percentage of participants and the population, according to professional category. However, we do not know the number of PRE, POST and SU personnel in Spanish academic staff, so the percentages are computed without considering those categories or the ‘Other’ category. The percentage is quite similar except for AS, which has the greatest discrepancy. The percentage of AS in the sample is smaller than in the population. Remember that AS are part-time personnel whose main job is outside the university, so some of them may not be very willing to answer university surveys. On the contrary, the group that compensates for the lower response from AS is TU, which is the largest group of full-time teachers, with a percentage in the sample that is higher than in the population. AYD is also a group with a slightly higher percentage of responses in the sample. This could be explained by the fact that the UJI sample represents 32% of the sample, and the percentage of AYD at UJI is higher than in many Spanish universities. For the rest of the categories, the population and sample percentages are quite similar.
Table 2 compares the percentage of participants and the population, according to academic discipline. Note that they are very similar, except in Health Sciences and Engineering and Architecture. It is not surprising that the percentage of academic staff in Health Sciences in the sample is lower than in the population since the percentage of AS in this branch is the highest due to Health Sciences AS personnel, and the part-time staff responded less than the full-time staff. Note also that the lowest percentage of AS occurs in Engineering and Architecture. For example, at UJI, AS personnel account for 72% of the academic staff in Health Science, 46% in Arts and Humanities, 41% in Social and Legal Sciences, and 19% in Engineering and Architecture [
29].
2.2. Limitations
We have previously mentioned the obstacles we encountered when collecting the data. Therefore, the main limitation to this research is the selection of participants, which was not randomized. Participants were self-selected, so it is a voluntary response sample. This can produce bias. Therefore, we cannot use inferential statistics. Our study is strictly exploratory and we only use descriptive statistics.
Note that our questionnaire is not about a controversial topic, such as substance abuse or sexual behavior, where people may want to hide bad behaviors, and the impact of non-response bias could be really important [
30]. Although the response rate of our survey is low (22% in
Jaume I University for full-time teaching staff), according to O’Meara et al. [
7], faculty responses to national surveys are typically around 25–35%, so it is within the usual range and is higher than the 2% response rate reported by O’Meara et al. [
7]. Nevertheless, non-response bias is not determined by the response rate, but by the reasons for non-participation. A low response rate does not necessarily indicate non-response bias [
31]. In fact, Reference [
32] assessed response bias in a web survey at a university faculty and suggests that response bias was undetectable.
Several works have studied non-response bias [
33], and three approaches are considered. One of them is the comparison of the demographics of respondents versus non-respondents, which we have done in the previous section, and we have seen that they were not very different. Another approach is comparing the responses of participants who answered different waves (“reminders”) of the survey, but this option was not possible in our case. And the third approach is about the utility of subjective estimates of non-response bias. In our case, faculty staff with a high workload may be more willing to participate. But, in any case, this allows us to gather information to detect problems and patterns. The exact extent of the problems identified can be further determined with random samples in future research.
Despite the limitations, this study offers a unique portrait of Spanish faculty workload and a way to expose their problems and the different profiles, with a detailed time allocation workload that is not presented in other more meticulous works based on structured observations [
34] and interviews [
35], where samples are not random and the sample size is very small (up to 30 voluntary participants). Thus, this work provides a novel and important contribution to the way the academic workload is analyzed.
2.3. Data Cleansing and Preparation for Machine Learning
We cleaned up the data in order to correct or remove inaccurate data. As mentioned above, five individuals were completely removed since their answers to the questions were either impossible or extremely incomplete. When checking the accuracy of the data, we also detected some corrupt data and impossible answers for certain questions. In some cases, they could be corrected. For example, a few subjects answered the weekly hours of sleep instead of the average daily hours of sleep. In other cases, we were not be able to guess the reason for the misleading data, so the corrupt data point was removed, and we treated it as a missing data point. Categorical answers to numerical questions were also treated as missing data. For instance, a few people answered ‘a lot of hours’ instead of a particular number of hours. After this cleaning process, the maximum number of missing data in one question was 20 out of 703. In total, there were 107 missing data in the whole data set, which only represents 0.8% of the data set, i.e., the percentage of missing values was very low.
Although we could simply delete the complete case, this way of handling missing data has many disadvantage since a great deal of useful information is discarded. Therefore, in order to carry out the statistical analysis, and especially the multivariate statistical analysis, we use an advanced way of handling missing data. We consider imputation. We apply the
aregImpute function in the R package
Hmisc [
36], which uses additive regression, bootstrapping, and predictive mean matching to carry out the imputation. In this way, the missing data are replaced by an estimated value, which is computed by a prediction based on the observed values of that variable and the other variables.
3. Results and Discussion
In this section, descriptive statistics disaggregated by different factors are presented and discussed. Besides the classical mean and standard deviation (Sd), we also show the five-number summary, which is better for skewed distributions or those containing outliers, as is the case with this kind of data. In other words, we also show the percentiles 0% (the minimum value), 25% (first quartile), 50% (the median), 75% (the third quartile), and 100% (the maximum value), in addition to the interquartile range (IQR), which is a more robust measure of variability than Sd. We also show the number of individuals in each cell in the ‘Count’ column.
Table 3 shows the distribution of hours per week of academic work by gender, i.e., it collects the addition of the hours from questions 1 to 8 of the questionnaire for full-time staff, i.e., without AS. The ‘Global’ row contains the summary statistics of both men and women. Note that the average number of weekly working hours of academic staff in the sample is 49.0 h, 11.5 h more than the stipulated 37.5 h (which is equivalent to 7.5 h per day). These 11.5 h represent more than one and a half extra days of work per week. We have computed the quantile corresponding to 37.5 h and 80% of the academic staff in the sample work more than 37.5 h. There is a high degree of variability in the distribution, with a difference of 17 h between the 75% of staff that work the most hours and the 25% that work the fewest hours, although the latter work 40 h. Furthermore, 50% of the sample work more than 47 h, i.e., more than one extra day of work per week, while 25% of the sample work more than 57 h per week, i.e., more than two extra days of work per week. More than 20% of the staff in the sample work more than 60 h per week, i.e., more than three extra days per week. In regard to gender, women work 3.4 h per week on average more than men.
Table 4 shows the distribution of workload by gender and age for full-time staff. For the range of 65 to 70 years, the average hours per week of academic work is clearly higher in men in the sample, although the sample size for this age range is very small. However, the average hours per week of academic work is higher for women in the range from 24 to 35 years and from 45 to 65 years. In the range from 35 to 45 years, the average hours per week of academic work are nearly identical in both genders. Note that the range from 35 to 45 years coincides with the maternity and parenting period, since the most common age for women to become mothers in Spain is at 34 years (
https://www.ine.es/jaxi/Datos.htm?path=/t20/e301/provi/l0/&file=01001.px#!tabs-tabla (accessed on 24 June 2021)). In fact, there are more mothers who have children at the age of 40 than at 25 in Spain. After the parenting period, women from 50 to 65 years work on average more than men in the same age range, nearly one day more per week.
Table 5 shows the distribution of workload by gender and category. The staff category with the smallest mean workload is AS, since it is a part-time job. For AS, the mean number of hours per week of academic work is 29.96 h, approaching the 37.5 h stipulated for full-time staff. This indicates that overall AS personnel dedicate a lot of time to work, since they should have another job outside the university. If this other work is a full-time job of 37.5 h per week, the total time devoted to working would be 37.5 + 29.96 = 67.5 h per week. The categories with the next lowest mean workload are the temporary figures of doctors with no clear expectations of remaining at those universities: VI with 36.5 and POST with 41.2 mean hours per week, respectively. The next figures are TEU and TU personnel. However, the mean male workload in TEU (42) and TU (46) is lower than for females (47 and 49, respectively). The same pattern is found in CD: the mean male workload is 49, while it is 52 for females. The staff categories with the highest mean workload are AY (64), SU (53), CU (53), AYD (52), and PRE (51). It is to be expected that AY and AYD staff work many hours, since they are temporary figures and have to gain recognition to achieve promotion within the university. This is also the case with PRE who are working on their thesis. However, the mean workload of CU is higher than for AYD and PRE, although there is a difference in mean workload between male CU (48) and female CU (55). Note also the high mean workload of SU.
We detected in
Table 3 that women work on average more than men in our sample. Let us analyze time distribution activity by activity for full-time staff. The results are shown in
Table 6. The last two questions that appear in
Table 6, are not activities but the ECTS credits assigned or reduced for management tasks. The average number of teaching hours is similar by gender, as is the number of ECTS credits assigned and reduced. However, the average hours per week devoted to student supervision, tutoring, administrative tasks, post-class tasks and preparing classes are higher for women, especially for the last two tasks. This could be the reason why female staff obtain better teaching results than their male counterparts [
37]. Nevertheless, the criteria of the Spanish system for promotion in regard to teaching are mainly based on the number of teaching hours, while the quality of teaching is largely ignored [
11]. Our results show that the current criteria could be revised to take into account the tasks carried out by women. On the other hand, the average hours per week devoted to research and transfer tasks is higher for men, although not so much. The difference is greater for pre- and post-class tasks. Remember that these differences are on a weekly scale, so the differences are amplified when we consider the whole year.
Knowing the teaching workload is very important, since staff are assigned a certain number of ECTS credits each year. If we add up all the teaching-related tasks, we observe that the female staff in our sample dedicate on average 29.77 h per week and the male staff 25.66 h per week (a difference of 4.11 h per week). Remember that the legal working week consists of 37.5 h/week, so, if teaching duties imply so many hours, there are not many hours left for research, transfer, and administrative tasks.
It is accepted that one ECTS credit represents approximately 25 to 30 h per year of student workload, but how much does one ECTS credit represent for teachers? We can broadly estimate this as follows. Teaching takes place within 30 weeks per academic year, although this is an underestimation since there are also teaching duties during the exam periods. Therefore, we can (under)estimate the total number of teaching hour per course as 29.77 × 30 = 893.1 for female staff and 25.66 × 30 = 769.8 h for male staff. Then, we divide this quantity by 19 ECTS credits assigned on average. This means that 1 ECTS credit represents an average of 47.0 h/year for female staff and 40.5 h/year for male staff, i.e., approximately 43.8 h/year.
Nevertheless, the assigned ECTS credits depend mainly on the staff category. These are summarized in
Table 7. TEU is the staff category with the highest average number of ECTS credits assigned, while the second highest category is AYD. However, TEU staff do not have research duties, so AYD is the highest category with research duties. Although AYD is the natural beginner category in the Spanish university, it has the highest number of credits assigned on average in our sample. In other countries, such as France, beginners have fewer teaching responsibilities at first, which is quite sensible, instead of burdening beginners with teaching when they must also gain recognition for research to achieve promotion.
We have also analyzed the age by staff category. The results are shown in
Table 8. In all the stable (AY, AYD, CU, CD, TU, TEU) categories, the average age for females is higher than for male staff, so it seems that female staff enter later and/or remain for longer in different stable categories before being promoted to the next category in the system. It is also significant that 39 years is the average age for AYD, the beginners in the Spanish university system. Therefore, people are entering the university when they are not so young, and these people, at almost 40 years old, do not have a permanent position. Remember that AYD is a temporary staff category.
Let us now analyze the questions related to care, domestic chores and leisure time.
Table 9 displays the results. The average hours per week devoted to caring for people is 6.5 h higher for female staff than for male staff in our sample. If we focus on the 25% of people who devoted the most hours to care, the difference increases up to 12.25 h per week. The weekly average spent on domestic chores is also higher for women than for men (3 h per week). On the contrary, the weekly average for leisure activities is higher for men than for women (2.3 h).
Table 10 shows the summarized statistics for care of people according to age and gender. The highest average is attained for female staff between 35 and 40 years, which coincides with maternity and parenting. In this case, the difference between women and men is 30 h per week. This gap continues in the following age ranges, although it is not so large. The distribution of this variable is very skewed, since 32% of the sample do not devote any time to caring for other people. Therefore, for this variable, it is better to look for the median. The difference between the median of women and men is 30 h/week for 35–40 years, 11 h/week for 40–45 years, 9 h/week for 45–50 and 50–55 years, and around 5 h/week for the following age ranges. For ages below 35 years, the median is zero for both genders. It could be surprising that the average time devoted to care time for men aged 30–35 years is higher than for women of the same age range, but this is misleading. Six of the 24 men of this age have children, in fact, 4 have two children, but only 2 women of this age have one child.
Unsupervised Learning
Until now, we have limited ourselves to describing each variable disaggregated by different factors. However, for having a more complete picture of the profiles in the Spanish university staff, we can carry out a multivariate analysis taking into account all the variables jointly. Therefore, we apply several unsupervised statistical learning techniques in this section.
Firstly, a cluster analysis is applied, in particular the classical
k-means [
38] (Ch. 14). This analysis allows us to segment the data and see which groups are present in the data. We consider four features or blocks for carrying out the analysis. These blocks are built from the original variables, according to the type of activity performed, and are the following ones: Teaching is Block 1 and includes the questions from 1 to 5; Block 2: Research and transfer, with questions 7 and 8; Block 3: Management (question 6); Block 4: Care and home (questions 9 and 11). Then, each block is standardized (the mean is subtracted and the results are divided by the standard deviation), in order that the blocks have the same weight. When data is standardized the mean corresponds to the zero value, and if the distribution is Normal, 95% of the values are between −2 to 2.
In order to determine the number of clusters, we use the elbow or kink criterion [
38] (Ch. 14), and five clusters are selected.
k-means returns the centroids of each group.
Table 11 shows the standardized values of the centroids, which can be as the representantive point of each cluster. It is also returns the membership of each point to a unique cluster. The column ‘Count’ contains the number of subjects assigned automatically by
k-means to each cluster.
We notice that cluster number 1, which is the most numerous, has low scores in all the blocks, i.e., it represents an academic that works less than the mean in Teaching, Research, Management and Care, especially in Research and Care. The other clusters also have low or medium scores in all the blocks, except in one of them, i.e., it highlights in one of the blocks. The block with high scores in groups 2 to 5 is different. The centroid of cluster 2 represents an academic with medium values in Research, Management and Care, but with a high workload in Teaching. The centroid 3 represents an academic with mean values in all the blocks, except in Management, where her/his workload is high. The centroid 4 represents an academic with medium-low values in all the blocks, except in Research, with high workload in this area. Finally, centroid 5 represents an academic with medium values in all blocks, except in Care, where he/she devotes quite a bit more time than the mean.
Table 12 includes the mean values for the members of each group for different variables. As before, Q indicates the question of the survey. Note that the mean number of children in cluster 5, which focuses on care, is 1.87, which is the highest of all the groups. Cluster 4, which focuses on Research, has the highest mean in leisure time and the lowest in care, and the lowest teaching duties (Q1). These teaching duties are more or less similar to the individuals in cluster 3, which is centered on administrative tasks. The highest teaching duties are for individuals in group 2; moreover, they devoted a lot of time to preparing classes. Nevertheless, this group also has a high average in Research, so its total academic workload is very high, nearly 63 h/week, the highest academic workload of all the groups. Cluster 3 has the second highest academic workload, with 56 h/week. The third highest academic workload corresponds with cluster 4, with 52 h/week. Academic workload for group 5 is 38 h/week, while, for group 1, it is 34 h/week. Staff in group 1 is who devotes less hours to domestic work. Group 1 is the most numerous, and it represents 38% of the sample. Note that having an average academic workload of 34 h/week does not imply that they are not fulfilling their obligations of working 37.5 h/week, since we are also considering part-time staff, such as AS, which are a very numerous category, and they should not work 37.5 h/week at university, therefore decreasing the average.
Table 13 shows the distribution by gender in the different clusters. The first and fourth groups mainly constituted men, whereas groups 2, 3, and, especially, 5, constituted mostly women. Remember that group 5 is focused on care. Cluster 1 was the group with lowest workload, and cluster 2 was the group with highest workload. Group 3 for management tasks and group 4 focused on research are more equally distributed between genders.
Table 14 shows the distribution in the different clusters according to labor category and gender. Let us discuss the results for the main categories. For AYD, group 2 is the majority for women and group 1 for men. The rest of individuals are divided between group 2 and 4 for men, but group 1, 4, and 5 for women. As they are beginners, the percentage of group 3, for management tasks is low, although is higher for men. PRE are mostly in group 4, devoted to research. The profiles in CU for men and women are different. In both cases, group 4 is the majority. However, in CU men, group 1 is very important. In CU women, group 5, and also group 3, is higher, i.e., again, caring is also more present in CU women, and CU women are more involved in management tasks than CU men. In CD, the majority group is 1, but with very different percentages between men and women. In TU, the majority group is 1 for men, but not for women. For CD and TU men, it is above 40%, but for women it is below 26%. An important part of TU and CD women belong to group 5, and also to group 2, more than TU and CD men. For group 3, the membership is similar between CD and TU men and women, but it is a bit higher for women. For CD and TU men, the membership to group 4 is a bit higher than for women. In the case of AS, the membership in group 1 is the highest, especially in AS men. It is also the case with TEU, although, in this case, it is higher for TEU women. In regard to SU, the highest membership in SU men is in group 1, but, in group 2, it is SU women.
Besides cluster analysis, we have also applied archetype analysis and self-organizing maps (SOM) obtaining similar results.
4. Conclusions
We have analyzed the workload of 703 Spanish academic staff. As novelty, we have used data science tools. We have used five number summary, besides the classical statistical summaries, which are more appropriate for skewed distributions. Furthermore, we have applied unsupervised multivariate statistical learning techniques, such as cluster analysis.
We have commented the limitations, which are mainly due to the selection of the sample, but this study has allowed us to the detect problems and patterns. The exact extent of them can be further determined with random samples in future research. The survey does not examine subjective processes that take place within academic departments, such as access to informal support networks, mentorship, etc., but it focus on measuring the workload in different activities during the period of classes. In future work, activities during the period of exams can be also measured.
The findings can be useful for policy-making, especially now that there are planned legal modifications in the Spanish university environment. Note that workload allocations have an impact on academic satisfaction and working life [
39], especially for women [
40].
Some of the most important findings are the followings. Eighty percent of the academic staff in the sample works more than 37.5 h. Fifty percent of the sample works more than 47 h, i.e., more than one day extra work per week.This could suggest that staff can be overloaded. In regard to gender, women work more hours per week on average than men. The labor figures with highest mean workload are AY, AYD, SU, CU, and PRE. Although AYD is the natural beginner figure in the Spanish university, it has the highest number of credits assigned on average in our sample for labor figures with research duties. This could suggest that resources are not allocated correctly, since beginners should not be burdened with teaching. Moreover, 39 years is the average age for AYD, so the beginners in the Spanish university system are not too young. This could be a signal of a problem in the Spanish university system.
Although the teaching duties (ECTS credits assigned) for women and men in our sample are nearly identical on average, women dedicate on average more time to tutoring, preparing classes, post-class tasks, and student supervision, which we could consider as improving the teaching quality. In fact, the difference is 4 h/week. Therefore, 1 ECTS credit represents on average 47.0 h/year per female staff and 40.5 h/year per male staff. However, teaching quality is not nearly taking into account in the national system of evaluation for promoting.
The average weekly hours devoted to care of people and domestic work is 10 h per week higher for women. But, the difference is 30 h per week, on average, between women and men for care (without considering the domestic work) during the range from 35 to 40 years, the most frequent period of parenting. This is a clear signal that gender roles continue to be present in the academic staff.
Although we have found that on average men devote more weekly time to research, the difference is very small, 0.6 h per week for full-time staff. Therefore, it seems that the pattern in Spain does not coincide with that of American research universities [
7].
We have found five clear workload profiles in the Spanish academic staff. One with low workload compared to the rest of staff, although its average academic workload is 34 h/week; one with very high workload, especially in teaching; another one with high workload in management; another one with high workload in research; and one profile with high workload in care and domestic tasks.
In summary, the legal modifications in the Spanish university environment should address the key findings: (a) the beginners in the Spanish university system, AYD, are not so young and are also loaded with teaching; (b) the majority of the academic staff works non-recognized overtime, which would be a violation of Spanish labor law; (c) there is a gender gap about the time spend on quality of teaching and caring tasks. This would be also a violation of the Spanish equality law, since co-responsibility must be fostered by all the administrations, governments, and universities, and obviously they are failing in this task.
More information can be gathered from our data in the future. As data are available, other researchers can carry out further analysis and compare their results in other countries. Other future line of research would be to analyze the possible consequence of COVID-19 pandemic in the workload of academic staff and changes in the distribution of activities. According to other approaches [
41,
42], the more time being spent on caregiving responsibilities during the COVID-19 pandemic has dropped female scientific productivity. Therefore, policies should also address this gendered effect [
43].
The following abbreviations are used in this manuscript: