1. Introduction
While the European Union considers the integration of the Roma population to be of strategic importance and devotes considerable resources to this issue, data show that little has changed since 2005, and the Roma population is still on the margins of society. The European Council’s previous development priorities of education, employment, health and housing have been expanded to include equality, inclusion, and participation in the 2020–2030 funding period. In the area of education, three targets have been formulated: increasing participation in early childhood education and care, reducing the number of early school leavers, and ending segregated education in primary schools. Current figures show that 42% of Roma children attend pre-primary school compared to 92% in the general population; 44% of Roma students are in segregated education; and 28% of Roma young people have completed upper secondary education compared to 83.5% in the general population [
1]. Education, the attainment of at least upper secondary education, has an impact on other target areas, as a number of studies have shown the link between employment and health and welfare and early school leaving. In addition, the effects of early school leaving are also conspicuous in the areas of civic responsibility, politics, health, social sector, and employment, leading to lower economic growth rates, lower tax revenues, higher unemployment and welfare payments, and higher public health and criminal justice expenditures [
2].
Our study analyses early school leavers (more recently termed early leavers from education and training or ESL/ELET) in Hungary, focusing on the characteristics of young people who embrace their Roma identity. Our analysis attempts to fill the gap that the literature where early school leaving is not analysed as a combined explanation of several social background factors and is often explored only in ethnic terms. In our study, we use the 20–29 year olds subsamples of the Hungarian Youth 2016 and 2020 databases to characterise the highest completed educational attainment of Roma youth, compare the characteristics of Roma and non-Roma early school leavers, and examine the factors explaining early school leaving using logistic regression models.
There are numerous systematic analyses of the causes of dropout and early school leaving [
3,
4,
5,
6], which identifies individual, peer, family and school-level factors as the main causes of early school leaving. The analyses also highlight that early school leaving is a long process influenced by a combination of factors. Due to its complexity and the fact that it changes over time early school leaving is difficult to study, as static and bivariate analyses can easily fail to detect the interaction of factors and to highlight its dynamic nature [
7]. Some of the systematic analyses also suggest that some causes are stronger than others: absenteeism, poor academic achievement, peers, family structure, economic status, and emotional background have a greater impact [
5,
6]. In addition, it is also found that certain social groups (based on family background or ethnicity) are generally more affected than others.,
Intersectionality is a persistent situation in which multiple categories of interacting inequalities manifest as a new social category in which the causes of oppression cannot be separated [
8,
9]. Research on intersectionality approaches from different perspectives: they can be group-centred, process-centred, and system-centred [
10]. Some only describe the effects of belonging to multiple groups [
11], while the process approach highlights interactions and explores the underlying factors [
12], such as the relationship of power to groups and the choice of belonging to a particular group [
13]. The system-centred approach looks at intersectionality from a historical perspective and examines inequalities in their complexity [
14]. Others [
15] analysed the educational and labour market opportunities of young adults along three group characteristics (gender, migrant background, and social class). Stand [
16] examined the relationship between academic achievement, socioeconomic status and gender. He found that ethnicity, gender, and SES are not simply additively combined; they interact significantly, particularly ethnicity and SES, and ethnicity and gender. Cerna et al. [
17] identified six intersectional dimensions and described them as overarching factors of socioeconomic status and geographic location. Howard and Vajda [
18], examining the intersectional situation of Roma, find that the most persistent form of group-based disadvantage is linked to identities of origin (minority), which deepens other forms of inequality. In Hungary, social disadvantage and its complexity (low educational attainment, disadvantaged localities), belonging to the Gypsy/Roma community, and the associated negative social prejudice are the main intertwined categories [
19,
20]. The studies also point out that, due to intersectionality, educational issues related to disadvantage (region, type of locality, socioeconomic status) and Roma cannot be separated [
21,
22].
The average rate of early school leavers in the European Union has been steadily decreasing, falling below 10% in 2021 [
23]. In Hungary, the rate tended to fluctuate or stagnate in the 2000s, followed by an upward trend after 2010. Hungary moved above the EU average in 2013, the first year in which the early school leaving rate exceeded the EU average and has remained above it ever since. Hungary’s rate is 23rd among the 27 countries, followed by Bulgaria, Italy, Spain and Romania with 15.3%. (The biggest drop in ten years has been achieved in Portugal (17%) and Spain (13%), and the best rates are scored by Croatia, Slovenia and Greece.), It should be pointed out that the average rate of early school leavers is slightly higher for boys (11.4%) than for girls (7.9%), but girls have a more difficult time returning to education, as in their case, early childbearing is one of the main reasons for early school leaving. According to Eurostat data [
24], in 2017 the average proportion of mothers under 20 years of age at the birth of their first child was 3.7% in EU countries. The highest rates are in Hungary (8.5%), Bulgaria (12.1%), and Romania (12.5%). As a result, both Bulgaria and Romania have more early school leavers among women than men, and Hungary has almost the same gender ratio. Early school leaving also has regional characteristics. Eurostat also looks at how early school leavers are distributed by the level of urbanisation in different countries, distinguishing between cities, towns and deprived areas. The analysis finds that Hungary, Bulgaria and Romania show the largest differences between EU countries in the three categories and that rural areas in these three countries are the most affected by ESL. In Hungary, Central Hungary and Western Transdanubia are in the best position, with early school leaving rates below 10%. Northern Hungary is in the worst situation, with twice the rate of early school leavers compared to Central Hungary.
The Hungarian research [
25] also identifies factors similar to those in international studies, finding that school factors perform only a minor role in preventing dropout (no compensation) and that individual characteristics and family background are the determinant factors of dropout. It should also be highlighted that Hungary is consistently among the countries with the highest explanatory power of the SES index in academic achievement [
26]. In Hungary, there was only one large-sample longitudinal study consisting of six waves, which followed the school paths of students/young people over a longer period (2006–2012) and also examined the Roma identity. Data from the last wave of the study show that (1) 99% of students starting primary school successfully completed primary school, compared to 93% of Roma students; (2) 90% of students starting primary school successfully completed upper secondary school, compared to 50% of Roma students; (3) 75% of students starting secondary school successfully took the secondary school leaving exam, compared to 24% of Roma students [
27]. The authors also point out that in addition to the financial situation of the families, poverty, educational environment (cultural disadvantages), and social isolation also perform an important role in the dropout of Roma students, as Roma young people have fewer friends and peers who do well at school. It is important to note that this study was conducted at a time when the compulsory schooling age in public education was 18, and major school integration/inclusion developments were underway [
28], but in 2011, the compulsory schooling age was changed to 16 and the above-mentioned developments have also been disrupted. Research has shown that compulsory schooling up to the age of 18 has promoted schooling for children from lower social groups [
28]. Kende [
29] points out that although integration in pre-primary education has improved significantly in recent years, the situation of Roma in education is deteriorating. Educational gaps are widening, and the proportion of Roma who do not complete the different levels of education is very high. In addition, school segregation is increasing. Roma students face significant disadvantages in access to quality education. Segregated education limits young Roma people’s opportunities for further education and deprives them of inter-ethnic social networks, destroying their identity and self-esteem [
30]. Segregated education also has an impact on academic performance [
31]. The results of the large-sample studies were nuanced by research conducted in the context of three surveys that followed the schooling path of the same groups of students in segregated Roma communities over more than 20 years and its impact on their adult life. The life course interviews revealed that where support from teachers, schools, churches, NGOs, and family was available, early school leaving was lower and young people often had significantly higher educational attainment than their parents. Where this was not the case, young people repeated their parents’ failure to attend school, reproducing their disadvantaged social position. Here, again, the intersectional position of those included in the study was decisive: support was targeted to compensate for disadvantage, while barriers were rooted in social disadvantage and prejudice against Roma [
32].
The aim of the study is to examine the rates of early school leaving and its variation among young people based on data from a large sample panel study representative of the cohort’s characteristics and to analyse which factors are likely to explain early school leaving, focusing on the Roma identity.
RQ1: What socio-demographic differences can be found between Roma and non-Roma early school leavers?
RQ2: What demographic, social, geographical and economic factors increase the probability of early school leaving?
RQ3: What role does ethnic background perform in early school leaving?
2. Materials and Methods
Our secondary analysis relied on the 2016 and 2020 data from the database of the Hungarian Youth Survey, a panel survey conducted since 2000 with a sample size of 8000 respondents. The databases belong to the fifth and sixth waves of the survey, with four-year intervals between each wave. The Hungarian Youth Survey aims at a comprehensive and longitudinal analysis of the young population, so the omnibus questionnaire covers a wide range of areas (marital status, education, employment, leisure time use, etc.). The questionnaires (and the targeted areas) of each wave do not always correspond to each other, but block mapping education is included in all cases. The length of the fulfilling was about 45–50 min. The sensitive topics were mapped by a self-administered survey due to ethical issues.
The database is representative by sex, age, region, type of locality, and educational attainment, and covers the population of young Hungarians aged 15–29. The sampling was carried out in a multistage and stratified manner, with the first stage being the selection of localities (municipals were chosen in this stage according to its geographical disposition and the proportion of young people in them) and the secondary sampling frame being the young people in the localities (according to the data of Minister of Interior wia th simple random sample, N = 8000). In the case of fifth wave, the number of the target population was 1,701,837 people who were born between 1 January 1987 and 31 December 2001 [
33], and this number was 1,614,973 people in the sixth wave who were born between 1 January 1991 and 31 December 2005 [
34]. The interviews were conducted face-to-face by an interviewer. The database was weighted by gender, type of locality, educational attainment, and age group [
34]. Data analysis was conducted by SPSS 28.0 statistical software. The dataset arefreelye available for research issues.
All 8000 respondents answered the questions used in the analysis (some subsamples of 2000 respondents were used for certain sub-areas and question blocks). The used questions blocks by us belong to the part of the analysis which was fulfilled by every respondent (educational attainment, ethnic and socio-demographic background). Although the survey is longitudinal, the wave-to-wave variation in the ethnicity variable, one of the most important questions for us, did not allow for a more serious longitudinal analysis. The wording and the way the question was asked were the same in the 2016 and 2020 surveys (‘What ethnicity do you identify yourself as?’—a multiple-choice question with Roma/Gypsy response option), so the data from the last two waves were compared in terms of Roma educational attainment. However, the proportion of young people identifying themselves as Roma decreased between the two waves, which can probably be explained by external social factors influencing self-reporting (degree of prejudice, evolution of social distances, etc.) rather than by demographic reasons [
35]. The issue of the social embeddedness of the change in the proportion of self-identified Roma has also been explored in the international literature [
36]. In addition to the ethnicity variable, we also used the age variable (we used a sample of 20–29 year olds, as they already reflect the fact of dropping out; the current compulsory school age in Hungary is 16 years. The database divides young people into three cohorts (15–19, 20–24, and 25–29).
In addition, the analysis used background variables that are associated with dropout [
25]:
Sex (male and female);
Ethnic identity (self-reported);
Father’s highest completed education, mother’s highest completed education (here we used four-choice data: primary education or less, vocational education without secondary school final certificate, secondary school final certificate, degree, and in the regression model, we separated those with only primary education from those with at least vocational education);
Region (based on EU 2020 data [
30], we have separated the more disadvantaged regions (Southern Great Plain, Southern Transdanubia, Northern Great Plain, Northern Hungary) from the better-off regions (Central Transdanubia, Central Hungary, Western Transdanubia and Budapest);
Type of locality (with three categories: Budapest, city with county status, and county seat; smaller town; village);
Early childbearing (respondent had first child before the age of 19);
Previous cohabitation without marriage;
Subjective, self-assessed financial situation measured by five categories (living in deprivation; having financial problems from month to month; just making ends meet on the income; with careful budgeting, we get by without problems; no financial problems). In the regression model, negative (first two responses), neutral (third response), and positive (fourth and fifth responses) statements were separated. The question block on the subjective financial situation has been used continuously since the first wave of the research, i.e., since the 2000 survey [
34].
First, we examine the percentage of young people aged 20 or over with the highest educational attainment who identify themselves as Roma in the 2016 and 2020 datasets of the survey, and we try to uncover if there is displacement between the two waves or not and this displacement is statistically significant or not. Second, we compared the features of the Roma and non-Roma early school leavers along with the earlier mentioned independent variables, which may describe the specific intersectional life situation. Not only percentages but chi-square statistics were used by us during this step. We have paid special attention to the type of locality because the Roma people are overrepresented in the villages and pupils’s school achievement is strongly formed by the of the locality in Hungary [
37]. The values of adjusted residuals were analysed by us because we would like to explore the patterns of the given cells. Finally, we run two binary logistic regression models on a subsample of 20 year-olds or older, with the dependent variable being the dropout rate (0 = having at least vocational qualification, 1 = not having vocational qualification). The independent variables included sex (0 = female, 1 = male), mother’s level of education (0 = higher than primary school, 1 = primary school or less), father’s level of education (0 = more than primary school, 1 = primary school or less), childbearing under 19 years of age (0 = no, 1 = yes), previous cohabitation without marriage (0 = no, 1 = yes), type of locality (dummy coded, the reference category is Budapest), regions (0 = favourable regions, 1 = disadvantaged regions), subjective economic situation (the reference category is a merge of the two categories better than neutral), and ethnicity (0 = not Roma, 1 = Roma).
Table 1 shows the subsample sizes, and
Table 2 contains the distribution of socio-demographic variables.
The proportion of men in both samples is around 51%, and there are also slight changes in the age structure. As already indicated, the proportion of Roma was 4.1% and 3.7%, respectively. The share of those with primary education shows a slight decrease (from 30.6% to 27.9%), and the most common educational attainment is secondary school and leaving certificate or vocational qualification requiring SSLC (38.4% and 42.4%, respectively). There is a slight increase in the proportion of degree holders (12.2% and 12.9%), but it is still significantly below the EU average. In 2020, around 41% of 25–34/year/olds had ISCED 5–8 qualification, compared to around 33% in Hungary [
38]. In 2020, 11.2% of fathers and 12.4% of mothers had not completed upper secondary education, higher than the EU average (reference) but better than in 2016. The rate of early childbearing is low (2.4% and 1.3%, respectively), while cohabitation is higher (10.2% and 9.6%). The largest proportion of the sample live in smaller towns or villages, and the rate of those living in disadvantaged regions is slightly higher. The most common response to the subjective assessment of the financial situation is “with careful budgeting we get by without problems” (43% and 56.9%), and there is a positive shift between the two waves of self-reporting.
4. Discussion
Our study examined the reasons for early school leaving in two waves (2016 and 2020) of a large sample panel survey focusing on Roma identity. The study yielded several new findings.
(1) There are few comprehensive, cohort-representative studies that provide data on the Roma population. Rostas [
39] highlights the biases associated with censuses or other estimates used to prepare policy decisions. Ethnic identity is a social construct that is formed through interactions and may change from time to time depending on the social situation and the strengthening or weakening of prejudices. Someone may be less likely to self-identify as a Gypsy if they fear discrimination or more likely to embrace it if they are proud of their identity or if it may benefit them. According to our analysis, 4.1% of young people in 2016 and 3.7% in 2020 identified themselves as Roma, meaning that the proportion dropped slightly between the two survey years. It is likely that this is due to changes in the social situation.
(2) Data is scarce not only on Roma identity but also on the proportion of Roma early leavers from education and training. Our results show that the ESL/ELET rate is 65% in both years studied, which is between six and eight times higher than for non-Roma youth (as the proportion of non-Roma youth with completed upper secondary education increased somewhat in the two years studied). It is to be noted that this rate is higher than the figure reported by the European Council mentioned in the introduction [
1], where on average, 28% of Roma young people complete upper secondary education. However, it should also be noted that the vast majority of the Roma in our study have vocational qualifications (without SSLC), which have limited value for the labour market, employability, further education, and lifelong learning.
(3) Another finding to be highlighted is that there is a significant difference between Roma and non-Roma youth in terms of parental educational attainment and subjective financial situation at both survey points. Parents of Roma young people with an education of eight grades or less are still in the majority and their financial circumstances are characterised by deprivation or day-to-day living. However, it should also be noted here that other longitudinal studies have shown that Roma identity is less admitted by more successful Roma young people, and therefore ethnicity and poverty are linked [
35].
(4) Risk factors for early school leaving are student-related, community-related, family-related and school-related [
3,
4,
5,
6,
7,
40]. In our study, we analysed student demographic and socio-economic and geographical characteristics. Among the factors most likely to lead to early school leaving, low parental education, poor financial situation, and childbearing appear to be the most prominent. The effect of the latter weakened slightly between the two study periods. The educational attainment of the mother was more important than that of the father, and its effect even increased between the two study years. For us, it is of paramount importance that the explanatory power of parental educational attainment exceeds that of Roma identity. Examining the differences between the two models, we see that the explanatory power of Roma identity and geographical location (village and rural area) also increased between 2016 and 2020. The latter could be an indication of increasing spatial disparities in education. In the 2020 sample, Roma living in smaller towns have higher dropout rates, which warrants further investigation. Compared to 2016, the probabilistic power of Roma identity increased the most in the 2020 model. These data suggest that the intersectionality of socioeconomic, geographical disadvantage and Roma group membership is becoming increasingly dominant. In contrast, the ‘classic’ intersectionality factor (minority group female) did not perform a role. No differences by gender were found in the Roma/non-Roma comparison or in the factors that make dropout more likely.
Our analysis has shown that there are complex reasons for early school leaving, and in this explanatory framework, Roma identity is only one—and not the strongest—factor. However, the effects that emerge point to the reproductive power of inherited status, geography and economic situation, complemented by certain elements of relationship behaviour, also embedded in inequalities. Our first research question can therefore be answered by this complex explanatory framework. Our second research question concerned the role of ethnic background in early school leaving. Based on our analysis, we can answer that Roma identity had a moderately strong explanatory power, i.e., that not only economic, educational or geographical factors perform a role in the reproduction of low educational attainment.
In their qualitative study classifying early school leavers, Ref. [
41] found that coming from a more privileged social background does not necessarily lead to successful reintegration into education but that young people with few resources in their family background are particularly disadvantaged in terms of education and career. In addition, young people face stigmatising situations not only during dropout but also afterwards, which also makes their employability more difficult. It is likely that for Roma young people, this disadvantage is compounded, and stigma is reinforced, not only because of the resources of their family background, but also because of the social prejudice surrounding their ethnicity. Kende and Szalai found [
42] that Roma students’ school failure has structural and institutional roots, which our research confirms.