2. State of the Art: Teaching Mathematics in Bilingual Contexts
If we based our study on more classical criteria [
3], we should say that bilingualism is limited to the use of two languages as if both of them were native languages. We believe, however, that this can leave aside many situations and contexts that could also be considered as bilingual. We see Planas’s view [
4] as more appropriate when considering the bilingual as a speaker who masters one language and is capable of constructing meanings in a second language.
According to Cummins [
5] there are many studies conducted in situations of additive bilingualism that show many positive effects. As Cummins states, there is “...a critical threshold level of L1 and L2 ability that must be attained before the positive effects of bilingualism can be observed” [
6], p. 341. Nevertheless, mastering both languages (L1 or first language or mother tongue, and L2 or second language) requires longer exposure time and the right contexts. According to De Houwer, “Both bilingual speaker’s language input history and the sociolinguistic settings to which he or she has been accustomed are elements pertaining to the environmental conditions in which bilingual language use takes place” [
7], p. 253. Following the author, we find that the researchers’ interest is given to “…real-life” language use rather than artificial, laboratory-induced language behavior, implying that it is highly important to consider all these environmental and contextual conditions when setting up data collection sessions.
The development of linguistic skills takes place because of the speaker’s language awareness itself and, above all, the characteristics of the speaker’s socialization contexts [
8]. It is a relevant fact to consider when learning mathematics, specifically acquiring those concepts in which communicating and interacting with others take a center stage to guarantee their understanding. The social element of interaction should be a priority in the learning of mathematics, whether it is a bilingual context or not, since through dialogical learning between peers, learning is equally or more enriching than any vertical teacher-student communication [
9].
Therefore, the language role in the classroom, and specifically the role of the language of instruction used, allows us to further expand the concept exposed at the beginning, since from the perspective of its pedagogical role, language is considered as a vehicle in the development of the learning objectives, far ahead of its social and pedagogically weak role in the description of reality and its diversity [
4,
10].
Barton [
11] states that “we bring mathematics into existence by talking about it, and the way we talk about it changes the questions we can ask” (p. 227). In order to understand the cognitive effect that occurs in second language teaching, we can start, as explained in
Figure 1, from the proficient command of both languages. We see that having a high linguistic level in both languages has positive effects, but what happens when we are at a lower level of language proficiency in bilingual contexts?
Taking into account that in the first years of elementary education, linguistic skills are low in both L1 and L2, this may be one of the reasons for the low impact on a cognitive level. The hypothesis of the linguistic interdependence proposed by Cummins [
12] claims that a child’s L2 command depends partly on the level of proficiency achieved in the L1 or mother tongue, that is, it is necessary to have the CUP (common underlying proficiency) well developed either in the first language or the additional language, or in both languages. According to Cummins [
13] it is due to the fact that for distant languages the transfer takes place mainly from personality and cognitive attributes, while for more related languages the linguistic elements are also transferred.
In multilingual contexts, children who are exposed to language input in two different languages tend to acquire the two syntactic structures when both of them are similar. However, when these structures are much more complex in one of the two languages, they will tend to acquire first the language whose syntax is easier [
14].
The improvement of linguistic competences and the development of mathematical concepts, therefore, lead us to value the concept of translanguaging, which, unlike code-switching and codemixing, is not simply a matter of speakers changing languages, but of constructing, and using complex and original interrelated discursive practices [
15]. In the vast majority of the cases, these situations take place in contexts where English coexists with other languages or is used for professional or educational purposes, “because English tends to be one of the languages in the multilingual speaker’s repertoire and one of the languages used in society” [
16], p. 301.
Quite often, it has been emphasized that the language of instruction is a powerful vehicle for mathematical thinking since people who speak two or more languages tend to carry out activities associated with memory, such as counting in the language that mathematics has been learned. In anecdotal cases, we can find that both languages coexist and are mixed up in these types of tasks. According to previous studies it has been demonstrated how bilingual students perform calculations more quickly and efficiently in their first language compared to a second language [
17,
18], although performance is higher if the person shows a higher language command in any of the languages [
19].
However, when it comes to with problem solving, the contrast is highly noticeable since people solve problems and cope with the situation more quickly and efficiently when these are presented in their dominant language [
17,
20,
21]. On one hand, as Dehaene [
22,
23] proposes, in order to retrieve arithmetic data in a second language, these would come from a translation process or from a less systematic learning of facts in the second language. On the other hand, McCloskey [
24] speaks about the transformation processes that can occur in problem solving. McCloskey’s hypothesis is based on the fact that the numbers and arithmetic data are represented regardless of the language, but in order to access these representations, the person must transform the problem into a representation to calculate an answer, which, in turn, must be transformed back into the language spoken. These two processes to decode and encode will be involved in the speed and performance to find the response.
It is also necessary for our study to consider the phenomenon of translanguaging [
25], a term coined by Cen Williams in Wales to refer to bilingualism where speakers alternate between language modes especially in teaching and learning [
26,
27,
28]). It has been demonstrated that this phenomenon is a natural process occurring in school contexts where two or more languages are used by students and teachers [
15].
As García [
26] puts it, it occurs when bilingual students and teachers engage in different types of practices including home language in order to communicate appropriate content knowledge and also work on academic language activities. Language practices are dynamic and fluid, and bilinguals select language resources from a repertoire to fit their communicative tasks.
For our study we have also based on the study of Li Wei [
28], who deals with translanguaging as a phenomenon occurring when going from and to different linguistic structures and systems in speaking, listening, reading, signing and remembering. According to Canagarajah [
27], multilingual speakers feed from all languages in their repertoire when they need to communicate in educational contexts, something that will lead us to base our analysis on students’ understanding and using an additional language when resolving verbal problems.
5. Materials and Methods
The sample consisted of 169 students in bilingual contexts belonging to 1st and 2nd grades of elementary education from two international schools. The sample was made up of 80 1st grade students (39 girls, mean age of 7.1 years and SD = 3.2 months years and 41 boys, mean age of 7.3 years and SD = 2.8 moths); and 89 2nd grade students (38 girls, mean age 8.2 years and SD = 2.3 moths, and 51 boys, mean age 8.2 years SD = 3.4 moths). It is a convenience sample where the number of participants equals the number of students who are part of the international school in 1st and 2nd grades in which the study was conducted. In these multicultural and multilingual schools, there are students from many different nationalities, i.e., Brazil, USA, Portugal, England and Italy, among others. One of the main reasons to carry out the study in these international schools is that they have some common characteristics, for instance, that students have had the same time of exposure to the second language in the school, mostly with native teachers. The methodology implemented in these centers follows the IB (International Baccalaureate) program, therefore we can find very few differences, being only related to the teacher’s personal classroom management characteristics. See
Table 1 for grade and sex distribution.
Once we obtained the authorization to carry out the research, a presentation letter of the study was sent to the management teams of all the schools involved, who subsequently informed the families after its approval by the management team. Taking into account that the participants were minors, they were sent an informed consent letter that their parent/guardian had to sign. All the data were obtained anonymously and treated confidentially.
All participants had been exposed to the same linguistic immersion in the second language, and they all spoke Spanish and English at different levels of proficiency. In both international centers the linguistic immersion program developed is one of the educational priorities and is implemented throughout the school years, from kindergarten to the last years of secondary education, implying a great linguistic and cultural variety. All through the kindergarten courses, the immersion program is fully implemented in English. In one of the schools, all subjects in elementary education are taught in English except for the Spanish language, whereas in the other school both Spanish language and mathematics are taught in Spanish. In other words, the only difference between the two schools is the language of instruction used to teach mathematics, a situation that made us form two distinct groups according to whether or not their mother tongue coincides with the language of instruction.
The tests provided were the following:
The Raven Colored Progressive Matrices test [
45]. We used it to assess whether there were significant IQ differences among the participants.
The second test used was the Perception of Differences Test [
46], which was intended for evaluating the attentional processes in the participants.
Finally, we administered individually to each participant 20 addition and subtraction word problems. In any case no more than twenty problems were provided per participant. The answer was recorded as correct when the student knew how to explain the answer. When the answers were given at random, these were considered invalid. The selection of problems was based on the TEDI-MATH scale test, but since they did not cover all types of problems for assessment, they were complemented with problems extracted from the research carried out by Bermejo et al. [
47], being sequenced in an increasing order of difficulty following the classification established in Bermejo, Lago y Rodríguez [
48] (see
Table 2):
This test was conducted in Spanish, the language of instruction used to teach mathematics, to those students who were taught mathematics in that language, and in English language to students who were taught in English. The problems were translated from Spanish into English following the translational procedure established for such a purpose.
During the school day we first handed out two collective tests (1 and 2) to the groups in their usual classrooms. We then assessed the students individually according to the language of instruction in their school where they are taught mathematics. Then students were assessed in a different room from their usual classroom. First grade students were evaluated first, and then the students who belonged to 2nd grade. We carried out the tests individually over two consecutive days counterbalancing the order of the tests (1 y 2) across groups. Each session lasted approximately 30 min, depending on each student’s performance.
6. Results
Data analysis was carried out using the SPSS (version 25.0, IBM Corporation, Armonk, New York, USA) statistical package. In
Table 3 is shown the number of participants who did the problems correctly, and those who did not solved them correctly.
In order to evaluate the possible differences between the use of the language of instruction versus the use of mother tongue, the Pearson chi-square hypothesis contrast test (Χ
2) was performed for each of the problems in both 1st and 2nd grades. When breaking down the results (see
Table 4) and emphasizing the statistically significant differences, they appeared with statistically higher values when there was a similarity between the native language and the language of instruction. However, the values were not higher when the mother tongue and the language used for instruction were different. On one hand, in 1st grade, there were statistically significant differences in the items 2 (Χ
2(1,N = 80) = 5657,
p = 0.017), 3 (Χ
2(1,N = 80) = 5134,
p = 0.023), 4 (Χ
2(1,N = 80) = 4579,
p = 0.032), 6 (Χ
2(1,N = 80) = 7050,
p = 0.008), 7 (Χ
2(1,N = 80) = 9183,
p = 0.002), 11 (Χ
2(1,N = 80) = 12.121,
p = 0.000), y 12 (Χ
2(1,N = 80) = 5805,
p = 0.016). On the other hand, in 2nd grade all these differences disappeared, leaving only that of item 11 (Χ
2(1,N = 89) = 5392,
p = 0.045), and significant differences appearing in the items 5 (Χ
2(1,N = 89) = 5392,
p = 0.020), and 19 (Χ
2(1,N = 89) = 3973,
p = 0.046).
In the same way, all 20 items were grouped according to three characteristics (see
Table 5): the type of problem (change, equalization, combination or comparison), location of the unknown (result, mean, beginning), and if it was an addition or subtraction problem.
The last objective was to determine if the language coincidence (the language of instruction was the same as the mother tongue) showed higher values compared to the situation when this coincidence did not occur. The differences between both groups denote better results when the languages coincide than when languages are different (see
Table 6).
In 1st grade groups we found statistically significant differences (see
Table 7) when solving exchange problems (Z
(48,N = 80) = 3849,
p = 0.000), particularly in those problems with medium unknown (Z
(1,N = 80) = 2625,
p = 0.009) and beginning unknown (Z
(1,N = 80) = 2706,
p = 0.007), and also both in additions (Z
(1,N = 80) = 2150,
p = 0.032) and subtractions (Z
(1,N = 80) = 2708,
p = 0.007). In 2nd grade groups, all these differences disappeared, and they only remained if they were additions (Z
(1,N = 89) = 2113,
p = 0.035). We also found differences when they were combination problems (Z
(1,N = 89) = 2530,
p = 0.011).
7. Discussion
Our main aim was to determine if the language of instruction influenced the resolution of verbal problems when it coincided or was different from the students’ mother tongue. In order to define the scope of our data, we divided the analysis into two large sections: (a) analysis of correct answers and (b) analysis of resolution procedures, i.e., the errors and proportion of correct answers.
From this moment onward, we used several labels to refer to the groups. The 1st grade group whose language of instruction did not coincide with their first language, that is, noncoincident language of instruction, was called G-I, whereas the group whose language does coincide, was called G-II. In the 2nd grade groups, the group with the noncoincident language of instruction was called G-III, whereas the group whose language coincided with the instruction language, was called G-IV.
(a) Analysis of correct answers.
The results showed that in the 1st grade groups there were significant differences only in the type of exchange problems. If we take into account this variable, we can observe that there was a higher percentage of correct answers in group G-II compared to G-I in all types of problems, but this difference was statistically significant only in exchange problems.
The situation where we found more differences in one of the problems with less difficulty such as exchange problems [
29,
38,
39,
40,
49,
50,
51,
52,
53,
54] may be due to the fact that students using their mother tongue were more capable of creating a prepositional base text which allowed them to be able to construct problem models, while the students who solved it in a nondominant language or noncoincident language of instruction, needed more time to construct these models. We observed that in all types of problems, students with the coincident language showed higher performance, and both in G-I and G-II, students’ efficiency when solving equalization and comparison problems was lower compared to those of change and combination types. Therefore, we can claim that there are only significant differences between the groups in exchange problems.
This tendency was maintained in the 2nd grade groups when problems had a simpler linguistic structure, such as in change and combination types. We observed that the G-III’s resolution was lower if compared to G-IV, being more significant in combination problems. However, these differences became shorter in equalization and comparison verbal problems.
Similarly, in both 1st and 2nd grades, we found that the most significant differences occurred in those problems whose semantic structure was simpler. This could be due to the fact that when the performance is higher the distances increase, but if the difficulty of the problems increases, students’ performance decreases and the differences in problem solving are not so important. In this sense, it is based on those theories supporting the idea that problems are encoded in memory under a verbal format [
23,
55,
56,
57], that is, when this encoding is carried out in a nondominant language, there tends to be greater difficulty in its recovery [
2,
58,
59]. For this reason, in problems whose semantic structure is simpler, the child encodes more easily in his dominant language than in his nondominant language, hence his performance is higher. However, with those problems whose linguistic structure is more complex, the coding is also much more complex. Therefore, it is always less difficult in the dominant language.
Accordingly, problem solving would not only involve verbal processes in data retrieval but also in the application of the resolution procedures [
60,
61,
62]. These authors highlight that the components of working memory are necessary to manage the successive steps of the complex resolution procedure and that these components could be at least partially verbal in nature, which would also explain a lower performance in students who solve these tasks in their nondominant language.
From our point of view, if more significant differences appeared in the 1st grade groups compared to the 2nd grade groups, it would result from the limitation of exposure time to this type of tasks, which had not allowed, for the moment, the construction of structures or models to solve the problems that help balance the difficulties they had at a linguistic level. Such difficulties could be derived from the fact that the linguistic structure presented provided a great semantic load and served as a support for the representation and the relationship between the concepts. This seemed to be reinforced when more specific and less efficient strategies were used since they had a medium level of linguistic performance [
62].
If we analyze the correct answers considering the location of the unknown, we can observe that in 1st graders, when the unknown was located in first or second position there were significant differences, highlighting that G-II students’ performance was higher compared to G-I. Similarly, when the unknown was located in third position, the G-II mean was again slightly higher, although no significant differences were found.
All students demonstrated higher performance when the unknown was in third position with respect to the first and second position, which generally confirmed the data found in other investigations [
29,
51,
63]. Therefore, we can point out that the problems that were more difficult to solve depending on the location of the unknown were also a relevant fact: the group whose dominant language coincided with the language of instruction obtained higher performance. Once again, we could be faced with a linguistic-based difficulty, as in order for the students to solve this type of problems they had to “translate” the problem into a simpler way by exchanging the place of the addends. In this process, both verbal and mathematical components were again involved, hence the lack of mastery in one of them could have an effect on the resolution.
In the 2nd grade groups the contrasts in the different positions of the unknown were not significant, although G-IV students presented higher means than the group that did not coincide in all positions.
As previously mentioned, all 2nd graders also showed higher performance when the unknown was located in the third position compared to the other two groups. In other words, regardless of the age group, the level of difficulty of the tasks decreased when the third term was unknown; however, it increased when the second term was unknown, and even with greater intensity if the first term was unknown.
If we compare the differences established between the 1st and 2nd grade groups with respect to the location of the unknown, we see that in the 2nd grade groups it is less noticeable, which reinforces the idea that showing students this type of tasks can allow them to construct models and strategies that compensate for their difficulties in language proficiency. The reason could be that the resolution models in a second language follow a similar structure to the resolution in the mother tongue concerning both the linguistic structure of the problem and the place of the unknown. In this sense, the problems of change and combination were better resolved in both groups, regardless of the language of instruction, than the problems of comparison and equalization. Likewise, in the problems where the unknown appeared in first place, the execution level was lower than when it appeared in the result.
(b) Analysis of resolution procedures.
First of all, we describe which were the most frequent errors in each group, and secondly, we give a broader and more global explanation. Our main aim is to determine if a certain type of error is associated with a certain problem, or if it depends on the location of the unknown, and if that error is corrected when the mother tongue coincides or not with the language of instruction.
One of the first conclusions that we can draw (see
Table 3) is that the different categories of problems cannot be associated with a specific type of error. This can be highlighted because we can find several prevailing errors in each group who worked on some of the problems sharing the same category. Neither can we establish a model based on the typical errors committed by pupils. However, as it has been pointed out in other investigations [
13,
40,
47,
50,
51,
52] errors such as “repeating one of the quantities” and “inventing the answer” are more frequent in 1st grade than in 2nd graders, who generate other types of errors such as “transforming the problem” and “keyword”. It must be highlighted that this last error is closely linked to the semantic component of the problem, and in both Spanish and English it has similarities “más que” or “more than”. For this reason, this error has repeatedly been shown in all groups mostly in the majority of problems of Comparison 1 and 4.
Transforming the problem is another type of error that tends to appear in 2nd grade groups. It also appears in 1st grade groups when the level of language comprehension is higher when solving problems in their dominant language. In these cases, students tend to make modifications to the problem in order to make it simpler. It generally occurs in those problems of which the unknown is not found in the result. This indicates that students show a greater understanding of the linguistic component within the problem. It, therefore, would not appear in G-I group whose linguistic proficiency is low and the time of exposure to this task has also been more limited.
It is relevant to note that execution errors have occurred in the simplest problems such as that of exchange, where the linguistic structure or the place of the unknown has changed the type of error. However, we can note that the error has been mostly conceptual.
In problems with a medium difficulty such as problems of change 2, we observe that in the 1st grade group (G-I) whose mother tongue does not coincide with the language of instruction, the most frequent error they made was not understanding the problem. However, students with a second dominant language, or 2nd grade groups who had already been exposed to this type of task, tended to transform the problem. This way, we can confirm that this is supporting the theory of creating models that allow linguistic difficulties to be overcome.
Most importantly, we can note that the possibility of committing errors diminished in all problems as students’ ages were higher, as was also demonstrated in other studies such as Carpenter and Moser [
53], and Bermejo [
64]. In this sense, we can see that the higher the students’ grade, the less significant the differences in errors committed. There were significant differences in the problems of change (2, 3, 4, 5, 6) and comparison 1 in 1st grade groups, with G-I being the group making a higher number of errors. However, group G-II made a higher number of errors in the change 3 problem, making up a large part of execution.
In the 2nd grade groups, there was a high decline of significant differences in errors, repeating the errors in the change 2 type and appearing in equalization 1 and change 4. In this case, G-III continued to commit a higher number of errors than G-IV.
Furthermore, it should be emphasized that, on many occasions the percentage of errors produced by G-II in the same problem was similar to the percentage of G-III. This suggests that, although there was only a difference of one school year between both groups, the linguistic proficiency of the verbal structure in 1st graders could make the problem execution more difficult. Therefore, the performance in problem solving by those groups whose language is dominant in both their mother tongue and the language of instruction, compared to those that is not, may be close to a lag of one school year.
8. Conclusions
The results of this study reassert the existing relationship between language and the verbal problem-solving tasks in mathematics. We can also see the effects that the second language has on these types of tasks as well as how that influence is evolving throughout the different grades.
It has been demonstrated that as students obtained a greater command of the language and had more exposure to solving problems in a second language, their performance improved, although it did not reach the same level as their resolution performance in their dominant language.
We could also observe how translanguaging took place on many occasions. Through their first language they were capable of resolving a verbal problem, in some cases by giving the answer in the dominant language, and in other cases by translating the problem into their dominant language to have a better understanding of it. Then the answer was translated back from Spanish into English to provide the answer. In this way, we observed that the change of code from one language to another was done using their mother tongue when a complicated linguistic or semantic structure was provided and providing the answer immediately in the second language or English (i.e., the language that was being used for the resolution of the problem).
Likewise, errors tended to decrease as the age of the students increased, although the difference between the dominant and nondominant language groups was close to one school year. It was also observed that the nature of the errors was modified, and although errors remained conceptual, on many occasions they denoted that there was a certain understanding of the semantic structure of the problem, which led the students to transform it or use “keywords”.
We highlight the importance of establishing strategies that would allow the child to overcome the linguistic difficulties that can occur in the first stages of learning a second language. In this way, we would also help students who show difficulties in their first language. We propose the use of images or pictorial representations that can help the students to make it by themselves; these materials can help them understand the situation and make easier the process of transferring the knowledge in the language of instruction. Concerning teaching mathematics, studies suggest that a low performance in multilingual students can be partly, and even totally, due to their language limitations. For instance, the existing performance differences disappeared between speakers of German as a first and second language when problems were presented symbolically rather than verbally [
65].
We must bear in mind that this study has been carried out in international schools specialized in the area of second language acquisition as well as in CLIL. However, in schools that do not reach such a degree of specialization in these two areas, it would be necessary, before implementing bilingual programs, to reflect on the availability of resources and the development of teacher training.
The language of instruction seems to determine its use in the processes involved in teaching in later grades and even in adult life [
56,
66,
67,
68], therefore it is essential to solve all difficulties and encourage its transfer and use at early ages. It should be taken into account in those schools that are implementing bilingual programs. In addition, teachers in bilingual contexts should be encouraged to be aware of both the problems associated with language and cultural awareness in a very sensitive way, which will allow them to value all the previous knowledge that the student brings to the classroom. Likewise, teachers should also provide students with equal opportunities and promote the exchange of ideas, avoiding any language difficulty so that all students can develop the same mathematical competence [
69].
Additionally, it would be interesting to carry out similar studies in immersion programs evaluating a higher number of participants, to study teachers’ perceptions towards the students’ linguistic difficulties and their attitudes in the classroom in order to favor all students’ inclusion, regardless of their linguistic competence. Therefore, if these studies were carried out in ordinary centers whose level of interculturality is lower, we could obtain a different approach that would complement the results shown in this article.
For future research, it would also be adequate to address the didactic aspects and educational strategies that can favor the transfer of knowledge from and to the language of instruction and mathematical concepts, as well as the linguistic aids that students may need to successfully address the challenges posed by the teaching of mathematic in a second language.