1. Introduction
Groundwater, a vital resource, supports the global societies in multiple ways [
1,
2,
3]. Approximately 42% of the total population is directly dependent on this resource for the necessities as well as for the livelihood [
4,
5,
6]. However, the groundwater resources have been overexploited due to increasing industrialization [
4], agricultural activities [
7], and urbanization [
8]. The Sustainable Development Goal 6 (SDG 6) provides a blueprint for the availability of safe water and its sustainable management as part of the 2030 agenda. India, with the largest population in the world, also consumes the highest volume of groundwater, and thus, its exploited wells have been continuously rising [
9,
10,
11,
12]. Estimates show that almost 60% of its aquifers would become critical by 2025 [
13,
14,
15]
This common source of freshwater is dominantly used for the drinking water supply, agriculture, and animal farming in the Tamil Nadu state of India [
16]. However, the depleting groundwater levels have impacted the water resource quantity and quality [
16,
17,
18]. Groundwater exploitation is represented by the ratio of annual abstraction to yearly total water available, and its average value for the Tamil Nadu state is 70% [
19]. The current annual groundwater extraction and recharge volumes for this state are 14.4 and 21.1 billion cubic meters, respectively [
19]. Similarly, the groundwater withdrawal is about 68% of the annual recharge [
12]. These values indicate that groundwater is being pumped at an extremely alarming rate in the state of Tamil Nadu [
20]. In addition to their depleted nature [
17], the complexity in aquifer systems of this region is represented by the geology comprising charnockite, khondalite, and migmatite gneisses, and they exhibit low storage and movement. The judicious use and sustainable management of these aquifers require effective strategies as designed in the other parts of the world [
21] and India [
12,
22,
23]. The delineation of the groundwater potential of any area is an important step for evaluating the aquifer sustainability [
24,
25]. It is carried out using the Analytic Hierarchy Process (AHP), which uses the multi-criteria decision-making method [
26,
27]. GIS and remote sensing technologies, incorporated into the AHP, have helped to synthesize the large datasets and to provide accurate results [
27,
28,
29,
30,
31,
32,
33,
34,
35,
36]. In the drought-prone Sivagangai district, ref. [
37] delineated four major groundwater potential zones through the integration of thematic layers, such as geology and slope, validated by groundwater level maps, and proposed the implementation of artificial recharge structures to enhance groundwater resources. In the Cuddalore district [
38], GIS and AHP were employed to map groundwater potential zones, identifying high infiltration rates in coastal areas with flat topography and lower groundwater potential in regions with steep slopes. Similarly, ref. [
39] utilized GIS and AHP in the Chennai River Basin, achieving an accuracy of 78.43% in delineating groundwater potential zones. In the Vaigai River upper basin, ref. [
40] applied GIS-based weighted overlay analysis to identify suitable zones for groundwater recharge and recommended site-specific artificial recharge structures to mitigate water scarcity.
Groundwater potential zone mapping has been assessed using statistical methods, including the Analytic Hierarchy Process (AHP), in different regions of the world [
31,
41,
42]. However, dynamic interactions and nonlinear changes over time could be underestimated by conventional statistical approaches [
24]. On the other hand, machine learning (ML) methods are growing increasingly effective because of their ability to identify intricate connections and predict the location of groundwater. For groundwater mapping, previous investigations have employed a number of machine learning approaches, such as Random Forest (RF), K-Nearest Neighbor (KNN), Gradient Boosting (GB), Support Vector Machine (SVM), and artificial neural network (ANN) [
43,
44,
45]. Several locations have used these models, including India [
46], China [
47], Bangladesh [
48], Pakistan [
49], Tunisia [
50], and Botswana drylands [
51]. Previous studies have been performed in the Madurai district in a broader context, but there has not been any research performed specifically at the Melur block. Recent studies using GIS and remote sensing techniques, ref. [
52] investigated groundwater potential zones in the Madurai district’s hard rock terrain, concentrating on thematic layers including geology, geomorphology, and slope. Ref. [
53] used remote sensing and fuzzy AHP to evaluate the groundwater potential in Madurai city, finding important factors such as drainage density, geology, and land use. However, none of these studies specifically address the Melur block, an important region within the Madurai district that faces unique hydrogeological challenges. In order to reveal hidden patterns and relationships in groundwater data, these machine learning techniques make use of artificial intelligence. Researchers can precisely determine groundwater potential zones and make well-informed management decisions by utilizing RS and GIS through AHP and ML approaches. These methods allow for the effective mapping of groundwater resources while taking into account a number of variables that affect the presence and behavior of groundwater. Further study in this area will help create sustainable water resource management plans and advance our knowledge of groundwater dynamics.
Although groundwater potential zone mapping has utilized the use of singular models, ensemble models that integrate statistical and machine learning techniques have also shown acceptable accuracy. These ensemble models provide a combination of techniques, utilizing the advantages of machine learning and statistics [
44]. Several kinds of machine learning models can be used to help researchers make use of different methods in their attempts at accurate mapping of groundwater potential. Assessing groundwater availability has historically been difficult due to factors such as accuracy, climate change, and a lack of data. By using an innovative approach integrating machine learning algorithms for mapping the sustainable groundwater potential zone in Melur, Madurai District, Tamil Nadu, India, this study addresses this gap.
Gradient Boosting (GB), Random Forest (RF), and Support Vector Machine (SVM) are three machine learning (ML) models that will be used in this study, along with the conventional AHP technique to map groundwater potential zones in the Melur Madurai District. The study enhances groundwater mapping’s accuracy and efficiency by including these models, exceeding traditional methods. This promotes the selection of suitable groundwater resource management plans and regulations. Being the first study to use machine learning techniques in the region of Melur, Madurai, it addresses a significant knowledge gap. Additionally, selecting the best-performing model among them becomes quite difficult because each model produces a profound impact, although there are a number of limitations on how each ML technique may predict a region’s potential. The aquifers here are least replenishable and might deplete further in the absence of any suitable measures in the near future [
54]. The developed maps of GWPZ provide valuable information for water users and policymakers to develop sustainable strategies for the preservation of groundwater resources in this region, as well as in other regions with similar geomorphic and geologic characteristics within India.
Globally, numerous studies have applied GIS-based multi-criteria decision-making (MCDM) techniques and machine learning models to delineate groundwater potential zones; however, many are either region-specific or lack integration of both surface and hydrogeological variables in a data-scarce environment. In comparison, limited research exists for the Madurai region that combines geospatial analysis with advanced modeling techniques to evaluate groundwater potential with a focus on both spatial heterogeneity and model interpretability. The current study addresses this gap by integrating physical parameters with machine learning-based feature analysis to assess groundwater potential across a relatively uniform terrain. The key objectives of this research are as follows: (i) to delineate groundwater potential zones using a combination of geospatial and data-driven approaches; (ii) to evaluate the interrelationship between influencing parameters; (iii) to assess the effectiveness of different machine learning models in capturing groundwater potential. The novelty of this study lies in its application of both visual analytics (such as correlation maps and pair plots) and model-based accuracy assessment to interpret complex parameter interactions, offering a more robust and scalable approach to groundwater assessment compared to conventional methods.
4. Results
4.1. Geology
The major rock types include crystalline rocks consisting of charnockite and khondalites. A majority of the Melur area also has exposures of migmatite gneissic complex (
Figure 3a) covering the southern and central regions, representing a significant portion of the area. The Khondalite Gneissic Complex is primarily concentrated in the northern region, while the Charnockite Gneissic Complex appears in scattered patches in the northern and central regions. These geological formations influence the area’s groundwater potential, with the Khondalite and Charnockite regions showing higher groundwater potential due to their lithological characteristics. In these terrains, the groundwater is sustained through the secondary porosity, such as percolation through the fractured and highly weathered crystalline rocks [
70].
4.2. Geomorphology
Among the eight thematic layers, the geomorphology influences the groundwater occurrence and flow [
71]. The landform map shows distinct geomorphological units viz. pediment and pediplain complex covers approximately 450 km
2, representing about 66.3% total area, weathered hills/valleys cover roughly 90 km
2 or 13.3% of the area, quarry and mine areas occupy around 35 km
2 or 5.1% of the study area, and river and water bodies covering approximately 60 km
2 (8.8%) and 43 km
2 (6.5%), respectively (
Figure 3b). Pediments and pediplains are common. Manmade depressions like quarries generate artificial groundwater recharge [
72]. Rivers and other water bodies also recharge the aquifers and encourage the groundwater potential.
4.3. Lineaments
Lineaments comprise mainly faults or fractures, as well as subsurface structural features like lithological boundaries and the drainage profiles [
73]. The map of lineament (
Figure 3c) indicates that the study area exhibits low lineament density. Lineament density is highest in the northern parts, favoring the recharge through fractures, joints, and fissures. The interconnected fractures, along with a pressure gradient, encourage the groundwater flow from the high to the low lineament areas.
4.4. Land Slope
Surface slope inversely affects the rate of recharge, with the steep slopes encouraging more runoff and the gentle slope favoring infiltration due to prolonged residence time [
74,
75]. The areas with gentle slope have higher groundwater potential than the steep slope regions [
27]. The slope map shows an overall gently sloping topography, and the steeply sloping features resembling hills are found in the central and western regions. The isolated valleys, similar to the depressions, are present in the northern region (
Figure 3d).
4.5. Land-Use/Land-Cover (LuLc)
Land use/land cover (LuLc) data is essential to characterize the runoff, evapotranspiration, and percolation [
76]. The LuLc map shows four major classes, with about 53.1% of the area covered by vegetation or cropland. This LuLc class favors recharge and storage of the groundwater (
Figure 4a). About 41% of the area is barren or uncultivated land, 16.55% is represented by the residential buildings or urbanization, and 23.9% of the area is covered by water bodies.
4.6. Soil
Soils affect recharge by permitting water infiltration, as the highly permeable soil allows more recharge. The dominant soil types in the study area are represented by clay loam and loam soil. Clay loam has a higher percentage of clay, and its low permeability inhibits the groundwater infiltration. Loam soil, consisting of equivalent proportions of sand, silt, and clay, provides more permeability and allows more groundwater recharge [
77]. The clay loam covers the entire area except for the northern region, whereas the loam soil is restricted to the northern part (
Figure 4b).
4.7. Rainfall
The rate of infiltration is influenced by the amount of rainfall [
28]. The duration, intensity, and pattern of rainfall control the recharge process. Low-intensity rainfall for many days is generally suitable for recharge [
78]. In the study area, the northeastern monsoon is the major contributor of precipitation. The rainfall map illustrates between 1015 mm and 1104 mm of average annual precipitation, with the highest values located in the northern area. In general, the overall distribution shows comparable rainfall throughout the study area (
Figure 4c).
4.8. Drainage Density
Drainage density, a measure of the length of streams per unit area of the basin, also impacts the groundwater resources. High drainage density promotes more runoff and low groundwater recharge. The drainage density is categorized into four classes: low density (high groundwater potential), moderate density, high density, and very high density (low groundwater potential) [
79,
80].
Figure 4d illustrates that the central parts have high drainage density and consequently the least groundwater prospects.
4.9. Hydrogeochemical Characteristics
pH levels in the Melur region range from 7.0 to 8.1. The southern region shows slightly alkaline water (up to 8.1), likely due to geological formations or human activities such as agriculture or industry. The central and northern regions have near-neutral pH levels (7.0–7.5), indicating stable and suitable water. The slight alkalinity in the southern parts may require periodic monitoring to maintain water quality.
Electrical Conductivity (EC), which indicates salinity and dissolved ions, ranges from 790 to 2010 µmho/cm. The southern region has higher EC levels, especially in areas with lower water levels, due to evaporation. The central region shows moderate EC levels (950–1690 µmho/cm), while the northern region has the lowest values, starting at 790 µmho/cm, suggesting fresher water. High EC in the south may affect water usability for drinking and irrigation, as salinity can harm crops and human health.
Total Dissolved Solids (TDS), which measure dissolved substances in water, range from 422 to 1054 mg/L. The southern region has higher TDS levels, often exceeding 1000 mg/L, likely due to rock–water interaction or agricultural runoff, making the water unsuitable for direct consumption without treatment. The central and northern regions have moderate to low TDS levels (422–855 mg/L), remaining within permissible drinking water limits.
Calcium (Ca++) and Magnesium (Mg++) are key indicators of water hardness. In the study area, calcium levels range from 50 to 118 mg/L, and magnesium levels range from 29 to 84 mg/L. The southern region has higher calcium levels, often above 100 mg/L, and magnesium levels reaching 84 mg/L, leading to harder water. This is caused by natural rock–water interaction. The northern region has lower calcium levels (50–60 mg/L) and magnesium levels below 29 mg/L, resulting in softer water. Sodium (Na+) and Potassium (K+) are important for assessing water salinity and irrigation suitability. Sodium levels range from 58 to 156 mg/L, with higher concentrations in the southern and central regions, which can affect soil quality and crop growth. The northern region has lower sodium levels (58–97 mg/L), making the water better for agriculture. Potassium levels are low across all regions, ranging from 2 to 7 mg/L, indicating little human impact, such as fertilizer runoff, on water quality.
Anions like Chloride (Cl−), Sulphate (SO42−), and Bicarbonate (HCO3−) are the main indicators of water quality. Chloride levels range from 72 to 330 mg/L, with higher levels in the southern region (over 300 mg/L), likely due to agricultural runoff, salt leaching, or weathering. Excess chloride can make water salty and unsuitable for drinking. The central and northern regions have moderate chloride levels (72–180 mg/L), indicating better water quality.
Sulphate levels vary between 48 and 180 mg/L. The southern regions show higher sulphate concentrations (above 150 mg/L), which can cause a bitter taste and gastrointestinal issues. In the northern regions, sulphate levels are lower (below 100 mg/L), indicating good water quality. Bicarbonates range from 180 to 360 mg/L, with higher levels in the southern regions contributing to alkalinity and water hardness.
The Water Quality Index (WQI) combines several parameters to assess overall water quality. WQI values range from 57.07 to 120.82. The southern regions have excellent water quality (WQI below 60), suitable for all purposes. The central regions show moderate quality (WQI 70–90), where treatment may be needed for drinking. The southern regions face poor water quality, with WQI exceeding 100 (up to 120.82), caused by high TDS, EC, and hardness. Immediate steps like water treatment and source protection are required to improve water quality in these areas.
6. Conclusions
The groundwater potentiality of Melur in Tamil Nadu state of India was evaluated by studying an area of 678 km2 in widely accepted Analytical Hierarchy Processes (AHP) and by using the remote sensing and GIS tools with the data layers consisting of geology, geomorphology, lineament density, land use/land cover, rainfall, soil, slope and drainage density. The weighted layers separated this area into five zones between poor and high groundwater potential. About 55.2% of the study area has very low to low groundwater potential, and only 1.3% of the area has excellent groundwater potential. About 6.6% and 36.5% of the study area has high and moderate groundwater potential, respectively.
The distribution map demarcates the northern region as more appropriate for groundwater extraction in a sustainable way. The southern and central regions are vulnerable to aquifer degradation, and the groundwater prospection here must be restricted with adequate aquifer system rejuvenation or replenishment strategies. By integrating the Groundwater Potential Zones (GWPZ) and Water Quality Index (WQI), the study can effectively identify different zones based on their suitability for extraction and consumption. This study effectively illustrated how machine learning algorithms may be used to categorize groundwater potential zones. With 100% accuracy and accurate predictions, the Random Forest (RF) and Gradient Boosting classifiers performed better than the Support Vector Machine (SVM) model. The most significant variables influencing the classification of groundwater potential were identified using correlation visualization and feature importance analysis. The findings highlight how crucial feature selection, model selection, and data pretreatment are to obtaining high classification accuracy. While ensemble methods such as Random Forest and Gradient Boosting proved to be highly effective, further studies could explore additional machine learning techniques such as XGBoost, Neural Networks, or k-Nearest Neighbors (k-NN) to further enhance predictive capabilities. This study highlights the effectiveness of machine learning in groundwater potential assessment and provides a foundation for future research in hydrogeological modeling and sustainable water resource management. The delineated groundwater potential zones can serve as a scientific basis for local authorities and water resource planners to prioritize areas for sustainable groundwater extraction, recharge zone protection, and infrastructure development. Integrating these insights into land-use planning and agricultural water management policies can enhance water security, especially in water-scarce regions. The lithological and geological conditions are not favorable for efficient groundwater recharge and storage. Therefore, artificial recharging or managed aquifer recharge measures should be vital for sustainable aquifer productivity, congruent with SDG 6.