An Adaptive Multi-Stage Fuzzy Logic Framework for Accurate Detection and Structural Analysis of Road Cracks
Abstract:
The degradation of road infrastructure presents significant challenges to public safety and maintenance budgets, with cracks serving as critical indicators of structural instability. Despite extensive advancements, existing detection methodologies frequently fail to address complex surface textures, variable illumination, and diverse crack geometries, resulting in inconsistent performance. An adaptive, multi-stage framework has been developed to mitigate these limitations, integrating advanced image processing techniques with fuzzy logic-based analysis. The proposed approach utilises dynamic contrast enhancement and multi-scale feature extraction to ensure accurate detection of both fine and extensive cracks across heterogeneous surfaces. A fuzzy graph-based methodology is employed to evaluate crack connectivity, while an adapted algorithm is applied to assess continuity and severity. The framework incorporates fuzzy wavelet transforms to enhance feature segmentation and employs morphological techniques for precise crack boundary delineation. Dijkstra’s algorithm is integrated to optimise connectivity analysis, facilitating the identification of critical structural deficiencies. The performance of the model has been rigorously validated through extensive experimental testing, achieving an accuracy rate of 94.2%, with high precision and recall metrics. Comparative analysis with conventional techniques reveals a significant reduction in false detection rates and an improved capacity for capturing intricate crack features. The results underscore the practical utility of the proposed model, demonstrating its scalability and reliability across diverse roadway conditions. By enabling early and accurate identification of structural anomalies, the framework enhances roadway safety, minimises maintenance costs, and supports proactive infrastructure management. The findings highlight its potential as a transformative solution for addressing modern challenges in road maintenance, with implications for improved public safety and resource optimisation.
1. Introduction
Road infrastructure plays a pivotal role in supporting economic development and ensuring public safety. High-quality roads facilitate the efficient movement of goods and services, promote trade, and enhance connectivity among communities. However, as traffic volume increases and environmental factors contribute to road deterioration, the timely detection and maintenance of road cracks have become critical. According to the Federal Highway Administration (FHWA), nearly 40% of U.S. roads are in poor or mediocre condition, which can lead to safety hazards and increased repair costs [1]. The economic implications of poorly maintained roads are significant, with studies indicating that inadequate infrastructure costs the U.S. economy approximately $3 trillion annually due to lost productivity and increased vehicle operating costs [2]. Consequently, effective road maintenance strategies that incorporate advanced technology for crack detection are essential to prolonging the lifespan of roadways and minimizing financial burdens. Traditional methods for crack inspection primarily rely on manual visual assessments, which are often time-consuming and subjective [3], [4], [5], [6], [7]. Inspectors typically walk along the road or drive slowly, visually identifying cracks and other distresses. This approach, while still prevalent, has several drawbacks, including the risk of human error, inconsistency in judgments, and the inability to detect small or subtle cracks that could lead to severe structural failures over time. The subjectivity of manual assessments often results in inconsistent maintenance practices, where some cracks may be overlooked, leading to costly repairs down the line. Moreover, visual inspections may not provide a comprehensive view of the road’s condition, as inspectors may miss cracks that are not readily visible.
To address these shortcomings, automated techniques using computer vision and image processing have gained prominence in recent years. These methods leverage advanced algorithms to enhance detection accuracy and efficiency, minimizing the dependency on human judgment. Various techniques, including machine learning, deep learning, and mathematical morphology, have been proposed for crack detection, with each offering unique advantages and limitations.
For instance, machine learning approaches typically require extensive feature engineering and may struggle with complex backgrounds or varying illumination conditions [8]. Deep learning methods, while powerful, often require large labeled datasets for training and can be computationally intensive [9]. Furthermore, many existing methods may not adequately account for the diverse range of crack appearances and road surfaces, leading to potential misclassifications. Recent advancements in crack detection have led to the development of various methodologies [3], [10], [11], [12], [13], each with its own strengths and weaknesses. For example, Zhang and Cheng [8] proposed a deep learning-based approach for crack detection, which demonstrated promising results in detecting large cracks in well-lit environments. However, the effectiveness of deep learning is highly dependent on the quality and quantity of training data. Many datasets used for training are often limited in size and diversity, resulting in models that may not generalize well to unseen data or different environmental conditions.
Another study by Ahmadi et al. [9] employed image processing techniques coupled with machine learning classifiers for road crack detection. While their approach improved the accuracy of crack detection compared to traditional methods, it still faced challenges with noise and varying textures. The reliance on specific features extracted from the images made the model sensitive to changes in road surface conditions, such as shadows, dirt, and surface reflections. As a result, cracks that are partially obscured or located in shadowed areas may go undetected, undermining the overall efficacy of the method.
A further examination of existing methodologies reveals that many studies focus primarily on a single aspect of crack detection, whether it be segmentation, feature extraction, or classification [14], [15], [16], [17], [18], [19]. For instance, Ashraf et al. [17] utilized multi-scale feature aggregation and transformer (M-SFT)-based attention mechanisms that enable precise crack detection and segmentation on road pavement materials by capturing both global and local context, improving model accuracy. This approach effectively adapts to varying crack scales and orientations, enhancing the robustness of the detection model in real-world applications. The computational complexity of transformer-based attention mechanisms may lead to longer processing times, making real-time deployment challenging on devices with limited resources. Additionally, model performance can be sensitive to high-quality training data, which may not always be available or consistently labeled in road pavement images and did not incorporate subsequent classification stages, limiting the overall applicability of the approach in real-world scenarios where classification of detected cracks is essential for prioritizing maintenance.
Given these limitations, there is a clear need for a more comprehensive and integrated approach to road crack detection. The proposed multi-stage method aims to address the shortcomings of existing techniques [9], [17], [20] by combining advanced preprocessing fuzzy techniques, feature extraction, segmentation, and classification in a cohesive framework. This approach not only enhances the robustness of crack detection but also improves the ability to generalize across varying conditions and road types.
The proposed method begins with image preprocessing, where Gaussian filtering and fuzzy contrast enhancement improve image quality and enhance crack visibility. This is followed by multi-scale feature extraction using a fuzzy discrete wavelet transform, which captures both detailed and approximate crack structures across scales. The segmentation phase employs mathematical morphology to emphasize line-like structures characteristic of cracks, followed by fuzzy-based thresholding to optimize crack isolation, reducing sensitivity to noise and lighting variations. Next, a fuzzy graph-based modeling approach represents the detected crack structures, enabling an in-depth analysis of crack connectivity, topology, and continuity. Each crack pixel is treated as a graph node, allowing the model to assess continuity, directionality, and structure robustness using an adapted Dijkstra's algorithm. This graph-based assessment of crack connectivity and orientation provides valuable insights for prioritizing maintenance efforts based on crack severity and distribution.
The primary objective of this research is to develop a robust and adaptive multi-stage road crack detection model capable of addressing the challenges posed by complex road textures, varying lighting conditions, and diverse crack geometries. Specifically, the study aims to enhance detection accuracy by integrating advanced image processing techniques with fuzzy logic, ensuring reliable segmentation and multi-scale feature capture. The proposed model seeks to minimize false detection rates and effectively identify fine and irregular cracks, providing a scalable solution for accurate and proactive infrastructure management (refer to Figure 1).
This study presents a novel model for road crack detection that effectively integrates fuzzy logic with graph theory to address the limitations of existing approaches. By employing fuzzy contrast enhancement, wavelet-based multi-scale feature extraction, and a graph-based representation of cracks, the proposed model achieves improved accuracy and robust performance under diverse conditions. These advancements facilitate the reliable detection of fine cracks and irregular patterns, providing distinct advantages over conventional models, such as those proposed by Ashraf et al. [17] and Xu et al. [20], which often struggle with complex textures and branching cracks.
2. Propose Mathematical Approach
The proposed mathematical approach for road crack detection combines multiple advanced techniques to enhance image clarity, identify crack structures, and facilitate accurate feature extraction. First, the Gaussian filter smooths the image by reducing noise, improving the visibility of crack patterns without compromising edge information. Following this, fuzzy contrast enhancement adapts intensity levels using fuzzy membership functions, highlighting cracks more effectively in complex road textures.
The approach then employs a fuzzy wavelet transform, enabling multi-scale feature extraction and capturing fine details across varying scales for robust crack detection. Morphological operations, such as top-hat and bottom-hat transformations, further emphasize crack-like structures by isolating bright and dark patterns in the image.
To represent crack connectivity, a fuzzy graph-based model is applied, where each crack pixel is treated as a node in a graph. Fuzzy adjacency and degree matrices are used to construct a fuzzy graph Laplacian, which encapsulates the spatial and intensity-based relationships between nodes, allowing for effective connectivity analysis. Finally, feature extraction methods, including crack length and curvature, provide geometrical descriptors to support classification and analysis. These combined techniques offer a comprehensive and adaptable approach to accurately detect and characterize road cracks under varying conditions.
Preprocessing is a critical step in road crack detection as it prepares raw input images for advanced analysis. In the proposed model, the Gaussian filter is employed for noise reduction and image smoothing, a necessary process to enhance crack visibility. By mitigating random noise while preserving essential edge details, the Gaussian filter ensures that the image retains its structural integrity, which is crucial for accurately detecting road cracks.
where, $(x, y)$ are the spatial coordinates, $\sigma$ controls the spread (standard deviation) of the Gaussian function.
By applying the Gaussian filter to the image, the resulting smoothed image makes crack patterns more noticeable, facilitating easier detection in later stages of the analysis. This approach ensures that the model can focus on the essential details, such as crack locations and shapes, while minimizing the influence of noise, which can lead to false detections. Convolve the Gaussian filter $G$ with the input image $I$ to obtain a smoothed image $I_{\text {smosth}}$:
where, $k$ is the filter size parameter, often set to $3 \sigma$.
The fuzzy logic model in the proposed road crack detection framework introduces a flexible approach for handling the inherent uncertainty and imprecision in image data. Fuzzy logic allows the system to model the complexity of crack patterns, which often exhibit variations in size, shape, and intensity due to environmental factors and image quality. By incorporating fuzzy logic, the model can more accurately classify ambiguous or borderline pixels that traditional binary methods might misclassify. This results in a more robust detection process, particularly in cases of irregular crack patterns or low-contrast images. The fuzzy logic model enhances the overall precision and adaptability of the detection system, improving its performance across varied road conditions and crack geometries.
To improve the visibility of road cracks in an image, fuzzy contrast enhancement is applied using fuzzy histogram equalization. The idea is to modify the intensity levels of the image based on fuzzy membership functions, which can adjust the contrast more effectively than traditional methods, especially when dealing with varied lighting conditions or noisy images.
The fuzzy contrast enhancement procedure starts by defining the probability distribution of the intensity levels in the image. Let $P(i)$ represent the probability of each intensity level $i$. This is the normalized frequency of occurrence of intensity values in the image, and it is calculated as:
$$ P(i)=\frac{f(i)}{N} $$
where, $f(i)$ is the number of pixels with intensity level $i$, and $N$ is the total number of pixels in the image.
To apply fuzzy contrast enhancement, a fuzzy membership function $\mu_{\text {contrast }}$ is used to adjust the intensity levels. This function operates by associating each intensity level $P(i)$ with a value that determines how much enhancement should be applied to it. The function is designed as follows:
where, the fuzzy membership function $\mu_{\text{contrast}}(P(i))$ is defined to modify intensity levels based on their probability $P(i)$. The function operates in two parts:
•When the intensity probability $P(i)$ is greater than or equal to the high threshold, the membership value is set to 1 , meaning the contrast enhancement effect is applied fully.
•For intensity probabilities lower than the threshold, the membership function is proportional to $P(i)$, meaning the enhancement effect is scaled down. This allows for smoother transitions between regions of low and high contrast.
This approach contrasts with traditional contrast enhancement methods like histogram equalization, which might overly adjust intensity levels and introduce artifacts. By using fuzzy logic, the contrast enhancement process becomes adaptive and robust, especially in the presence of noise or varying illumination. The use of fuzzy contrast enhancement in crack detection allows for better visualization of cracks, which are often characterized by subtle intensity differences from their surroundings. This enhancement ensures that cracks, regardless of their size or orientation, are more easily detected, especially in low-contrast or noisy images. The gradual nature of the fuzzy adjustment also ensures that noise is not overly amplified, making the model more robust in real-world conditions.
In the proposed road crack detection model, the Fuzzy Cumulative Distribution Function (CDF) is applied to enhance contrast by adjusting pixel intensities based on their membership values. The cumulative distribution, $\mathrm{CDF}_{\text {fuzzy}}(i)$, is computed by summing the fuzzy membership values, $\mu_{\text {contrast}}$, of pixel intensities up to a given level ig (Eq. (4)). This cumulative measure helps in amplifying the contrast selectively, enhancing the visibility of subtle crack patterns.
To update each pixel intensity $I(x, y)$ for improved clarity, the model employs the fuzzy CDF in Eq. (5). The enhanced intensity, $I_{\text {enhanced}}(x, y)$, is adjusted by normalizing the fuzzy CDF , scaled by the maximum intensity $L$ (usually 255) and the total number of pixels $M, N$. This results in a contrast-enhanced image where fine details of cracks are more pronounced, aiding in more accurate detection.
The wavelet transform allows for capturing details at different scales. Integrating fuzzy logic improves resilience against variations in crack characteristics. For each level $j$, calculate the fuzzy approximation and detail coefficients:
where, $\mu_{\text {intensity}}(I(n))$ is the fuzzy membership function based on pixel intensity. $\phi$ and $\psi$ represent the scaling and wavelet functions, respectively.
Morphological operations, such as erosion, dilation, opening, and closing, are used to emphasize and refine crack-like structures in road images by manipulating shapes and boundaries. These techniques enhance line structures, making cracks more visible while reducing noise and small objects. By adapting to various image conditions, they effectively highlight cracks, improving detection accuracy.
Top-Hat (enhances bright structures):
where, $\circ$ denotes morphological opening.
Bottom-Hat (enhances dark structures):
where, $\cdot$ denotes morphological closing.
The fuzzy graph-based model for crack structure representation is a technique used to model and analyze the connectivity and structure of road cracks in an image. In this approach, the pixels representing cracks are treated as nodes in a graph, with edges representing relationships between adjacent nodes. The fuzzy logic component is applied to capture the uncertainty in crack structures, such as varying pixel intensities and connectivity patterns. By using fuzzy relationships, the model accounts for variations in crack width, orientation, and continuity, allowing for more robust and flexible detection of crack patterns in complex road surfaces. The edges in the graph are weighted based on factors like intensity similarity and spatial proximity between nodes, enhancing the model's ability to detect fine, branching, or irregular crack patterns that are typically challenging for traditional methods.
In the proposed road crack detection model, a Fuzzy Adjacency Matrix Afuzzy is constructed to represent the relationships between neighboring pixels or nodes, enhancing the model's ability to detect crack continuity. The entries of this matrix are "fuzzified" based on two criteria: the distance $d_{i, j}$ between nodes $i$ and $j$, and the intensity similarity between these nodes, $I_i$ and $I_j$ (Eq. (12)). If the distance $d_i$, ${ }_j$ is within a specified threshold $d$, the adjacency value $A_{\text {fuzzy}, i, j}$ is calculated as the product of two membership functions: $\mu_{\text {proximity }}\left(d_{i, j}\right)$, which quantifies the closeness of the nodes, and $\mu_{\text {intensity }}\left(I_i, I_j\right)$, which measures the similarity in intensity. Otherwise, the adjacency value is set to zero.
This fuzzy adjacency matrix helps the model emphasize connections between closely located and similar-intensity pixels, enhancing its ability to follow the path of cracks accurately.
The matrix $A_{\text {fuzzy}}$ is constructed using two primary factors: **distance** $d_{i, j}$ and ** intensity similarity** $I_i$, $I_j$, which help establish the strength of connectivity between any two pixels (or nodes) in the image. The distance $d_{i, j}$ measures how far apart two neighboring pixels are in the image. In a typical image, the spatial proximity of pixels is a natural indicator of their potential relationship in terms of belonging to the same object or structure, such as a road crack. In the fuzzy model, the proximity function $\mu_{\text {proximity}}\left(d_{i, j}\right)$ is used to quantify the likelihood that two pixels are neighbors based on their distance.
The intensity similarity $I_i$ and $I_j$ between two pixels captures their brightness values (or any other feature of the image, such as texture). Since cracks are often distinguishable based on intensity differences from their surroundings, intensity similarity helps refine the adjacency matrix by ensuring that only pixels with similar intensities are connected. The fuzzy membership function $\mu_{\text {intensity}}\left(I_i, I_j\right)$ is used to quantify this similarity.
The fuzzification process in the fuzzy adjacency matrix is governed by two main factors:
•If the distance between pixels $d_{i, j}$ is within a predefined threshold $d$, indicating that the two pixels are spatially close enough to potentially belong to the same crack, the adjacency value $A_{\text {fuzzy}, i, j}$ is computed as the product of the two membership functions: $\mu_{\text {proximity}}\left(d_{i, j}\right)$ and $\mu_{\text {intensity}}\left(I_i, I_j\right)$. This product quantifies the combined influence of spatial proximity and intensity similarity on the adjacency relationship.
•If the distance $d_{i, j}$ exceeds the threshold $d$, indicating that the two pixels are far apart, their adjacency value is set to zero. This reflects the fact that distant pixels are unlikely to belong to the same crack structure and thus should not be connected.
The Fuzzy Degree Matrix $D_{\text {fuzzy}}$ is used to quantify the connectivity strength of each node (or pixel) within the network of detected crack patterns. Specifically, each diagonal entry $D_{\text {fuzzy}, i, i}$ represents the total sum of fuzzy adjacency values, $A_{\text {fuzzy}, i, j}$, for all neighboring nodes $j$ connected to node $i$ (Eq. (13)). This sum effectively captures the intensity of connections or “fuzzy edges” associated with each node, providing a measure of how strongly it is linked to its surrounding nodes.
By accumulating these connection values, the fuzzy degree matrix helps enhance the model's sensitivity to continuous crack paths, supporting more reliable detection of complex and branching cracks. This matrix plays a crucial role in the structural analysis of cracks by weighing nodes with higher connectivity, which are more likely part of the crack pattern.
In our crack detection model, the Fuzzy Graph Laplacian Lfuzzy is introduced to capture the connectivity within the network of pixels and to analyze the structure of the cracks. The Laplacian matrix $L_{\text {fuzzy}}$ is computed as the difference between the fuzzy degree matrix $D_{\text {fuzzy}}$ and the fuzzy adjacency matrix $A_{\text {fuzzy}}$ (Eq. (14)). This matrix provides valuable insights into the spatial relationships between nodes, allowing the model to better distinguish crack patterns from the surrounding background.
Each element $L_{\text {fuzzy}, i, j}$ of the Laplacian matrix is defined as follows (Eq. (15)): if $i=j$, the entry is set to the diagonal value of the degree matrix $D_{\text {fuzzy}, i, i}$, representing the total connectivity strength of node $i$. If $i=j$, the entry is set to the negative of the adjacency value $A_{\text {fuzzy}, i, j}$, which reflects the connection strength between nodes $i$ and $j$. This structure allows the Laplacian matrix to highlight the edges and boundaries of the cracks, supporting more precise crack detection.
Using Dijkstra's algorithm adapted for fuzzy weights, we calculate the shortest paths between nodes, providing insights into crack continuity, see Figure 2.
where, $\mu_{\text {connectivity}}\left(x_k, y_k\right)$ represents the fuzzy connectivity strength for crack segments.
After the segmentation process, which isolates the crack regions from the background, geometric features such as crack length, width, area, and curvature are extracted. These features provide valuable information about the characteristics of the cracks, such as their severity, shape, and potential for growth.
The extracted features are then used for classification, where the crack patterns are categorized based on predefined criteria, often using machine learning or fuzzy logic models. Classification helps in determining whether a detected feature represents a significant structural issue or is a minor imperfection. By analyzing these geometric features, the model can prioritize cracks that are critical for road maintenance, aiding in the proactive management of road infrastructure.
Crack Length refers to the total length of a detected crack in a road surface, and it plays a significant role in assessing the severity of the damage. It is calculated by measuring the continuous path of the crack pixels, typically using edge detection and pixel connectivity algorithms. The formula for crack length often involves summing the distances between adjacent crack points or pixels along the detected crack line.
This feature provides an indication of the crack's extent along its path.
It refers to the rate at which the direction of a crack changes along its length. It is computed by evaluating the angle between two consecutive tangent vectors at different points along the crack. The curvature formula is based on the cosine of the angle between these vectors, which indicates how sharply the crack bends.
where, $\mathbf{e}_i$ and $\mathbf{e}_{i, 1}$ are the unit tangent vectors at points $i$ and $i+1$ along the crack path. This feature reflects the sharpness or smoothness of the crack's trajectory.
These extracted features are then used for classification, which aids in distinguishing cracks with varying shapes and sizes.
Existing road crack detection models often face limitations in handling complex road textures, variable lighting, and diverse crack structures, leading to inaccurate segmentation and increased false detections. They typically lack adaptability to multi-scale crack patterns and are sensitive to noise, making them less effective across varying conditions. To address these limitations, the proposed model integrates Gaussian filtering, fuzzy contrast enhancement, and multi-scale fuzzy wavelet transforms for enhanced crack visibility and detail capture across scales. Mathematical morphology is used to refine segmentation, while a fuzzy graph-based approach models pixel connectivity, enabling robust crack isolation. Dijkstra’s algorithm then analyzes crack continuity, and geometric feature extraction supports precise characterization. Together, these methods deliver a comprehensive, adaptive framework for accurate road crack detection, contributing to more reliable maintenance planning.
3. Experimental Results
The experimental results presented in this section evaluate the performance of the proposed road crack detection model. (a) The proposed model leverages advanced techniques such as feature extraction through crack length and curvature, combined with a robust fuzzy approach, to accurately detect and classify road cracks. (b) To benchmark the performance of the proposed model, we compare its results with those from competing models, including the approaches by Ahmadi et al. [9], Ashraf et al. [17], and Xu et al. [20], which have established effectiveness in crack detection. (c) All experiments were conducted using MATLAB R2019, with images of varying sizes (e.g., 256×256 pixels) processed on a high-performance computing system with 8 GB of RAM. The evaluation considers metrics such as accuracy, precision, and computational efficiency for a comprehensive comparison. The dataset used in this study consists of 300 images of road surfaces captured under various real-world conditions, enabling the evaluation of the model's generalization ability. These images were sourced from publicly available datasets.
The images cover diverse road types, including asphalt, concrete, and gravel, which exhibit different surface textures and crack characteristics. The dataset contains a variety of crack types, including but not limited to longitudinal, transverse, and alligator cracks. The cracks vary in size, orientation, and severity, providing a comprehensive representation of common crack patterns encountered in road infrastructure. The proposed road crack detection model effectively identifies and traces crack paths with high precision. The result in Figure 3 showcases the model's ability to accurately map initial crack contours and simulate intensity profiles, culminating in a refined output that distinctly highlights the crack's structure against the background. This approach enhances crack visibility and offers reliable results, demonstrating the model’s robustness and potential for practical applications in automated road maintenance systems. Figure 4 illustrates the superiority of the proposed crack detection model by highlighting its ability to accurately delineate crack boundaries and generate precise bounding boxes, in contrast to previous methods like Xu's model. The model's results reveal a clear and comprehensive mapping of complex crack networks, demonstrating enhanced detection performance and the potential for more reliable structural assessment in road maintenance. The Xu's model struggles to effectively detect cracks of varying sizes and scales. While it performs adequately on prominent cracks, its ability to capture fine or faint cracks is limited. This restricts its applicability in scenarios involving subtle or micro-cracks that are critical for early maintenance.
Additionally, the proposed model outperforms both competing models—Ashraf's model and Xu'smodel—in terms of visual clarity, as shown in Figure 5. In this figure, Ashraf's model fails to accurately detect the road cracks, as it is primarily effective only for larger cracks. The Ashraf's model struggles with detecting fine or micro-cracks and performs poorly in handling intersecting or branching crack patterns. It is sensitive to lighting variations and uneven surfaces, leading to false negatives. Additionally, its bounding boxes are imprecise, reducing accuracy in delineating cracks. Similarly, the Xu's model struggles to detect fine road cracks due to its limitations in handling branching cracks and irregular patterns, resulting in incomplete crack mapping. The Xu's model fails to detect multi-scale cracks, particularly smaller ones, and fragments continuous cracks due to occlusions. It relies heavily on fixed thresholds, reducing adaptability to diverse datasets. Moreover, it struggles with complex branching patterns, resulting in incomplete crack mapping.
Table 1 presents a comparison of the time (in seconds) taken for performance evaluation of the proposed road crack detection model and the models by Xu's model, Ashraf's model, and Alavi's model. The results are shown for four different images, labeled Image-1 through Image-4. As seen in the table, the proposed model consistently performs faster than the competing models across all images. For instance, in Image-1, the proposed model completes the task in 8.45 seconds, while Xu's model takes 81.25 seconds. This trend holds for the remaining images, indicating that the proposed model is more efficient in terms of processing time compared to the other models.
Image No. | Proposed Model | Xu's Model | Ashraf's Model | Alavi's Model |
---|---|---|---|---|
Image-1 | 8.45 | 81.25 | 57.71 | 64.11 |
Image-2 | 7.49 | 73.65 | 61.79 | 58.61 |
Image-3 | 5.52 | 68.15 | 63.01 | 69.56 |
Image-4 | 11.15 | 91.25 | 89.71 | 74.11 |
The confusion matrix is a vital tool for evaluating the performance of our proposed model. It provides a summary of the model's prediction results by comparing the predictions with the ground truth data, offering a comprehensive insight into the model's effectiveness. For a binary classification problem, such as crack versus non-crack detection, the confusion matrix consists of four metrics:
True Positive (TP): The number of pixels correctly identified as crack.
True Negative (TN): The number of pixels correctly identified as non-crack.
False Positive (FP): The number of pixels incorrectly identified as crack when they are actually non-crack.
False Negative (FN): The number of pixels incorrectly identified as non-crack when they are actually crack.
These values can be arranged in a matrix form as follows:
$$ \text { Confusion Matrix }=\left[\begin{array}{ll} \mathrm{TP} & \mathrm{FN} \\ \mathrm{FP} & \mathrm{TN} \end{array}\right] $$
From the confusion matrix, we can derive several key metrics to quantitatively assess the model's performance:
Accuracy: Measures the proportion of correctly identified pixels (both crack and non- crack).
$$ \text {Accuracy}=\frac{\mathrm{TP}+\mathrm{TN}}{\mathrm{TP}+\mathrm{TN}+\mathrm{FP}+\mathrm{FN}} $$
Precision: Quantifies the proportion of predicted crack pixels that are correctly identified.
$$ \text {Precision}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FP}} $$
Recall (Sensitivity): Measures the proportion of actual crack pixels that are correctly identified.
$$ \text {Recall}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}} $$
F1 Score: Provides a single metric that balances Precision and Recall.
$$ \text { F1 Score }=2 \times \frac{\text { Precision } \times \text { Recall }}{\text { Precision }+ \text { Recall }} $$
These metrics help us assess the model’s effectiveness in detecting road cracks. A higher accuracy indicates a greater proportion of correctly classified pixels. Precision reflects the accuracy of crack pixel predictions, while recall demonstrates the model’s sensitivity to actual cracks. The F1 Score provides a balance between precision and recall, summarizing the model’s overall effectiveness.
The Figure 6 illustrates a comparative analysis of four methods-Our, Xu, Ashraf, and Alavi-across four key performance metrics: Accuracy, Precision, Recall, and F-Measure. This analysis reveals that our method consistently outperforms the other methods in each metric, indicating its superior capability in classification tasks.
In terms of Accuracy, the Our method achieves a notably high score of 94.20%, suggesting a strong ability to correctly classify samples. The Xu method follows with an Accuracy of 88.60%, showing moderate performance, albeit lower than our method. Ashraf and Alavi, with scores of 85.30% and 79.50%, respectively, exhibit lower accuracies, suggesting a higher misclassification rate compared to the leading methods.
For Precision, which reflects the effectiveness in minimizing false positives, our method again leads with a score of 91.50%. This high precision implies that our method is particularly effective at correctly identifying positive cases. Xu and Ashraf reach Precision scores of 85.10% and 80.50%, respectively, indicating moderate effectiveness. Alavi, with the lowest Precision at 70.20%, appears to have a substantial rate of false positives, which could be problematic in applications where precision is crucial.
The Recall metric, representing sensitivity, further highlights the strengths of our method, which achieves a high Recall of 89.70%. This indicates a strong capability in capturing all relevant cases. Xu follows with a Recall of 82.30%, while Ashraf and Alavi score lower, at 75.80% and 65.40%, respectively. The lower Recall scores for Ashraf and Alavi suggest that these methods fail to capture a significant portion of relevant cases, potentially resulting in higher false negatives.
The F-Measure, which balances Precision and Recall, emphasizes the robustness of our method, achieving a score of 90.60%. This balance between Precision and Recall makes it an ideal choice for applications that require both accuracy and sensitivity. Xu, with an F-Measure of 83.50%, performs reasonably well, though not at the level of our method. Ashraf and Alavi score 78.10% and 67.70%, respectively, indicating that they struggle to achieve a balanced performance in these critical metrics.
In summary, our method consistently excels across all evaluation criteria, making it the most reliable and effective approach among the four methods, see Table 2. In contrast, Xu serves as a competent but less effective alternative, while Ashraf and Alavi exhibit considerable limitations, particularly Alavi, which consistently scores the lowest across all metrics. This analysis underscores our method as highly suitable for applications demanding high classification accuracy, precision, and sensitivity.
Model | Accuracy | Precision | Recall | F1 Score |
Alavi’s model | 78.5% | 70.2% | 65.4% | 67.7% |
Ashraf’s model | 85.3% | 80.5% | 75.8% | 78.1% |
Xu’s model | 88.6% | 85.1% | 82.3% | 83.6% |
Proposed model | 94.2% | 91.5% | 89.7% | 90.6% |
4. Conclusion
The proposed road crack detection model demonstrates significant advancements by integrating fuzzy logic and graph theory to address challenges such as fine crack detection, irregular geometries, and multi-scale feature analysis. This innovative approach outperforms existing models, which are limited in adapting to complex road conditions, as evidenced by the model's superior accuracy of 94.2%, high precision, and recall. These results, validated through detailed confusion matrix analysis, establish the model's reliability and robustness. Beyond detection accuracy, the proposed framework contributes to proactive and scalable infrastructure maintenance, offering practical relevance in enhancing roadway safety and management. By incorporating Dijkstra's algorithm within a fuzzy graph-based framework, the model effectively assesses crack continuity and severity, ensuring comprehensive crack analysis. This research not only provides a reliable solution for road maintenance but also lays the foundation for future advancements, such as real-time crack detection and adaptability to diverse environmental conditions.
However, the model does have limitations, including its sensitivity to image resolution and potential challenges in processing images with excessive noise. The proposed road crack detection model's performance is influenced by crack type (fine, branching, or interconnected), road surface material and texture, and environmental conditions like lighting, weather, and debris. Future work could explore optimizing the model for lower-resolution images, enhancing its robustness under noisy conditions, and integrating deep learning techniques to further improve its capabilities. The proposed model represents a significant advancement in automated road maintenance systems, improving road safety and reducing maintenance costs.
The data supporting our research results are included within the article or supplementary material.
The author declares no conflict of interest.