چارچوب مبتنی بر پایتون برای طبقه‌بندی پوشش زمین در مطالعات زمین‌شناسی مهندسی مقایسه SVM، K-means و شاخص‌های طیفی

تلخابلو, مهدی; فرمهینی فراهانی, مهدی; سیاه منصوری, سبا

doi:10.22034/KJES.2025.11.1.103081

چارچوب مبتنی بر پایتون برای طبقه‌بندی پوشش زمین در مطالعات زمین‌شناسی مهندسی مقایسه SVM، K-means و شاخص‌های طیفی

نویسندگان

مهدی تلخابلو

مهدی فرمهینی فراهانی

سبا سیاه منصوری

دانشگاه خوارزمی

10.22034/KJES.2025.11.1.103081

چکیده

طبقه‌بندی پوشش زمین از اهمیت بالایی در مطالعات زمین‌شناسی مهندسی، به‌ویژه در ارزیابی ناپایداری دامنه‌ها و حرکات توده‌ای برخوردار است. این پژوهش به ارائه یک چارچوب مبتنی بر پایتون برای طبقه‌بندی پوشش زمین پرداخته و عملکرد دو الگوریتم یادگیری ماشین شامل خوشه‌بندی k-means بدون نظارت و ماشین بردار پشتیبان (SVM( با نظارت را در مقایسه با روش‌های متداول مبتنی بر شاخص‌های طیفی NDVI، NDWI، UI و SAVI ارزیابی کرده است. منطقه مورد مطالعه در استان آذربایجان شرقی ایران، با تنوع بالایی در پوشش زمین شامل پوشش گیاهی، خاک برهنه، مناطق شهری و آب‌های سطحی است .داده‌های ماهواره‌ای لندست 8 پس از اعمال مراحل پیش‌پردازش (اصلاح گاما، نرمال‌سازی Min-Max و تحلیل مؤلفه‌های اصلی PCA) برای طبقه‌بندی چهار کلاس اصلی به کار گرفته شدند. نتایج نشان داد که الگوریتم SVM با دقت کلی 84% و ضریب کاپا 0/81 عملکرد بهتری نسبت به الگوریتمK-means با دقت %73 و ضریب کاپا 0/68 دارد. شاخص‌های طیفی به تنهایی دقتی حدود %65 فراهم کردند، اما در ترکیب با الگوریتم‌های یادگیری ماشین، کارایی آن‌ها به‌طور معنی‌داری افزایش یافت. یافته‌ها نشان می‌دهد که الگوریتم‌های یادگیری ماشین، به‌ویژهSVM ، می‌توانند به‌عنوان ابزاری قدرتمند در مطالعات زمین‌شناسی مهندسی از منظر طبقه‌بندی پوشش زمین در ارزیابی ناپایداری دامنه‌ها مورد استفاده قرار گیرند. علاوه بر این، چارچوب مبتنی بر پایتون ارائه‌شده در این پژوهش، قابلیت تکرار، شفافیت و کارایی بالایی دارد و می‌تواند به‌عنوان یک راهکار عملیاتی در مطالعات مشابه به‌کار گرفته شود.

کلیدواژه‌ها

حرکات توده‌ای

خوشه‌بندی K-means

ماشین بردار پشتیبان (SVM)

شاخص‌های طیفی

یادگیری ماشین

PCA

پایتون

20.1001.1.2538449.1404.11.1.9.8

عنوان مقاله English

A python-based framework for land cover classification in engineering geology: A comparative assessment of SVM, K-means, and spectral indices

نویسندگان English

Mehdi Talkhablou

Mahdi Farmahinifarahani

Saba Siahmansouri

Kharazmi University

چکیده English

Accurate land cover classification is a fundamental step in engineering geology studies, particularly for assessing slope instability and mass movements. With the growing availability of satellite data and machine learning tools, automated and reproducible classification frameworks have become essential. This study presents a comprehensive Python-based framework for land cover classification, comparing the performance of two machine learning algorithms, Support Vector Machine (SVM, supervised) and K-means clustering (unsupervised), against traditional spectral indices (NDVI, NDWI, UI, SAVI) using Landsat 8 imagery. The study area is located in East Azerbaijan Province, Iran, covering approximately 80×70 km with diverse land cover types, including vegetation, bare soil, urban areas, and surface water. Prior to classification, data underwent several preprocessing steps: gamma correction for visual enhancement, Min-Max normalization for data scaling, and Principal Component Analysis (PCA) for dimensionality reduction and multicollinearity mitigation. PCA retained components explaining at least 95% of total variance. Classification was performed on four main classes. Results were evaluated using Overall Accuracy (OA), Kappa Coefficient, and weighted Precision, Recall, and F1-Score. The SVM algorithm, using an RBF kernel, achieved the highest accuracy with 84% OA and a Kappa of 0.81, demonstrating superior ability in defining clear class boundaries, particularly in distinguishing urban areas from bare soil. In contrast, K-means clustering yielded 73% OA and a Kappa of 0.68, with noticeable class overlap. Spectral indices alone provided a baseline accuracy of ~65%, but their integration with machine learning models significantly improved performance. The findings confirm that supervised machine learning models, particularly SVM, outperform unsupervised clustering and standalone spectral indices. However, K-means remains viable in data-scarce scenarios. The proposed Python-based workflow offers a reproducible, transparent, and efficient approach for land cover analysis, making it a valuable tool for engineering geology applications such as landslide susceptibility mapping.

کلیدواژه‌ها English

Mass movements

K-means clustering

Support Vector Machine (SVM)

spectral indices

Machine Learning

PCA

Python

land cover classification

Ashrafi Igder, M., Liang, X., Mitolo, M., 2022. Service Restoration Through Microgrid Formation in Distribution Networks: A Review. IEEE Access 10, 46618-46632.
Azari, T., 2025. Design of a supervised artificial intelligence committee machine to estimate hydrodynamic parameters of confined aquifers. Kharazmi Journal of Earth Sciences 11 (1), 28-46.
Berk, R. A., 2008. Support vector machines. Statistical Learning from a Regression Perspective. Springer Nature Switzerland AG.
Bishop, C. M., 2006. Pattern Recognition and Machine Learning (Information Science and Statistics). Springer.
Gholami Dargahi, M., Pourbeyranvand, Sh., Talkhablou, M., 2021. Stress inversion studies using 3D seismic interpretation data and earthquakes focal mechanism. Kharazmi Journal of Earth Sciences 7 (2), 317-341.
Ishfagh, A., Atiq ur, R., Khan DM, Khan Z, Shafiq M, Choi J-G., 2022. Model Selection Using K-means Clustering Algorithm for the Symmetrical Segmentation of Remote Sensing Datasets. Symmetry 14(6), 1149.
Javed, A., Cheng, Q., Peng, H., Altan, O., Li, Y., Ara, I., Saleem, N., 2021. Review of Spectral Indices for Urban Remote Sensing. Photogrammetric Engineering & Remote Sensing 87(7), 513-524.
Jolliffe, I. T., Cadima, J., 2016. Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 374(2065), 20150202.
Kwedlo, W., 2011. A clustering method combining differential evolution with the Kmeans algorithm. Pattern Recognition Letters 32(12), 1613-1621.
Mróz, M., Sobieraj, A., 2004. Comparison of several vegetation indices calculated on the basis of a seasonal SPOT XS time series and their suitability for land cover and agricultural crop identification. Technical Sciences 7, 39–66
Pedregosa, F., 2011. Scikit‐learn: Machine learning in python Fabian. Journal of machine learning research 12, 2825.
Peshawa, J.M.A., 2022. Investigating the Impact of min-max data normalization on the regression performance of K-nearest neighbor with different similarity measurements. ARO-The Scientific Journal of Koya University 10(1), 85-91.
Petrelli, M., 2021. Introduction to Python in Earth Science Data Analysis: From Descriptive Statistics to Machine Learning. Springer Nature.
Petrelli, M., 2023. Machine Learning for Earth Sciences Using Python to Solve Geological Problem. Springer Textbooks in Earth Sciences, Geography and Environment.
Talkhablou, M., Fatemi Aghda, S.M., Milani Chegooshi, H., 2023. Comparison of main rock mass classification methods with an attitude on determining the initial support. Kharazmi Journal of Earth Sciences 8 (2), 94-116.
Van Westen, C. J., Castellanos, E., & Kuriakose, S. L., 2008. Spatial data for landslide susceptibility, hazard, and vulnerability assessment: An overview. Engineering Geology 102(3-4), 112-131.
Xu, H., 2006. Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. International journal of remote sensing 27(14), 3025-3033.
Zhang, C., Ma, X., & Li, W., 2020. A review on deep learning in remote sensing. ISPRS Journal of Photogrammetry and Remote Sensing 162, 1-14.

دوره 11، شماره 1 - شماره پیاپی 21
شهریور 1404
صفحه 201-219

XML

اصل مقاله 1.52 M

تعداد مشاهده مقاله 3
تعداد دریافت فایل اصل مقاله 2

نشریه علوم زمین خوارزمی

چارچوب مبتنی بر پایتون برای طبقه‌بندی پوشش زمین در مطالعات زمین‌شناسی مهندسی مقایسه SVM، K-means و شاخص‌های طیفی

A python-based framework for land cover classification in engineering geology: A comparative assessment of SVM, K-means, and spectral indices

دوره 11، شماره 1 - شماره پیاپی 21شهریور 1404صفحه 201-219

فایل ها

هم رسانی

ارجاع به این مقاله

آمار

دوره 11، شماره 1 - شماره پیاپی 21
شهریور 1404
صفحه 201-219