Calculation of Similarity between MUI Fatwas: A Comparison of Text Extraction Features and String Matching Algorithms

Authors

  • Mohamad Fahmi Syaifudin, Sepuluh Nopember Institute of Technology
  • Gagatsatya Adiatmaja
  • Bilal Hidayaturrohman, Sepuluh Nopember Institute of Technology

DOI:

https://doi.org/10.12962/j22759970.v5i1.1226

Keywords:

Fatwa MUI, Word Extraction, Jaccard Similarity, Cosine Similarity, Euclidean Similarity, Dice Similarity

Abstract

Fatwas, the religious rulings issued by the Indonesian Ulama Council (MUI), play a crucial role in guiding the Muslim community. This research analyzes the similarity between these fatwas and compares several feature extraction and similarity methods. The dataset comprises 380 fatwa titles collected from the official website of the National Sharia Council of the Indonesian Ulama Council. The methodology is structured as follows: data collection, then text pre-processing (punctuation removal, stemming, and stop word elimination), then feature extraction using Bag of Words (BoW), TF-IDF (Term Frequency-Inverse Document Frequency), and BERT (Bidirectional Encoder Representations from Transformers). Similarity is calculated using Jaccard Similarity, Cosine Similarity, Euclidean Distance, and Dice Coefficient. The results show that Cosine Similarity combined with TF-IDF achieves the highest performance, with an F1 score of 0.299. The study's novelty lies in its comprehensive comparison of multiple similarity methods applied to MUI fatwas, which provides insights for researchers and practitioners in Natural Language Processing (NLP).
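As a rough illustration of the measures named in the abstract, the following sketch computes Jaccard, Dice, cosine, and Euclidean scores for a few short titles. It is not the authors' implementation. The example titles are hypothetical, the tokenizer skips stemming and stop word removal, and the TF-IDF weighting (`idf = ln(N/df) + 1`) is one common variant chosen here as an assumption. BERT embeddings are left out because they require a pretrained model.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (no stemming or stop word removal)."""
    return text.lower().split()


def jaccard(a, b):
    """Jaccard similarity on token sets: |A & B| / |A | B|."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def dice(a, b):
    """Dice coefficient on token sets: 2|A & B| / (|A| + |B|)."""
    sa, sb = set(a), set(b)
    total = len(sa) + len(sb)
    return 2 * len(sa & sb) / total if total else 0.0


def tfidf_vectors(docs):
    """Return (vocab, vectors) with one TF-IDF vector per tokenized document.

    Assumed weighting: raw term count times (ln(N / df) + 1).
    """
    vocab = sorted({t for d in docs for t in d})
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    idf = {t: math.log(n / df[t]) + 1.0 for t in vocab}
    vectors = []
    for d in docs:
        tf = Counter(d)
        vectors.append([tf[t] * idf[t] for t in vocab])
    return vocab, vectors


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(y * y for y in v))
    return dot / (nu * nv) if nu and nv else 0.0


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors (lower is closer)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


# Hypothetical titles for illustration only; not taken from the paper's dataset.
titles = [
    "murabahah financing",
    "murabahah financing settlement",
    "sharia insurance guidelines",
]
docs = [tokenize(t) for t in titles]
vocab, vecs = tfidf_vectors(docs)

print("Jaccard(0, 1):", jaccard(docs[0], docs[1]))  # 2 shared tokens out of 3 distinct
print("Dice(0, 1):", dice(docs[0], docs[1]))
print("Cosine TF-IDF(0, 1):", cosine(vecs[0], vecs[1]))
print("Cosine TF-IDF(0, 2):", cosine(vecs[0], vecs[2]))  # no shared tokens
print("Euclidean TF-IDF(0, 1):", euclidean(vecs[0], vecs[1]))
```

Jaccard and Dice operate on token sets, while cosine and Euclidean operate on the weighted vectors. Note that Euclidean distance is a dissimilarity, so ranking by it runs in the opposite direction from the other three.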

Published

2025-02-27

How to Cite

[1] M. F. Syaifudin, G. Adiatmaja, and B. Hidayaturrohman, “Calculation of Similarity between MUI Fatwas: A Comparison of Text Extraction Features and String Matching Algorithms”, hrj, vol. 5, no. 1, pp. 1–13, Feb. 2025.

Section

Articles