Calculation of Similarity between MUI Fatwas: A Comparison of Text Extraction Features and String Matching Algorithms
DOI:
https://doi.org/10.12962/j22759970.v5i1.1226
Keywords:
Fatwa MUI, Word Extraction, Jaccard Similarity, Cosine Similarity, Euclidean Similarity, Dice Similarity
Abstract
Fatwas, as religious rulings issued by the Indonesian Ulama Council (MUI), play a crucial role in guiding the Muslim community. This research aims to analyze the similarity between these fatwas, contributing to the field by comparing various similarity methods. The dataset includes 380 fatwa titles collected from the official website of the National Sharia Council of the Indonesian Ulama Council. The research follows a structured methodology: starting with data collection, followed by text pre-processing involving punctuation removal, stemming, and stop word elimination. Word extraction techniques such as Bag of Words (BoW), TF-IDF (Term Frequency-Inverse Document Frequency), and BERT (Bidirectional Encoder Representations from Transformers) are then applied. Similarity is calculated using Jaccard Similarity, Cosine Similarity, Euclidean Distance, and Dice Coefficient. The results show that Cosine Similarity combined with TF-IDF achieves the highest performance with an F1 Score of 0.299. This study is novel in its comprehensive comparison of multiple similarity methods applied to MUI fatwas, providing valuable insights for researchers and practitioners in Natural Language Processing (NLP).
References
R. A. Zayed, M. F. A. Hady, and H. Hefny, "Islamic fatwa request routing via hierarchical multi-label Arabic text categorization," in Proceedings - 1st International Conference on Arabic Computational Linguistics: Advances in Arabic Computational Linguistics, ACLing 2015, Institute of Electrical and Electronics Engineers Inc., Feb. 2016, pp. 145-151. doi: 10.1109/ACLing.2015.28.
J. A. Ali, "Contemporary Islamic Revivalism: Key Perspectives," 2012.
W. B. Hallaq, A History of Islamic Legal Theories: An Introduction to Sunnī Uṣūl al-Fiqh. 1997.
Nasrullah, "MAJELIS ULAMA INDONESIA (MUI); STUDI ATAS PENGGUNAAN METODOLOGI QIYAS SEBAGAI UPAYA PENETAPAN HUKUM ISLAM DI INDONESIA," 2017.
M. Asad, "Ulama in Indonesian Politics: Analysis on the Attitudes of The Majelis Ulama Indonesia (MUI) on the General Elections," Akademika, vol. 16, no. 1, Jun. 2022, doi: 10.30736/adk.v16i1.764.
A. Irsyad and N. A. Rakhmawati, "Community detection in twitter based on tweets similarities in indonesian using cosine similarity and louvain algorithms," Register: Jurnal Ilmiah Teknologi Sistem Informasi, vol. 6, no. 1, pp. 22-31, 2020, doi: 10.26594/register.v6i1.1595.
N. Aini Rakhmawati and M. Jannah, "Food ingredients similarity based on conceptual and textual similarity," 2021. [Online]. Available: http://halal.addi.is.its.ac.id/
N. Aini Rakhmawati, A. Adi Firmansyah, P. Maulidya Effendi, R. Abdillah, and T. Agung Cahyono, "Auto Halal Detection Products Based on Euclidian Distance and Cosine Similarity," vol. 8, pp. 4-6, 2018, [Online]. Available: http://halal.addi.is.its.ac.id;
R. Singh and S. Singh, "Text Similarity Measures in News Articles by Vector Space Model Using NLP," Journal of The Institution of Engineers (India): Series B, vol. 102, no. 2, pp. 329-338, Apr. 2021, doi: 10.1007/s40031-020-00501-5.
A. A. Munshi, W. H. AlSabban, A. T. Farag, O. E. Rakha, A. A. Al Sallab, and M. Alotaibi, "Towards an Automated Islamic Fatwa System: Survey, Dataset and Benchmarks," International Journal of Computer Science and Mobile Computing, vol. 10, no. 4, pp. 118-131, Apr. 2021, doi: 10.47760/ijcsmc.2021.v10i04.017.
Hasmawati and Ade Romadhony, "Similar Questions Identification on Indonesian Language Subject Using Machine Learning," Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI), vol. 12, no. 2, pp. 196-202, Jul. 2023, doi: 10.23887/janapati.v12i2.62582.
M. J. Sulastri, N. Aini Rakhmawati, and R. Indraswari, "Identifying Gender Bias in Online Crime News Indonesia Using Word Embedding," in 2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation, ICAMIMIA 2023 - Proceedings, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 774-778. doi: 10.1109/ICAMIMIA60881.2023.10427911.
Wisam A. Qader, Musa M. Ameen, and Bilal I. Ahmed, "An Overview of Bag of Words; Importance, Implementation, Applications, and Challenges," in Fifth International Engineering Conference on Developments in Civil & Computer Engineering Applications 2019 - (IEC2019) - Erbil - IRAQ, 2019.
K. A. Alshaikh, O. A. Almatrafi, and Y. B. Abushark, "BERT-Based Model for Aspect-Based Sentiment Analysis for Analyzing Arabic Open-Ended Survey Responses: A Case Study," IEEE Access, vol. 12, pp. 2288-2302, 2024, doi: 10.1109/ACCESS.2023.3348342.
A. B. Y. A. Putra, Y. Sibaroni, and A. F. Ihsan, "Disinformation Detection on 2024 Indonesia Presidential Election using IndoBERT," in 2023 International Conference on Data Science and Its Applications, ICoDSA 2023, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 350-355. doi: 10.1109/ICoDSA58501.2023.10277572.
D. D. Prasetya, A. P. Wibawa, and T. Hirashima, "The performance of text similarity algorithms," International Journal of Advances in Intelligent Informatics, vol. 4, no. 1, pp. 63-69, Mar. 2018, doi: 10.26555/ijain.v4i1.152.
S. P. Pati and R. Rautray, "An Empirical Analysis of Similarity based Single Document Summarization," in Proceedings - 5th International Conference on Computing Methodologies and Communication, ICCMC 2021, Institute of Electrical and Electronics Engineers Inc., Apr. 2021, pp. 860-864. doi: 10.1109/ICCMC51019.2021.9418297.
S. A. Khan and Z. Ali Rana, "Evaluating Performance of Software Defect Prediction Models Using Area Under Precision-Recall Curve (AUC-PR)," in 2019 2nd International Conference on Advancements in Computational Sciences (ICACS), IEEE, Feb. 2019, pp. 1-6. doi: 10.23919/ICACS.2019.8689135.
C. Anwar ul Hassan, M. Sufyan Khan, and M. Ali Shah, "Comparison of Machine Learning Algorithms in Data classification," in Proceedings of the 24th International Conference on Automation & Computing, 2018.
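
Illustrative sketch of the similarity measures named in the abstract. The following is a minimal Python example, not the authors' implementation: it shows how Jaccard Similarity, Dice Coefficient, Cosine Similarity, and Euclidean Distance could be computed for two short titles using a simple Bag of Words representation. The function names, the simplified tokenizer (punctuation removal only, with no stemming or stop word elimination), and the two sample titles are assumptions made for illustration and do not come from the paper or its dataset.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, replace punctuation with spaces, and split on whitespace.

    Assumption: stemming and stop word elimination are omitted for brevity.
    """
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in text.lower())
    return cleaned.split()


def jaccard(tokens_a, tokens_b):
    """Size of the shared token set divided by the size of the union."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def dice(tokens_a, tokens_b):
    """Twice the shared token count divided by the sum of both set sizes."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    total = len(set_a) + len(set_b)
    return 2 * len(set_a & set_b) / total if total else 0.0


def bow_vectors(tokens_a, tokens_b):
    """Build term-count vectors for both texts over their joint vocabulary."""
    vocab = sorted(set(tokens_a) | set(tokens_b))
    counts_a, counts_b = Counter(tokens_a), Counter(tokens_b)
    return [counts_a[w] for w in vocab], [counts_b[w] for w in vocab]


def cosine(u, v):
    """Dot product of the two vectors divided by the product of their norms."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def euclidean_distance(u, v):
    """Straight-line distance between the vectors (lower means more similar)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


# Hypothetical sample titles, not taken from the fatwa dataset.
title_a = tokenize("Murabahah financing")
title_b = tokenize("Musyarakah financing")
vec_a, vec_b = bow_vectors(title_a, title_b)

print("Jaccard:", jaccard(title_a, title_b))
print("Dice:", dice(title_a, title_b))
print("Cosine:", cosine(vec_a, vec_b))
print("Euclidean distance:", euclidean_distance(vec_a, vec_b))
```

Jaccard and Dice operate on token sets, whereas Cosine and Euclidean operate on vectors, so the same functions would also accept TF-IDF weights or BERT embeddings in place of raw counts. Euclidean is a distance rather than a similarity; how the study converts it to a score comparable with the other measures is not stated in the abstract.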

License
Copyright (c) 2025 Pusat Kajian Halal ITS

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Copyright
Authors who publish their manuscripts in this journal agree to the following terms:
- The copyright of each article remains with the authors.
- Halal Research Journal holds the right to publish the article first under the Creative Commons Attribution 4.0 International License.
- Authors may distribute their published manuscripts non-exclusively (e.g., to institutional repositories or as part of book publications), provided they acknowledge that the article was first published in this journal.
License
Articles published in this journal are licensed under the Creative Commons Attribution 4.0 International License. This license permits anyone to copy, distribute, adapt, modify, and create derivative works from the material in any form, including for commercial purposes, provided that proper credit is given to the authors for the original work.