Misjudgments in court cases are inevitable in any judicial system, however advanced the country it serves. The economic cost of wrong court judgments is substantial. A wrong judgment can result from a lack of evidence caused by poor research by counsel. Preparing for a court case is not an easy feat, as the attorneys in charge must do a great deal of research. This paper presents an improved hybrid model for legal case document classification. Legal case documents were first collected from an online domain. The collected documents were converted to text using the PDFMiner library in Python. The converted texts were loaded into a dataset table using the pandas library. The dataset was then pre-processed by removing noise and non-alphanumeric characters and by performing tokenization. The tokenized data was passed through principal component analysis (PCA) to select the important features. The selected features were used to train a long short-term memory (LSTM) model to classify the legal case documents. The system was designed using the Object-Oriented Analysis and Design method and implemented in the Python programming language. The LSTM model achieved an accuracy of 99% when evaluated on unseen legal case documents. The model was deployed in a web application for classifying legal documents. When tested with new documents, the application classified them adequately and greatly reduced the conflicting judgments experienced before the improved model was applied.
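The pre-processing step described in the abstract (removing non-alphanumeric characters, then tokenizing) can be illustrated with a minimal sketch. This is not the authors' code: the function names `clean_and_tokenize`, `build_vocab`, and `encode`, the integer-id encoding with padding, and the two sample strings are all assumptions made for illustration, since the abstract does not say how tokens were turned into numeric input.

```python
import re
from collections import Counter


def clean_and_tokenize(text):
    """Lowercase the text, replace non-alphanumeric characters with spaces,
    and split on whitespace. (Hypothetical helper, not from the paper.)"""
    text = text.lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    return text.split()


def build_vocab(token_lists, min_count=1):
    """Map each token to an integer id, most frequent first.
    Id 0 is reserved for padding and id 1 for unknown tokens (an assumption)."""
    counts = Counter(tok for toks in token_lists for tok in toks)
    vocab = {"<pad>": 0, "<unk>": 1}
    # Sort by descending count, then alphabetically, so ids are deterministic.
    for tok, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        if n >= min_count:
            vocab[tok] = len(vocab)
    return vocab


def encode(tokens, vocab, max_len):
    """Convert tokens to ids, truncate to max_len, and right-pad with zeros."""
    ids = [vocab.get(tok, vocab["<unk>"]) for tok in tokens][:max_len]
    return ids + [vocab["<pad>"]] * (max_len - len(ids))


# Invented sample strings; they are not taken from the paper's dataset.
docs = [
    "The appellant's appeal is DISMISSED (Suit No. 12/2019).",
    "Land dispute: title to the land is declared for the plaintiff!",
]
tokens = [clean_and_tokenize(d) for d in docs]
vocab = build_vocab(tokens)
encoded = [encode(t, vocab, max_len=12) for t in tokens]
```

Fixed-length integer sequences such as `encoded` are one common form of input for a sequence model. How the paper's pipeline connects tokenization to PCA and then to the LSTM is not specified in the abstract, so those later stages are left out of the sketch.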
