Integration of Data Cleansing Module Packages in DQM Tools Applications Using Open Source Tools

  • Rahadian Aldi Nugroho Telkom University

Abstract

Data is a source of information on decision making. Data with good quality can produce decisions that can make a company successful. To ensure that the existing data is of good quality, the company can perform data quality management. Data quality management is a set of activities carried out to ensure that the data to be used and processed is of good quality. In data quality management, there are a series of strategies, namely data profiling, data cleansing, data monitoring, and data integration. Data cleansing is the process of converting low-quality data into high-quality data. Data Quality Management Tools (DQM Tools) is an application that can perform the data cleansing process. In the DQM Tools application, there are also several cleansing modules that have not been installed and have not been integrated between the modules. In this study, we will discuss the adjustments made so that the existing and new data cleansing modules can be installed and integrated into the DQM Tools application. This research was also conducted using the Iterative Incremental method. The results obtained in this study are integrated and installed the data cleansing module in the DQM Tools application. The conclusion that can be drawn from this research is that the module can be integrated by adjusting the business processes in the application. The results of this study are very helpful for users when using the DQM Tools application when they want to do data cleansing.

References

[1] DAMA International, DAMA-DMBOK2 (Data Management Body of Knowledge), Basking Ridge, NJ 07920 USA: Technics Publications, 2017.
[2] S. Kramer, "V3B.com," 25 March 2015. [Online]. Available: https://v3b.com/2015/03/the-high-costs-of-dirty-data/#ixzz3Y2t602Ex.
[3] S. A. Noor, ANALISIS DAN PERANCANGAN DATA CLEANSING MENGGUNAKAN OPEN SOURCE TOOLS, 2019.
[4] H. A. Sulistyo, Analisis dan Perancangan Modul Data Cleansing Secara Generic Menggunakan Open Source Tools, 2020.
[5] T. F. K. E. N. A. K F Salmawati, "Carte server implementation for improving data quality management application performance in profiling module," 2021.
[6] F. Naumann, "Data Profiling Revisited," SIGMOD Record (Vol. 42 No. 4), 2013.
[7] Y. Yudhanto and H. A. Prasetyo, in Panduan Mudah Belajar Framework Laravel, Elex Media Komputindo, 2018, p. 18.
[8] Hitachi Vantara, "hitachivantara.com," [Online]. Available: https://www.hitachivantara.com/en-hk/products/data-management-analytics/pentaho-platform/pentaho-data-integration.html.
[9] A. B. Adel Alshamrani, "A Comparison Between Three SDLC Models Waterfall Model, Spiral Model, and Incremental/Iterative Model," 2015.
[10] E. Eilam, Reversing: Secrets of Reverse Engineering, Indianapolis, US: Wiley Publishing, 2005.
Published
2022-03-16