Transforming Library Services with Metadata Lakes: Leveraging Big Data and AI for Enhanced Resource Management

Authors

  • Arti Sawale Department of Library and Information Science, University of Delhi, Delhi, India
  • Paramjeet Kaur Walia Department of Library and Information Science, University of Delhi, Delhi, India

DOI:

https://doi.org/10.70112/ajist-2025.15.2.4393

Keywords:

Metadata Management, Big Data, Metadata Lake, Library Systems, Artificial Intelligence (AI)

Abstract

This study explores the transformative role of metadata in the management of library resources and services, particularly within the context of big data. Metadata enhances data discoverability, indexing, and automation, which, in turn, improves searchability, analytics, and decision-making processes. Libraries are facing an explosion of big data, diverse formats, and evolving user expectations, necessitating a shift in how metadata is managed. In response, this paper proposes a theoretical framework for implementing a “metadata lake” within libraries operating in a big data environment. The concept of a metadata lake is a novel approach to metadata management, and this study aligns it with the characteristics of big data to demonstrate its potential in library systems. Metadata management in libraries has evolved to fit into the big data ecosystem, creating opportunities for knowledge discovery, AI-driven automation, and predictive analytics. By adopting big data technologies, libraries can enhance access, efficiency, data transfer and research impact. The metadata lake concept offers scalability, AI-driven enrichment, and interoperability, allowing libraries to leverage big data and semantic technologies to improve the storage, processing, and retrieval of metadata. The integration of AI-driven recommendations will further enhance content discovery and user engagement. This paper highlights the importance of adopting advanced analytics, linked open data, and semantic technologies in library metadata management. It presents the metadata lake as the future of library metadata, ensuring libraries remain adaptable and capable of contributing to the big data landscape. Furthermore, future research could explore the use of machine learning (ML) and natural language processing (NLP) to automate metadata creation, enrich bibliographic records, and improve classification accuracy, ensuring that libraries continue to evolve in line with technological advancements.

References

Attanasio, P. (2022). New challenges in metadata management between publishers and libraries. DOAJ: Directory of Open Access Journals, 13(1).

Boukraa, D., Bala, M., & Rizzi, S. (2024). Metadata management in data lake environments: A survey. Journal of Library Metadata, 24(4), 215–274. https://www.researchgate.net/publication/382277284_Metadata_Management_in_Data_Lake_Environments_A_Survey.

Feng, F., & Shri Varsheni, R. (n.d.). How metadata lakes empower next-gen AI/ML applications. Zilliz. Retrieved January 23, 2025, from https://zilliz.com/blog/how-metadata-lakes-empower-next-gen-ai-ml-apps.

Himpe, C. (2024). DatAasee: A metadata-lake as metadata catalog for a virtual data-lake. [This entry is incomplete. A journal title, conference proceedings, or publisher is needed for a complete citation.]

Löfler, F., Wesp, V., König-Ries, B., & Klan, F. (2021). Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs? PLOS ONE, 16(3), e0246099. https://doi.org/10.1371/journal.pone.0246099.

Oladokun, B. D., & Gaitanou, P. (2024). Leveraging open data for reference services delivery in academic libraries. Library Hi Tech News, 41(4), 12–14. https://doi.org/10.1108/LHTN-07-2023-0112.

Rousidis, D., Garoufallou, E., Balatsoukas, P., & Sicilia, M.-A. (2014). Metadata for Big Data: A preliminary investigation of metadata quality issues in research data repositories. Information Services & Use, 34

(3-4), 279–286. https://doi.org/10.3233/ISU-140746.

Sawadogo, P., Kibata, T., & Darmont, J. (2019). Metadata management for textual documents in data lakes. Proceedings of the 21st International Conference on Enterprise Information Systems, 72–83. https://www.scitepress.org/Papers/2019/77063/77063.pdf.

Tani, A., Candela, L., & Castelli, D. (2013). Dealing with metadata quality: The legacy of digital library efforts. Information Processing & Management, 49(6), 1194–1205. https://www.sciencedirect.com/science/article/abs/pii/S0306457313000526.

Wang, Z., Chen, H., Zhang, C., Lu, W., & Wu, J. (2023). JCDL2023 Workshop: Innovation Measurement for Scientific Communication (IMSC) in the Era of Big Data. 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 303–305. https://ieeexplore.ieee.org/document/10266056.

Downloads

Published

10-12-2025

How to Cite

Arti Sawale, & Kaur Walia , P. (2025). Transforming Library Services with Metadata Lakes: Leveraging Big Data and AI for Enhanced Resource Management. Asian Journal of Information Science and Technology, 15(2), 62–66. https://doi.org/10.70112/ajist-2025.15.2.4393

Similar Articles

<< < 18 19 20 21 22 23 24 25 26 27 > >> 

You may also start an advanced similarity search for this article.