This paper aims to defne design patterns specifcally for data ingestion techniques within cloud-based architectures, addressing the challenges associated with high-volume data processing. The approach utilizes a fexible, metadata-driven framework that enhances adaptability and ease of use. This framework supports both incremental and full refresh methods, allowing for seamless changes to ingestion types, schema updates, table additions, and the incorporation of new data sources with minimal intervention from data engineers. The proposed design patterns were validated through experiments conducted on the Azure and Google Cloud platforms. The experiments demonstrate that the proposed design patterns signifcantly reduce data ingestion time, showcasing their efectiveness in managing high-volume data ingestion. This paper contributes to the feld of data management by presenting a comprehensive defnition of design patterns tailored for data ingestion in cloudbased architectures, efectively addressing key challenges in high-volume data processing.

Enhancing Data Ingestion Efficiency in Cloud-Based Systems: A Design Pattern Approach

Rucco, Chiara
;
Longo, Antonella;Saad, Motaz
2025-01-01

Abstract

This paper aims to defne design patterns specifcally for data ingestion techniques within cloud-based architectures, addressing the challenges associated with high-volume data processing. The approach utilizes a fexible, metadata-driven framework that enhances adaptability and ease of use. This framework supports both incremental and full refresh methods, allowing for seamless changes to ingestion types, schema updates, table additions, and the incorporation of new data sources with minimal intervention from data engineers. The proposed design patterns were validated through experiments conducted on the Azure and Google Cloud platforms. The experiments demonstrate that the proposed design patterns signifcantly reduce data ingestion time, showcasing their efectiveness in managing high-volume data ingestion. This paper contributes to the feld of data management by presenting a comprehensive defnition of design patterns tailored for data ingestion in cloudbased architectures, efectively addressing key challenges in high-volume data processing.
File in questo prodotto:
File Dimensione Formato  
s41019-025-00300-2.pdf

accesso aperto

Tipologia: Versione editoriale
Licenza: Creative commons
Dimensione 1.73 MB
Formato Adobe PDF
1.73 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11587/556687
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact