Comparing the impact of virtual distributed file system Alluxio on the performance of SQL queries done in Hive, Spark and Presto
Panarin, Sergei (2021)
Panarin, Sergei
2021
Bachelor's Programme in Science and Engineering
Tekniikan ja luonnontieteiden tiedekunta - Faculty of Engineering and Natural Sciences
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
Hyväksymispäivämäärä
2021-05-18
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202104263503
https://urn.fi/URN:NBN:fi:tuni-202104263503
Tiivistelmä
Data architecture in the cloud is an important topic nowadays and finding solutions to improve data accessibility and performance of various applications working with databases is a crucial task for many businesses and researchers around the globe. In this work the basic performance of 3 SQL architectures integrated with Alluxio VDFS was tested and compared. Tests were done in the cloud storage provided by S3. Alluxio showed increased performance compared to querying data directly from S3.
Kokoelmat
- Kandidaatintutkielmat [8935]