Categories
Ongoing Projects

Federated Queries

An Optimized Way to Query Your Cross Region Stored Data

The phrase “Federated Query” refers to a group of capabilities that allow users and systems to conduct searches across several siloed data sources without the need to transfer all of the data to a single system. A federated system can access data from database systems from several different vendors, in contrast to a distributed database system, which is a single vendor database system and can store data on multiple server instances on different hosts while treating all of the data as one database.

Companies produce tons of data every day and it seems to grow exponentially with time and become more complex. Storing, processing and analyzing such huge unstructured data becomes costlier and tedious process. Many companies shifted their local storages to Cloud services that provides them with advance features to handle and process such huge data and analyze them in an optimized way. Here, this project propose some of the federated queries strategies to query your data stored in different cloud service and in different regions. We have used Amazon Web Services (AWS) as the primary cloud service provider to query data stored in AWS Athena and Redshift, Google BigQuery and Microsoft Azure Synapse. We demonstrated some of the optimized ways to perform SQL join queries that incur less cost and improve query performance.

Project By:
Kirtan Pathak
Judd Bradbury