As discussed in a previous post, there are many reasons to move your BI to the cloud.
Security, the ability to work from anywhere, and faster delivery with more flexible resources at a lower cost are just a few.
In this post, we'll have a look at some of the key components of a BI architecture on AWS.
The focus for this post will be on the 'traditional BI' components. Components for big data and data science will be discussed in later posts.
First of all, you'll need to build your infrastructure. There is at least one AWS equivalent for every component in your physical, on-premises infrastructure.
AWS Athena is a service that allows quick, ad hoc SQL querying directly on your data in S3.
There's no need to develop ETL or to build a data warehouse. All Athena requires is a table structure defined over your CSV, JSON, log or other files in S3, and you're good to go.
Data sources defined in Athena can be used in QuickSight for visualization.
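To make the "table structure over files in S3" idea concrete, here is a minimal sketch in Python that only builds the DDL text Athena would need for a set of comma-separated files. The helper name `athena_csv_table_ddl`, the database, table, column and bucket names are all hypothetical and chosen for illustration; the statement itself would still have to be run in Athena (for example from the query editor).

```python
def athena_csv_table_ddl(database, table, columns, s3_location, skip_header=True):
    """Build a CREATE EXTERNAL TABLE statement for comma-separated files in S3.

    `columns` is a list of (name, sql_type) pairs. All names are illustrative.
    """
    if not s3_location.startswith("s3://"):
        raise ValueError("s3_location must start with s3://")
    if not columns:
        raise ValueError("at least one column is required")

    # Athena expects the location to point at a folder (prefix), not a single file.
    location = s3_location if s3_location.endswith("/") else s3_location + "/"

    column_lines = ",\n".join(f"  `{name}` {sql_type}" for name, sql_type in columns)
    parts = [
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (",
        column_lines,
        ")",
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','",
        "STORED AS TEXTFILE",
        f"LOCATION '{location}'",
    ]
    if skip_header:
        # Skip the first line of each file, assuming it holds column headers.
        parts.append("TBLPROPERTIES ('skip.header.line.count'='1')")
    return "\n".join(parts) + ";"


if __name__ == "__main__":
    # Hypothetical example: web request logs stored as CSV files.
    print(
        athena_csv_table_ddl(
            database="weblogs",
            table="requests",
            columns=[("request_time", "string"), ("url", "string"), ("status", "int")],
            s3_location="s3://example-bucket/logs",
        )
    )
```

Once a table like this exists, the files can be queried with regular SQL, and the same table can serve as a data source for QuickSight.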
AWS currently provides two ETL services: Data Pipeline and Glue.
One of the key components in a modern BI or analytics architecture is an analytical database or column store.
Redshift is the AWS service that provides a fully managed, distributed analytical database for your data warehouse. Redshift allows you to start small and grow both the number of nodes in the cluster and the complexity of your Redshift implementation as your data grows. As with other column stores, there's no need to constantly create and maintain indexes to keep your data warehouse performance acceptable. Most queries will return within seconds.
Redshift can be used as a source for your visualization platform of choice, or with AWS QuickSight.
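As an illustration of working without indexes, the sketch below builds a Redshift table definition that declares a distribution key and a sort key instead. This is one possible setup under stated assumptions, not a recommendation for any specific workload: the helper `redshift_table_ddl` and the table and column names are hypothetical, and the right keys depend on your own data and queries.

```python
def redshift_table_ddl(table, columns, dist_key, sort_keys):
    """Build a CREATE TABLE statement with a distribution key and sort key.

    `columns` is a list of (name, sql_type) pairs. All names are illustrative.
    """
    known = {name for name, _ in columns}
    missing = [key for key in [dist_key, *sort_keys] if key not in known]
    if missing:
        raise ValueError(f"key columns not defined in the table: {missing}")

    column_lines = ",\n".join(f"  {name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE TABLE {table} (\n{column_lines}\n)\n"
        "DISTSTYLE KEY\n"                      # distribute rows across nodes by one column
        f"DISTKEY ({dist_key})\n"
        f"SORTKEY ({', '.join(sort_keys)});"   # store rows ordered by these columns
    )


if __name__ == "__main__":
    # Hypothetical fact table for sales data.
    print(
        redshift_table_ddl(
            table="fact_sales",
            columns=[
                ("sale_date", "DATE"),
                ("customer_id", "INTEGER"),
                ("amount", "DECIMAL(12,2)"),
            ],
            dist_key="customer_id",
            sort_keys=["sale_date"],
        )
    )
```

There is no CREATE INDEX step in this sketch; the distribution and sort keys are declared once, as part of the table definition.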
QuickSight is an AWS visualization and analysis service. Although QuickSight is not a complete replacement for most full BI platforms, it does allow you to quickly develop ad hoc visualizations and analyses on a variety of data sources. Once development is done, visualizations can be distributed to large numbers of users, who can view them in their browser or on their mobile device.
AWS has all the required components to build full BI or analytics projects. Operating in the cloud may require changes in the way you're used to operating, but it also opens up plenty of opportunities for scalability and flexibility that are not possible on-premises or in a self-managed data center.
Contact us to find out how we can help you become successful with your cloud analytics!