Amazon data area is a data management service that makes it faster and easier for customers to catalog, discover, share, and manage data stored on AWS, on-premises, and from third-party sources. Amazon DataZone recently announced the expansion of data analysis and visualization options for data subscribed to your project in Amazon DataZone using Amazon Athena JDBC driver.
Working closely with our partners, we tested and validated Amazon DataZone authentication via Athena JDBC login, providing an intuitive and secure login experience for users. With this integration, you can now seamlessly query your governed data lake assets in Amazon DataZone using popular business intelligence (BI) and analytics tools, including partner solutions like Tableau.
Ali Tore, senior vice president of advanced analytics at Salesforce, highlighting the value of this integration, says
“We are excited to partner with Amazon to bring Tableau’s powerful AI-driven data mining and analytics capabilities to customers managing data across organizational boundaries with Amazon DataZone. This integration allows our customers to seamlessly explore data with AI in Tableau, create visualizations, and uncover hidden insights in their governed data, while leveraging Amazon DataZone to catalog, discover, share and govern data on AWS, on-premises and from third parties. -party sources – improving both governance and decision-making.
With this launch, Amazon DataZone reinforces its commitment to providing enterprise customers with secure, governed access to data across the tools and platforms they rely on. For example, Guardant Health uses Amazon DataZone to democratize data access across its organization, enabling diverse teams to effectively access, query, and analyze data tailored to their specific needs.
Rajesh Kucharlapati, senior director of data, CRM and analytics at Guardant Health, says
“By harmonizing data across multiple business domains, we foster a culture of data sharing. Using Amazon DataZone allows us to avoid building and maintaining an internal platform, allowing our developers to focus on bespoke solutions. Leveraging the AWS Managed Service was crucial to enabling us to access business insights faster, apply standardized data definitions, and harness the potential of generative AI. We also needed a simple connection process for widely used analytics tools like Tableau, DBeaver, and Domino, directly within Amazon DataZone projects. This new JDBC connectivity feature allows our governed data to flow seamlessly throughout these tools, supporting the productivity of our teams.
Use cases
Amazon DataZone addresses your data sharing challenges and optimizes data availability. Here’s how to do it:
- Creation of data products – As a data producer, you can create and catalog data products while enforcing governance, making your data findable, accessible, interoperable and reusable (FAIR).
- Simplified access – As a data consumer, you can easily locate and subscribe to data from multiple sources within a single project. You can analyze this data using a variety of tools, including built-in AWS options like Amazon Athena, Amazon RedshiftAnd Amazon SageMaker.
- Integration with partner tools – Adding support for partner analytics tools gives you greater flexibility and efficiency in your workflows. Now you can use the tool of your choice, including Tableau, to quickly extract business insights from your data while using standardized definitions and decentralized ownership. Refer the detailed blog article about how you can use it to connect through various other tools.
Prerequisites
To get started, follow these steps:
- Download and install the latest Athena JDBC driver for Tableau.
- Copy the JDBC connection string from the Amazon DataZone portal into the JDBC connection configuration to establish a connection from Tableau. This will ask you to authenticate using single sign-on with your corporate credentials.
When you’re signed in, you can query, visualize, and share data (governed by Amazon DataZone) within Tableau.
The following diagram shows the high-level architecture of the Tableau integration.
Solution Overview: Configure Tableau to Access Data Resources Subscribed to the Project
To configure Tableau to access data resources subscribed to the project, follow these detailed steps:
- Download the latest Athena driver. If Tableau has the Athena driver preinstalled, it may be the older version (v2). To confirm compatibility with Amazon DataZone, you will need the latest driver (v3) which includes the necessary authentication features. To download the latest version x of the JDBC driver, visit Athena JDBC 3.x driver.
- Install the driver. Copy the JDBC driver file to the appropriate folder for your operating system:
- For macOS:
~/Library/Tableau/Drivers
- For Windows:
C:\Program Files\Tableau\Drivers
- For macOS:
- On the Amazon DataZone console, select your project, as shown in the following DataZone console screenshot.
- To capture JDBC connection parameters, follow these steps:
- On the project page, review the connection options under ANALYSIS TOOLS. Choose Connect with JDBC.
- In the JDBC settings dialog box, select Use IDC authentication and copy it JDBC URL. Optionally you can use Use IAM authentication to connect to your Amazon DataZone project as AWS Identity and Access Management (IAM) (from a server), provided you are added as a project member within this project. The following screenshot shows the dialog box.
- On the project page, review the connection options under ANALYSIS TOOLS. Choose Connect with JDBC.
- To configure the Tableau desktop for connection, follow these steps:
- On the To a server login menu, select Other databases (JDBC).
- Paste the copied JDBC URL into the URL field, leaving the other fields (Dialect, User name, Password) unchanged.
- On the To a server login menu, select Other databases (JDBC).
- To sign in with single sign-on, choose Log inas shown in the following screenshot. You will be redirected to authenticate with AWS IAM Identity Center. Use your AWS single sign-on account credentials.
- Once logged in, you will be prompted to authorize the
DataZoneAuthPlugin
. Choose Allow access to allow access to Amazon DataZone from Tableau, as shown in the following screenshot. - Once the connection is established, a success message will appear as shown in the following screenshot.
You can now view data subscribed to your project directly in Tableau and create dashboards.
Conclusion
Amazon DataZone continues to expand its offerings, giving you more flexibility in how you access, analyze, and visualize your subscribed data. With support for the Athena JDBC driver, you can now use a wide range of popular BI and analytics tools, including Tableau, making the data governed within Amazon DataZone more accessible than ever.
In this article, you learned how recent enhancements to Amazon DataZone make it easier to seamlessly connect with Tableau. By integrating Tableau with Amazon DataZone’s comprehensive data governance capabilities, we enable data consumers to quickly and seamlessly explore and analyze their governed data. This integration helps organizations break down silos, foster collaboration and make informed decisions, while maintaining the security and control needed in today’s complex and distributed data landscape.
The feature is supported in all AWS commercial regions where Amazon DataZone is currently available. Watch the video below and the detailed blog article to learn how to connect Amazon DataZone to external analytics tools via JDBC. Start with our technique documentation.
Related blog posts
About the authors
Ramesh H Singh is a Senior Technical Product Manager (External Services) at AWS in Seattle, Washington, currently on the Amazon DataZone team. He is passionate about creating high-performance ML/AI and analytics products that enable enterprise customers to achieve their critical objectives using cutting-edge technology. Connect with him on LinkedIn.
Adiascar Cisneros is a Tableau Senior Product Manager based in Atlanta, Georgia. He focuses on integrating Tableau Platform with AWS services to amplify the value users get from our products and accelerate their journey to valuable, actionable insights. His experience includes analytics, infrastructure, network security and migrations. Follow him on LinkedIn.
Joel Farvault is a Senior SA Analytics Specialist for AWS with 25 years of experience in enterprise architecture, governance and data analytics, primarily in the financial services industry. Joel has led data transformation projects on fraud analytics, claims automation, and master data management. He uses his experience to advise clients on their data strategy and technology foundations.
Yogesh Dhimate is a Senior Partner Solutions Architect at AWS, technology partnership leader with Tableau. Prior to joining AWS, Yogesh worked with leading companies, including Salesforce, to drive their industry solutions initiatives. With over 20 years of experience in product management and solutions architecture, Yogesh brings a unique perspective in cloud computing and artificial intelligence.
Ariana Rahgozar is a Senior Senior Solutions Architect at AWS, leading customers in the design and implementation of technical solutions as part of their cloud journey.