AWS Glue vs Qlik Talend Data Integration

Last Updated:

Our analysts compared AWS Glue vs Qlik Talend Data Integration based on data from our 400+ point analysis of ETL Tools, user reviews and our own crowdsourced data from our free software selection platform.

Qlik Talend Data Integration Software Tool

Product Basics

AWS Glue is a fully managed, event-driven serverless computing platform that extracts, cleanses and organizes data for insights. Automatic code generation ensures citizen data scientists and power users can create and schedule integration workflows. An event-driven architecture enables setting triggers to launch data integration processes.

A common data catalog with automatic schema generation ensures data is unique and easily accessible. With streaming data integration, it catalogs assets from datastores like Amazon S3, making it available for querying with Amazon Athena and Redshift Spectrum. Developers can access readymade endpoints to edit and test code.

Pros
  • Serverless & Scalable
  • Easy Visual Workflow
  • Built-in Data Connectors
  • Pay-per-Use Pricing
  • AWS Ecosystem Integration
Cons
  • Complex Transformations
  • Limited On-Premise Data
  • Python & Scala Only
  • Potential Cost Overruns
  • AWS Lock-in Concerns
read more...
Qlik Talend Data Integration empowers organizations to unify, govern, and integrate data from diverse sources. It shines for businesses grappling with data fragmentation and seeking to establish a trusted data foundation for analytics, AI, and informed decision-making. Key benefits include streamlined data pipelines, automated data quality checks, and real-time data synchronization. Notably, its visual drag-and-drop interface fosters ease of use, while pre-built connectors and code-free transformations accelerate development. Compared to similar tools, user experiences highlight Talend's scalability, flexibility, and robust data governance capabilities. While pricing varies based on specific needs and deployment options, expect a tiered subscription model with per-user or per-month charges.
read more...
$0.44/M-DPU-Hour
Free Trial is unavailable →
Get a free price quote
Tailored to your specific needs
$100 Monthly
Get a free price quote
Tailored to your specific needs
Small 
i
Medium 
i
Large 
i
Small 
i
Medium 
i
Large 
i
Windows
Mac
Linux
Android
Chromebook
Windows
Mac
Linux
Android
Chromebook
Cloud
On-Premise
Mobile
Cloud
On-Premise
Mobile

Product Assistance

Documentation
In Person
Live Online
Videos
Webinars
Documentation
In Person
Live Online
Videos
Webinars
Email
Phone
Chat
FAQ
Forum
Knowledge Base
24/7 Live Support
Email
Phone
Chat
FAQ
Forum
Knowledge Base
24/7 Live Support

Product Insights

  • Effortless Data Integration: Streamline data movement across diverse sources like databases, applications, and cloud storage with pre-built connectors and automated schema discovery.
  • Simplified Data Preparation: Clean, transform, and enrich data with a visual drag-and-drop interface and built-in transformations, eliminating the need for complex coding.
  • Serverless Scalability: Forget infrastructure management! Glue seamlessly scales to handle massive data volumes without upfront provisioning or ongoing maintenance.
  • Cost-Effective Flexibility: Pay-per-use pricing based on actual resource consumption makes Glue ideal for both small and large data pipelines, optimizing your costs.
  • Seamless AWS Integration: Leverage the power of the AWS ecosystem! Glue effortlessly integrates with S3, Redshift, and other AWS services, creating a unified data pipeline within your existing infrastructure.
  • Improved Data Accessibility: Deliver prepared data to data lakes, data warehouses, and analytics platforms, democratizing access for data scientists, analysts, and business users.
  • Enhanced Collaboration: Share data pipelines and workflows with other users and teams, fostering collaboration and streamlining data-driven workflows.
  • Centralized Data Catalog: Maintain a single source of truth for your data assets with Glue Data Catalog, ensuring data consistency and discoverability.
  • Continuous Monitoring and Optimization: Track job performance, identify bottlenecks, and optimize your pipelines for efficiency with built-in monitoring and logging tools.
  • Future-Proof Data Infrastructure: Stay ahead of the curve with Glue's serverless architecture and cloud-native approach, adapting to your evolving data needs with ease.
read more...
  • Unified Data Foundation: Break down data silos and seamlessly integrate information from diverse sources like cloud apps, databases, and legacy systems, creating a single, trusted source for analysis and decision-making.
  • Enhanced Data Quality: Ensure data accuracy and consistency through automated cleansing, transformation, and validation processes, minimizing errors and boosting confidence in analytics.
  • Streamlined Data Pipelines: Design and automate dataflows with a drag-and-drop interface and pre-built connectors, accelerating data delivery and reducing manual tasks.
  • Real-time Insights: Gain access to up-to-date data through real-time data synchronization and event-driven triggers, enabling faster response times and agile decision-making.
  • Improved Data Governance: Implement robust data governance policies and access controls to ensure data security, compliance, and lineage tracking.
  • Increased Agility and Scalability: Respond quickly to changing business needs with a highly scalable platform that can handle growing data volumes and evolving data requirements.
  • Enhanced Collaboration: Foster collaboration across teams by providing a unified view of data and facilitating data sharing with intuitive dashboards and reports.
  • Reduced Costs and Improved Efficiency: Optimize data management processes, eliminate data silos, and reduce manual effort, leading to significant cost savings and improved operational efficiency.
  • Simplified Data Integration: Reduce the complexity of data integration with a user-friendly interface, code-free options, and pre-built components, making data accessible to users of all technical skill levels.
  • Data-Driven Decisions: Empower all levels of the organization to make informed decisions based on accurate, reliable, and readily available data, leading to improved business outcomes.
read more...
  • Console: Discover, transform and make available data assets for querying and analysis. Builds complex data integration pipelines; handles dependencies, filters bad data and retries jobs after failures. Monitor jobs and get task status alerts via Amazon Cloudwatch. 
  • Data Catalog: Gleans and stores metadata in the catalog for workflow authoring, with full version history. Search and discover desired datasets from the data catalog, irrespective of where they are located. Saves time and money – automatically computes statistics and registers partitions with a central metadata repository. 
  • Automatic Schema Discovery: Creates metadata automatically by gleaning schema, quality and data types through built-in datastore crawlers and stores it in the Data Catalog. Ensure up-to-date assets – run crawlers on a schedule, on-demand or based on event triggers. Manage streaming data schemas with the Schema Registry. 
  • Event-driven Architecture: Move data automatically into data lakes and warehouses by setting triggers based on a schedule or event. Extract, transform and load jobs with a Lambda function as soon as new data becomes available. 
  • Visual Data Prep: Prepare assets for analytics and machine learning through Glue DataBrew. Automate anomaly filtering, convert data to standard formats and rectify invalid values with more than 250 pre-designed transformations – no need to write code. 
  • Materialized Views: Create a virtual table from multiple different data sources by using SQL. Copies data from each source data store and creates a replica in the target datastore as a materialized view. Ensures data is always up-to-date by monitoring data in source stores continuously and updating target stores in real time. 
read more...
  • Visual Drag-and-Drop Interface: Design and manage data pipelines with a user-friendly interface that requires minimal coding, making data accessible to users of all skill levels.
  • Pre-built Connectors: Easily connect to hundreds of data sources, including cloud applications, databases, legacy systems, and APIs, with readily available connectors.
  • Code-Free Transformations: Perform data cleansing, filtering, and manipulation through intuitive wizards and pre-built components, avoiding complex coding and accelerating development.
  • Data Quality Management: Ensure data accuracy and consistency with automated profiling, cleansing, validation, and enrichment tools, promoting trust in downstream analytics.
  • Real-time Data Synchronization: Access and synchronize data in real-time with event-driven triggers and continuous data flows, enabling immediate insights and agile decision-making.
  • Data Catalog and Lineage Tracking: Maintain a comprehensive catalog of data assets and track data lineage across transformations, fostering data transparency and governance.
  • Job Scheduling and Orchestration: Automate data pipelines through scheduling and orchestration capabilities, including dependencies and error handling, ensuring smooth data delivery.
  • Data Profiling and Monitoring: Analyze data patterns and monitor job performance with robust data profiling and monitoring tools, facilitating informed decision-making and proactive issue resolution.
  • Scalable Architecture: Handle growing data volumes and complex data pipelines with a scalable architecture that supports cloud, on-premises, and hybrid deployments.
  • Data Masking and Security: Implement data security measures, including data masking and access controls, to protect sensitive information and comply with regulations.
  • Application and API Integration: Integrate data with applications and APIs to streamline workflows and automate data exchange, fostering better decision-making within specific business processes.
  • Machine Learning and Artificial Intelligence Integration: Leverage machine learning and AI capabilities within data pipelines for tasks like anomaly detection and data enrichment, driving deeper insights and automation.
read more...

Product Ranking

#9

among all
ETL Tools

#42

among all
ETL Tools

Find out who the leaders are

Analyst Rating Summary

88
87
100
84
92
92
62
88
Show More Show More
Data Delivery
Performance and Scalability
Platform Capabilities
Platform Security
Workflow Management
Metadata Management
Performance and Scalability
Platform Capabilities
Workflow Management
Data Quality

Analyst Ratings for Functional Requirements Customize This Data Customize This Data

AWS Glue
Qlik Talend Data Integration
+ Add Product + Add Product
Data Delivery Data Quality Data Sources And Targets Connectivity Data Transformation Metadata Management Platform Capabilities Workflow Management 100 92 62 90 96 100 100 84 92 88 71 100 100 93 0 25 50 75 100
100%
0%
0%
80%
0%
20%
85%
8%
7%
85%
8%
7%
36%
0%
64%
89%
0%
11%
88%
0%
12%
46%
42%
12%
90%
0%
10%
100%
0%
0%
100%
0%
0%
100%
0%
0%
100%
0%
0%
80%
10%
10%

Analyst Ratings for Technical Requirements Customize This Data Customize This Data

100%
0%
0%
100%
0%
0%
100%
0%
0%
91%
0%
9%

User Sentiment Summary

Great User Sentiment 165 reviews
Great User Sentiment 385 reviews
85%
of users recommend this product

AWS Glue has a 'great' User Satisfaction Rating of 85% when considering 165 user reviews from 3 recognized software review sites.

87%
of users recommend this product

Qlik Talend Data Integration has a 'great' User Satisfaction Rating of 87% when considering 385 user reviews from 3 recognized software review sites.

4.0 (46)
4.3 (109)
n/a
4.4 (181)
4.4 (109)
4.4 (95)
3.9 (10)
n/a

Awards

SelectHub research analysts have evaluated AWS Glue and concluded it earns best-in-class honors for Workflow Management.

Workflow Management Award

we're gathering data

Synopsis of User Ratings and Reviews

Cost-Effective & Serverless: Pay only for resources used, eliminates server provisioning and maintenance
Simplified ETL workflows: Drag-and-drop UI & auto-generated code for easy job creation, even for non-programmers
Data Catalog: Unified metadata repository for seamless discovery & access across various data sources
Flexible Data Integration: Connects to diverse data sources & destinations (S3, Redshift, RDS, etc.)
Built-in Data Transformations: Apply pre-built & custom transformations within workflows for efficient data cleaning & shaping
Visual Data Cleaning (Glue DataBrew): Code-free data cleansing & normalization for analysts & data scientists
Scalability & Performance: Auto-scaling resources based on job needs, efficient Apache Spark engine for fast data processing
Community & Support: Active user community & helpful AWS support resources for problem-solving & best practices
Show more
Ease of Use: Drag-and-drop interface and pre-built connectors make data integration accessible even for non-technical users.
Faster Data Delivery: Streamlined pipelines and automated transformations accelerate data integration processes.
Improved Data Quality: Built-in cleansing and validation tools ensure data accuracy and consistency.
Scalability and Flexibility: Handles growing data volumes and adapts to diverse data integration needs.
Strong Data Governance: Robust security features and data lineage tracking promote data privacy and compliance.
Show more
Limited Customization & Control: Visual interface and pre-built transformations may not be flexible enough for complex ETL needs, requiring manual coding or custom Spark jobs.
Debugging Challenges: Troubleshooting Glue jobs can be complex due to limited visibility into underlying Spark code and distributed execution, making error resolution time-consuming.
Performance Limitations for Certain Workloads: Serverless architecture may not be optimal for latency-sensitive workloads or large-scale data processing, potentially leading to bottlenecks.
Vendor Lock-in & Portability: Migrating ETL workflows from Glue to other platforms can be challenging due to its proprietary nature and lack of open-source compatibility.
Pricing Concerns for Certain Use Cases: Pay-per-use model can be expensive for long-running ETL jobs or processing massive datasets, potentially exceeding budget constraints.
Show more
Learning Curve for Advanced Features: Mastering complex transformations and customizations can require significant effort.
Performance with Large Datasets: Processing and optimizing very large datasets can be resource-intensive.
Limited Code-Free Customization: Some advanced data manipulations may necessitate manual coding for complete flexibility.
Pricing Complexity: Understanding the tiered pricing structure and potential hidden costs can be challenging.
Smaller Community Support: The user community is smaller compared to some competitors, potentially limiting available resources and peer support.
Show more

User reviews of AWS Glue paint a picture of a powerful and user-friendly ETL tool for the cloud, but one with limitations. Praise often centers around its intuitive visual interface, making complex data pipelines accessible even to non-programmers. Pre-built connectors and automated schema discovery further simplify setup, saving users time and effort. Glue's serverless nature and tight integration with the broader AWS ecosystem are also major draws, offering seamless scalability and data flow within a familiar environment. However, some users find Glue's strength in simplicity a double-edged sword. For complex transformations beyond basic filtering and aggregation, custom scripting in Python or Scala is required, limiting flexibility for those unfamiliar with these languages. On-premise data integration is another pain point, with Glue primarily catering to cloud-based sources. This leaves users seeking hybrid deployments or integration with legacy systems feeling somewhat stranded. Cost also arises as a concern. Glue's pay-per-use model can lead to unexpected bills for large data volumes or intricate pipelines, unlike some competitors offering fixed monthly subscriptions. Additionally, Glue's deep integration with AWS can create lock-in anxieties for users worried about switching cloud providers in the future. Overall, user reviews suggest Glue shines in cloud-based ETL for users comfortable with its visual interface and scripting limitations. Its scalability, ease of use, and AWS integration are undeniable strengths. However, for complex transformations, on-premise data needs, or cost-conscious users, alternative tools may offer a better fit.

Show more

Reviewing user experiences from the past year for Qlik Talend Data Integration paints a mixed picture. Many users praise its user-friendly interface, pre-built connectors, and streamlined data pipelines, making it accessible even for non-technical professionals. This ease of use, coupled with faster data delivery and improved data quality through automated cleansing, is a major draw for businesses seeking to break down data silos and gain quicker insights. However, some users highlight limitations that can be dealbreakers for complex data integration needs. Mastering advanced features and customizations can require a steep learning curve, and performance with exceptionally large datasets may necessitate optimization and resource allocation. Additionally, while code-free options are extensive, some advanced transformations may require manual coding for complete flexibility. When compared to similar products, Qlik Talend shines in its visual drag-and-drop interface and robust data governance capabilities. Users also appreciate its strong scalability and flexibility, making it adaptable to diverse data integration needs. However, some find its pricing structure complex and its community support smaller compared to competitors. Ultimately, the decision to choose Qlik Talend Data Integration hinges on individual needs and priorities. For businesses seeking a user-friendly and efficient solution for basic to moderate data integration tasks, it's a strong contender. However, those with complex data landscapes and specialized requirements may need to weigh its strengths and limitations against other options before making a final call.

Show more

Screenshots

Top Alternatives in ETL Tools


Azure Data Factory

Cloud Data Fusion

Dataflow

DataStage

Fivetran

Hevo

IDMC

Informatica PowerCenter

InfoSphere Information Server

Integrate.io

Oracle Data Integrator

Pentaho

Qlik Talend Data Integration

SAP Data Services

SAS Data Management

Skyvia

SQL Server

SQL Server Integration Services

Talend

TIBCO Cloud Integration

Related Categories

Head-to-Head Comparison

WE DISTILL IT INTO REAL REQUIREMENTS, COMPARISON REPORTS, PRICE GUIDES and more...

Compare products
Comparison Report
Just drag this link to the bookmark bar.
?
Table settings