Home | Articles
Third Generation ETL: Delivering the Best Performance (Part 2)

by Yves de Montcheuil

White Paper : Data Warehouse Appliances
A data warehouse appliance uses commodity processors and hardware optimized for BI and combines them with software built specifically for data warehouse-based analytics in a single box. This paper examines the value of Data Warehouse Appliances. Download

The process of selecting an ETL (extract, load, transfer) software solution is typically a complex one, during which many features need to be evaluated. One of the most critical criteria that must be met is performing well in a given environment and configuration.

Many vendors of ETL software will conclude their sales pitch by giving numbers – always very impressive – regarding the performance of their solution. They are almost invariably provided in the form: such tool can transfer so many rows per time unit.

Many users, however, have been misled by impressive-looking performance numbers that turned out to be less than impressive in real life. Why? Because performance is one of the most difficult elements to evaluate without conducting a full-scale evaluation. Indeed, performance in the production environment is significantly affected by the overall architecture of the information system and by the flow of data during the ETL process.

In Part 1 we looked at the different generations of ETL software and the architecture of third generation ETL products. Part 2 looks in more detail at some of the essential characteristics of this architecture.

Flexibility

The flexibility of the architecture is a key component of the actual performance of ETL processes. It is imperative to be able to reorganize the architecture as dictated by the technical environment – the ETL tool should not dictate the architecture. Here are some typical choices that must be left open because the optimal answer may differ for each project:
  • Which database engine will be used for the transformations? Will it be the source database engine, the target, or another one?
  • Is it acceptable to have processing data mixed in with production data?
  • How easy is it to redefine the location of your databases?
  • What happens to ETL developments if the physical address of a server changes or if a database name changes?
  • How will the code be promoted from the development environment to the production environment?

Resource Center
Business Intelligence
Oracle #1 in Business Analytics According to IDC Research
BI Search
The Business Intelligence Search Engine has all the answers.
Business Intelligence Search Engine
Find all you need on The Business Intelligence Search Engine.
Add a Link Add a Link

  
Other Articles by this Author

Convergence of ETL and Application Integration

ETL: Why abandon SQL when it will give you the best performance?

Data Integration: The Code Generator Approach

Third Generation ETL: Delivering the Best Performance (Part 2)

Third Generation ETL: Delivering the Best Performance (Part 1)

SQL is the “New” Industry Standard Language for ETL





Designing and Implementing Business Intelligence Solutions Using Microsoft SQL Server. Click for details.
Business Intelligence Solution Finder

What do you need?

Location of solution provider

What type of solution are you interested in?

Are you interested in a specific solution?                      


All product names are trademarks of their respective companies.
Copyright © ITNetwork365 - All Rights Reserved