Meet your enterprise ETL needs.


IBM InfoSphere DataStage is a leading ETL platform that integrates data across multiple enterprise systems. It leverages a high-performance parallel framework, available on-premises or in the cloud. The scalable platform provides extended metadata management and enterprise connectivity.

Understand, cleanse, monitor and transform your data across multiple systems.


InfoSphere DataStage is the data integration component of IBM InfoSphere Information Server. It provides a graphical framework for developing the jobs that move data from source systems to target systems. The transformed data can be delivered to data warehouses, data marts and operational data stores; to real-time web services and messaging systems; and to other enterprise applications. InfoSphere DataStage supports both extract, transform, and load (ETL) and extract, load, and transform (ELT) patterns. It uses parallel processing and enterprise connectivity to scale with data volume.
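The difference between the ETL and ELT patterns mentioned above can be sketched in a few lines. The example below is a generic illustration in Python with the standard-library `sqlite3` module; it does not use or represent any DataStage API. The table names, the sample rows, and the `clean` helper are all hypothetical. In ETL the rows are transformed before they reach the target; in ELT the raw rows are loaded first and then transformed inside the target with SQL.

```python
import sqlite3

# Hypothetical source rows: (customer name, amount as text).
SOURCE_ROWS = [(" Alice ", "10.50"), ("BOB", "3"), ("carol", "7.25")]


def clean(name, amount):
    """Transform step: normalize the name and parse the amount."""
    return name.strip().title(), float(amount)


def run_etl(conn):
    """ETL: transform outside the target, then load the result."""
    conn.execute("CREATE TABLE sales_etl (name TEXT, amount REAL)")
    transformed = [clean(n, a) for n, a in SOURCE_ROWS]
    conn.executemany("INSERT INTO sales_etl VALUES (?, ?)", transformed)


def run_elt(conn):
    """ELT: load raw rows first, then transform inside the target with SQL."""
    conn.execute("CREATE TABLE sales_raw (name TEXT, amount TEXT)")
    conn.executemany("INSERT INTO sales_raw VALUES (?, ?)", SOURCE_ROWS)
    conn.execute(
        "CREATE TABLE sales_elt AS "
        "SELECT upper(substr(trim(name), 1, 1)) "
        "       || lower(substr(trim(name), 2)) AS name, "
        "       CAST(amount AS REAL) AS amount "
        "FROM sales_raw"
    )


def load_both():
    """Run both patterns against one in-memory database and return the results."""
    conn = sqlite3.connect(":memory:")
    run_etl(conn)
    run_elt(conn)
    etl = conn.execute("SELECT name, amount FROM sales_etl ORDER BY name").fetchall()
    elt = conn.execute("SELECT name, amount FROM sales_elt ORDER BY name").fetchall()
    conn.close()
    return etl, elt
```

Both paths end with the same cleaned rows; what differs is where the transformation work runs, which is the main consideration when choosing between the two patterns.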

Key Features


  • Enhance your enterprise with end-to-end ETL. Ensure the data that drives your business and strategy is trusted, consistent, shareable and governed. Understand, cleanse, monitor, transform and deliver your data, while simultaneously bridging the gap between business and IT.
  • Provide trusted ETL data anytime, anywhere. Rapidly provision new ETL environments in the cloud or on-premises. Apply sophisticated rules and use the open architecture to govern your data.
  • Provide fast, high-performance access to trusted data. Use the massively parallel processing engine to run natively in Hadoop and access data where it resides.
  • Enforce workload and business rules. Optimize hardware utilization and prioritize mission-critical tasks.
  • Integrate data quickly and easily with other cloud environments. Integrate directly with Amazon Simple Storage Service (S3) to move data to and from the cloud, and subsequently integrate with other cloud database technologies.
  • Run connectivity, transformation and data delivery features natively in Hadoop. Gain simplified access to HDFS files in various formats and character sets, with support for security features such as Kerberos and secure gateways.

Additional Resources


What’s new in IBM InfoSphere Information Server 11.7
Download the flyer

Why IBM InfoSphere DataStage?


  • Collect, integrate and transform large volumes of data with a powerful ETL platform.
  • Rapidly provision new ETL environments in the cloud or on-premises.
  • Build, deploy, update and manage your ETL infrastructure with greater speed, flexibility and effectiveness.
  • Leverage new data sources more efficiently with HBase and Hive connectors along with Amazon and MongoDB support.

To learn more about what IBM InfoSphere DataStage can do for your company, contact us.