What are the components of DataStage?

What are the components of DataStage?

Three components comprise the DataStage server:

  • Repository. The Repository stores all the information required for building and running an ETL job.
  • DataStage Server. The DataStage Server runs jobs that extract, transform, and load data into the warehouse.
  • DataStage Package Installer.

What are the stages in DataStage?

DataStage provides three types of stages:

  • Server Job Database Stages.
  • Server Job File Stages.
  • Dynamic Relational Stages.
  • Processing Stages.

What language is DataStage written in?

At the lowest levels the DataStage server engine is largely written in C. Once upon a time there was some FORTRAN and PL/I in there too, but I believe that’s no longer the case. Only the “public” levels are written in DataStage BASIC.

What is DataStage application?

DataStage (DS) is an ETL tool that can extract data, transform it, apply business principles and then load it to any specific target. It is a part of IBM’s Information Platforms Solutions suite and also that of InfoSphere. DataStage makes use of graphical notations for constructing data integration solutions.

What is the purpose of DataStage?

DataStage is an ETL tool used to extract, transform, and load data from the source to the target destination. The source of these data might include sequential files, indexed files, relational databases, external data sources, archives, enterprise applications, etc.

What is DataStage designer?

The DataStage Designer is the primary interface to the metadata repository and provides a graphical user interface that enables you to view, edit, and assemble DataStage objects from the repository needed to create an ETL job. An ETL job should include source and target stages.

What is hash partitioning in DataStage?

Hash partitioner Partitioning is based on a function of one or more columns (the hash partitioning keys) in each record. The hash partitioner examines one or more fields of each input record (the hash key fields). Records with the same values for all hash key fields are assigned to the same processing node.

Who invented DataStage?

IBM InfoSphere DataStage

Original author(s) Lee Scheffler
Stable release 11.x
Platform ETL Tool
Type Data integration
Website http://www.ibm.com

How do I create a DataStage job?

Perform the following steps to build a job:

  1. Define optional project-level environment variables in DataStage Administrator.
  2. Define optional environment parameters.
  3. Import or create table definitions, if they are not already available.
  4. Add stages and links to the job to indicate data flow.