The SAS interface library

The SAS operators make it possible to execute SAS applications using the computing power and data handling capabilities of InfoSphere® DataStage®'s parallel processing system.

You need to make only minor modifications to your existing SAS applications for execution within InfoSphere DataStage. The topic Parallelizing SAS steps describes these modifications.

The InfoSphere DataStage SAS Interface Library contains four operators:

  • sasin. This operator converts a standard InfoSphere DataStage data set into a SAS data set capable of being processed in parallel by the sas and sasout operators. The sasin operator options include an option for defining the input data-set schema.
  • sas. This operator allows you to execute part or all of a SAS application in parallel. It takes SAS code in the form of DATA and PROC steps as its argument.
  • sasout. This operator converts the SAS data set output by the sas operator to the standard data set format. The operator has a number of options including the -schema and -schemaFile options. You use either of these options to specify the schema of the output InfoSphere DataStage data set.
  • sascontents. This operator generates a report about a self-describing input data set. The report is similar to the report generated by the SAS procedure PROC CONTENTS.