Step 2: Configuring the Unstructured Data stage

Configure the Unstructured Data stage to extract data from multiple Microsoft Excel sheets.

Procedure

  1. Double-click the Unstructured Data stage to open the stage properties.
  2. Click Configure.
  3. In the Configuration window, specify the full file path of the Microsoft Excel input file Employee3.xls.
  4. Specify the data to extract from the Departments spreadsheet, and complete the following substeps to generate the column mappings:
    1. From the Link list box, select Departments.
    2. From the Range option list, select Specify the entire range.
    3. In the Range expression field, specify Departments!A2:C6.
    4. For Column header, select First row of data ranges.
    5. Click Load. The Excel columns in the specified data range are listed in the Import pane.
  5. Specify the data to extract from the Employees spreadsheet, and complete the following substeps to generate the column mappings:
    1. From the Link list box, select Employees.
    2. From the Range option list, select Specify the entire range.
    3. In the Range expression field, specify Employees!A2:L34.
    4. For Column header, select First row of data ranges.
    5. Click Load. The Excel columns in the specified data range are listed in the Import pane.
    6. Click Import, and then click OK. The stage maps the columns.
  6. Confirm that the values that you entered in the Configuration window are saved on the Property tab of the stage editor.
  7. On the Output page, select the Employees link as the Output name.
  8. On the Columns page, change the data type of the EMP_NO column to integer, and then click OK.
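
To check a range expression before you enter it in the stage, you can work out how many columns and data rows it covers. The following Python sketch is only an illustration and is not part of the product: the describe_range and col_to_index helpers are hypothetical names. It assumes the Sheet!A2:C6 form used in this procedure and that the first row of the range holds the column headers.

```python
import re

# Hypothetical helper, not a DataStage API: matches expressions such as
# "Departments!A2:C6" (sheet name, start cell, end cell).
RANGE_RE = re.compile(
    r"^(?P<sheet>[^!]+)!(?P<c1>[A-Z]+)(?P<r1>\d+):(?P<c2>[A-Z]+)(?P<r2>\d+)$"
)


def col_to_index(col):
    """Convert an Excel column label (A, B, ..., Z, AA, ...) to a 1-based index."""
    index = 0
    for ch in col:
        index = index * 26 + (ord(ch) - ord("A") + 1)
    return index


def describe_range(expr, header_in_first_row=True):
    """Return the sheet name, column count, header row, and data row count."""
    match = RANGE_RE.match(expr)
    if match is None:
        raise ValueError("Unsupported range expression: " + expr)
    first_row = int(match.group("r1"))
    last_row = int(match.group("r2"))
    total_rows = last_row - first_row + 1
    columns = col_to_index(match.group("c2")) - col_to_index(match.group("c1")) + 1
    return {
        "sheet": match.group("sheet"),
        "columns": columns,
        # Assumption: "First row of data ranges" means the first row of the
        # range is the header, so it is not counted as a data row.
        "header_row": first_row if header_in_first_row else None,
        "data_rows": total_rows - 1 if header_in_first_row else total_rows,
    }


if __name__ == "__main__":
    for expr in ("Departments!A2:C6", "Employees!A2:L34"):
        print(describe_range(expr))
```

Under these assumptions, Departments!A2:C6 covers 3 columns with the header in row 2 and 4 data rows, and Employees!A2:L34 covers 12 columns with the header in row 2 and 32 data rows. You can compare these counts with the columns that are listed in the Import pane after you click Load.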