Quotes

Thursday, October 27, 2016

Message modelling using DFDL in WMB

This article addresses the DFDL message modelling feature in IBM® WebSphere® Message Broker (also known as IBM® Integration Bus). It contains step by step instructions for creating a DFDL schema for a flat file using WMB tool-kit.
Data Format Description Language, commonly known as DFDL is an open standard based modelling language which is using in Websphere message broker V8 onwards for modelling data.  DFDL is based on the XML schema and is the recommended way of modelling over the proprietary MRM domain. DFDL is using a subset of the constructs in XML schema to modell the non-XML messages.
DFDL can be used to describe legacy data files,  record-oriented text or scientific and numeric computations. DFDL is more flexible than the MRM modelling and provides better performance.
Objective
To Parse a Tab separated flat file using DFDL in Websphere Message Broker v8. This flat file contains a repeating structure having optional records as well.
Input data:
FlatFile1



A ‘Show all characters’ view from notepad++ as below.
2FlatFile2



Note : The last record is not terminated with a new line.
Below are the steps to create a DFDL model for the above test data.
1. Create a library named EmployeeDFDLLib, right click-on it, and select New => Message Model:
3_createModel





2. Select Record-oriented  text and click Next:
4_CreateModel










3. Select Option-2. Create an empty DFDL schema (we can choose option 1 as well. Here we are using the second option).
5_CreateModel










4. Create folder named ‘TabSeperated’ and schema named ‘TabSeperatedSchema’ and Click Finish.
6_createModel










5. Here is the empty model schema we have created:
7_Schema






Now we are going to define two complex types here (For ‘address’ and ‘name’).
a). Click the ‘Add a complex Type to the Schema’ option on Schema Tab.
8_CreateCompl
b). And give the complex type name as ‘PersonName
9_PersonName
c). Add two local elements to this complex type as ‘FirstName’ and ‘LastName’.
10_addLocalName 11_AddLocalName
d). Similarly create another complex type as ‘Address’ and add local elements to it.
12_Address 13_Address
Now we are going to create our message. Here we should have a root element, followed by a repeating message structure of Header (Name) and Body (Address).
i). Create a Message named ‘Employee’. Use the ‘Add a message to the schema’ option as below:
14_Message 15_Employee
ii). Add a local element to the message Employee and name it as ‘EmployeeDetails’.
16_EmployeeDetails
iii). Change ‘EmployeeDetails’ type from String to Anonymous.
17_EmpDtls
iv). Add 3 local elements to EmployeeDetails.
18_EMPDtls_1
v). Change the Type of element to the comlextTypes we already created.
19_EmpDtls
vi). HomeAddress is Optional. Change the Minimum Occurrence to ‘0’. You may get an error. Change the ‘Occurs Count Kind’ property to ‘implicit’ and save.
20_Empdtls 21_Empdtls
vii). It is been noted that the Separator ‘,’ is been applied for all sequence by default if we have not defined any separator.
22_Seperator
To reset this, please follow the steps
Click the ‘Show all sections’ on top.
23_Showallsections
Go to the data format and click on the ‘<default format>’. Remove the separator and save.
24_seperator
viii). Now define a separator for the message sequences. Define separator ‘CRLF’ for the EmployeeDetails sequence.
26_Sequence
ix). Change the Employeedetails Maximum occurrence to ‘unbounded’ (Repeating structure).
27_Sequence
x). Click on the sequence of EmployeeDetails. Update the separator policy from ‘Suppressed’ to ‘SuppressedAtEndLax’ and save (SuppressedAtEndLax means there is no clash with delimiters from the containing structure –Refer here).
28_Seqnce
xi). Define the initiator ‘H%HT’ (H followed by Horizontal TAB) for EmployeeName
29_initiator
xii). Similarly define the initiator for other two elements as below:
30_initiator
xii). Define the element sequence as ‘%HT’ (Each element separated by TAB).
31_initiator
xiv). We do not have a ‘CRLF’ as last record terminator. Hence update the property to avoid parser exceptions.
Document Final terminator can be missing’ property can be found on Data Formats<default format> properties. Update this to ‘yes’.
32_InifixSeperator
Now the DFDL is ready for testing. Websphere Message Broker V 8 provides a feature to test our message model using DFDL before it can actually be using in any message flow.
Click on the ‘Test Parse Model’ on top and it will show the wizard.
33_test
Select the input file and click OK. It will prompt to switch to DFDL Test perspective. Click Yes.
It will parse successfully and produce the output as below.
36_test
(You can check the DFDL Trace to see the details of message parsing).
If we have to produce the Tab separated message as output (serialise) from an XML, we can use the ‘Test Serialize Model’ option to test it. For this we should have an XML sample message generated as per the DFDL Schema.
Below is the input sample XML to be serialized as a Tab separated flat text.
37_Test
Click on the ‘Test Serialize Model’ and provide the input XML in the wizard.
It will produce the Tab Separated output as below.
38_Test
Appendix
DFDL can be used in message flows to parse (input) or serialise (Output) message.
  • For input parsing, you can set the parser option as DFDL and select the message (Employee) using the browse option.39_input
  • For output serialising, use the ESQL to set the DFDL domain and set the message type as below:
CREATE LASTCHILD OF OutputRoot DOMAIN ‘DFDL’ NAME ‘DFDL’;
SET OutputRoot.Properties.MessageType= ‘{}:Employee’; — Employee is the message name.

No comments:

Post a Comment