Submitted by SharonVan on 19 April 2020

August 19, 9:00am - August 20, 5:00pm - Remote via Microsoft Teams 

Note: Due to the continued need for social distancing, the class will be held remotely via Microsoft Teams. With the switch to the remote format, Part 2 classes will start earlier on Wednesday (9 AM) and conclude on Thursday by 5 PM. Please note this time change when enrolling. After you enroll, your instructor will send you a welcome email with details on what you will need.

This course is the second in a two-part advanced series. People who attend the Advanced ECL (Part 1) – Working with Relational Data class should continue with this class.

This course explores the concept of SuperFiles in ECL and the techniques for working with XML and JSON data: getting it into your HPCC Systems platform and defining it to work with other data elements. This flows naturally into ECL's detailed support for Natural Language Parsing: creating pattern-matching definitions and using the PARSE function to extract data from either XML or free-form text.

Course Length: 2 days

Class Prerequisites: Students must have attended the Introduction to ECL (Part 1) - Concepts and Queries and Introduction to ECL (Part 2) - the Extract, Transform, and Load (ETL) Process training classes. Students are welcome to bring their own laptops to take away the code and examples from the class.

Topics include:

  • SuperFiles and SuperKeys
  • Simple XML Spray and Dataset Definition
  • Working with XML Data (simple, complex, and nested)
  • Complex XML Spraying and De-spraying
  • PARSE with XML Data
  • Spraying and defining free-form text data
  • PARSE with free-form text
  • New! An Introduction to Machine Learning

Class begins at 9am and concludes at 5pm on both Wednesday and Thursday.

RELX and LexisNexis employees should register using the approved promo code provided by their manager or contact training@hpccsystems.com.

Click to enroll in Part 2.