These instructions will guide you through installing and running the HPCC Community Edition on a single node to start and then optionally, expand it to a larger cluster of nodes.

ECL is the Enterprise Control Language designed specifically for huge data projects using the HPCC platform. Its extreme scalability comes from a design that allows you to leverage every query you create for re-use in subsequent queries as needed. To do this, ECL takes a dictionary approach to building queries wherein each ECL definition defines an Attribute. Each previously defined Attribute can then be used in succeeding ECL Attribute definitions as the language extends itself as you use it.

The Standard Library Reference includes utility functions for string handling, date manipulation and others.

The ECL Programmers Guide gives an introduction to the ECL language along with example data and use cases. The supporting ECL Code Files referenced in the guide are included below.

Supporting ECL Files
In ZIP format


Comrise, a HPCC Systems partner, has also made available a Chinese translated version of the ECL Programmers Guide (version

This document covers configuring an HPCC environment using the Configuration Manager, including Advanced View.

This tutorial provides a walk-through of the development process from beginning to end and is designed to be an introduction to working with data on HPCC, as well as give an introduction to Thor and Roxie. The Sample Data Files are required references for the tutorial.

Sample Data File (one needed)
In ZIP format
In tar.gz format OriginalPerson.tar.gz
Uncompressed OriginalPerson
Running an HPCC Systems cluster inside an Amazon Web Services (AWS) Elastic Cloud (EC2) allows you to instantiate and run HPCC Systems clusters of different sizes on the fly.

This is useful for:
• Proof-of-concept
• Experimentation
• Learning
• Leveraging the HPCC Systems platform without incurring cost of hardware and administration
• Create and use an HPCC Systems cluster immediately without purchasing and installing new hardware

You can create a small cluster for small tasks or larger clusters for larger jobs. This flexibility allows you to match cost and processing power to the job at hand.

Temporary EC2 nodes allow you to "rent" computing capacity without long term commitments. In this manner, you pay as you go instead of incurring large fixed costs at the start. Keep in mind that you should terminate any unneeded instances to avoid paying for computing time you don't need.

HPCC Systems® Instant Cloud for AWS
Provide minimal information and press a button to instantiate your cluster.
Click here to begin. The manual below provides additional details.

We are working with Amazon on a process to run an HPCC Systems cluster in an environment similar to Amazon Elastic Map Reduce (EMR). In addition, HPCC Systems is working on its own Enterprise Cloud offering, but it is not yet ready to be announced.

Have a question about HPCC and AWS? Post on the Cloud Forums.

This step by step ECL Programming Example details how to solve the challenge of finding the Six degrees of Separation between any actor and Kevin Bacon.

This document is a guide for administrators of an HPCC Systems Platform including: routine and on-demand maintenance steps, preflight checks, system configuration, and best practices.

This guide is a collection of tips and techniques that target the best practices of ECL programming.

ECL Playground is a tool hosted on an ESP server. A page runs in your browser, allowing you to access and execute self-contained ECL code on your HPCC system without the use of any other tools. The ECL Playground then shows you the results and the graph in your browser. The view is very similar to what the ECL IDE displays.
Listen to the ECL Playground Podcast and how it can be a helpful tool for developers.

This outlines the system requirements and steps for installing and running an HPCC system on a single node inside a virtual machine.

The ECL compiler now supports adding external resources to a Workunit using a manifest file. The manifest, in XML format, adds external resources such as XSLT transformations, or other objects to the archive before it is compiled and deployed to the server. This document demonstrates how to use this feature to visualize ECL job results using Google Chart Tools.

The ECL Scheduler is a component process installed with the HPCC system platform. ECL Scheduling provides a means of automating processes within ECL code or to chain processes together to work in sequence.

This document provides steps you can use to certify your HPCC Systems environment. Use this procedure when you create a new HPCC system or after you make any changes or upgrades to the system. This will ensure that your system is functioning properly.

These steps explain how to move data on or off an HPCC Cluster.

This document describes the technical preview of the graphical monitoring and reporting components in the HPCC Systems platform.

This guide is an introduction to the Rapid Data Delivery Engine (ROXIE).

This manual contains documentation for the set of Client Tools for use with the HPCC Systems platform. These tools include: ECL IDE, ECL Debugger, ECLPlus, ECL Command Line, ECL Compiler, ESDL Command Line, and DFUPlus.

This manual contains documentation for the set of Client Tools for use with the HPCC Systems platform. These tools include: ECLPlus, ECL Command Line, ECL Compiler, ESDL Command Line, and DFUPlus.

This document describes ECL Watch, an interface to the HPCC system that allows you to view information and interrogate nodes to confirm all expected processes are running. This provides a simple Web-based view into the system and a means to perform Workunit and data files maintenance.

This manual covers Dynamic ESDL (Enterprise Service Description Language) --a methodology that helps you develop and manage web-based query interfaces quickly and consistently.

Dynamic ESDL takes an interface-first development approach. It leverages the ESDL Language to create a common interface “contract” that both Roxie Query and Web interface developers will adhere to. It is intended to allow developers to create production web services, with clean interfaces that can evolve and grow over time without breaking existing applications.

The machine learning library contains an extensible collection of machine learning routines which are easy and efficient to use and are designed to execute in parallel across a cluster. The list of modules supported will continue to grow over time.

The following system requirements cover what is needed to support the HPCC Platform and ECL IDE included in the Community Edition.

The HPCC Systems Community Wiki includes information provided by the community covering best practices, tips, sample code and examples. The HPCC Systems Red Book also contains useful information to help users manage the transition between releases.

Contact Us

email us   Email us
Toll-free   US: 1.877.316.9669
International   Intl: 1.678.694.2200

Sign up to get updates through
our social media channels:

facebook  twitter  LinkedIn  Google+  Meetup  rss  Mailing Lists