Skip to main content

HPCC Systems blog contributors are engineers, data scientists, and fellow community members who want to share knowledge, tips, and other helpful information happening in the HPCC Systems community. Check this blog regularly for insights into how HPCC Systems technology can put big data analytics to work for your own needs.

Richard Chapman on 04/21/2020
This is the first in a series of blogs about the new development work in progress to provide a Cloud native version of our platform. Our latest release series, HPCC Systems 7.8.x, includes the first wave of changes. In this blog, Richard Chapman talks about our journey to the Cloud and demonstrates how to setup a simple test cluster using a default helm chart.
Richard Chapman on 03/18/2019
Although Java was the first language that we created a plugin for (not counting embedded C++ for which didn’t need a plugin), in some ways the support for using Java with ECL has been a bit more limited than for some other languages, particularly Python. However, in the upcoming 7.2 release of the platform, that’s going to change.
Richard Chapman on 07/19/2018
A Bloom filter, named after its inventor Burton Howard Bloom, is a data structure that can be used to perform a cheap test for the potential presence of a particular value, in a way that is much faster than looking up the value in an index, requiring much less storage than the index would. Note the “potential” there. The Bloom filter can tell you for certain if a value is not present, but it cannot say for certain that a value is present, only that it may be present.
Richard Chapman on 03/02/2017

by Richard Chapman

Richard Chapman on 12/13/2016

TensorFlowTM (see is a new open-source program from Google for performing linear algebra operations on tensors (matrices) and connecting multiple such operations together. It is particularly suited for machine learning applications, and supports operations on GPUs as well as cluster-based operations across multiple machines when dealing with data that is too large for a single machine to handle.

Richard Chapman on 11/23/2016

One downside of using embedded database calls such as MySQL or Cassandra in your ECL code was that specifying the fields to be returned (or passed in, when inserting rows) was a little clunky and potentially inefficient. Projecting fields into EMBEDs makes this process much easier and more efficient in HPCC Systems 6.2.0.

Let's take a step back and review the approach ECL Developers may have been using to date and then take a look at how to use this new feature.

Richard Chapman on 03/16/2016

Recently, the HPCC platform team held one of our offsite conferences, which usually involves us disappearing into the wilderness somewhere for a week. So, leaving the day to day pressures behind, we all decamped to a remote cottage in the frozen North of England to discuss how to improve HPCC Systems as well as future development plans. Something unusual often comes out of these gatherings and this was no exception.

Richard Chapman on 10/26/2015

If you are an ECL programmer, there are a lot of things you don’t need to worry about that programmers in low level languages like C or C++ need to think about: