
HPCC Systems blog contributors are engineers, data scientists, and fellow community members who want to share knowledge, tips, and other helpful information from across the HPCC Systems community. Check this blog regularly for insights into how HPCC Systems technology can put big data analytics to work for your own needs.

Roger Dev on 07/20/2020
The HPCC Systems COVID-19 Tracker provides enhanced insight into the state and evolution of the COVID-19 pandemic at the country, state, and county levels. It provides unique metrics and a comprehensive dashboard for use by health officials and curious individuals alike.
Roger Dev on 01/07/2020
The GNN (Generalized Neural Network) bundle provides an ECL interface to Keras and TensorFlow. Using GNN, an ECL developer can construct, train, and utilize arbitrarily complex Neural Networks such as Classical, Convolutional, and Recurrent networks. These networks can be utilized to analyze complex data such as images, video, and time-series.
Roger Dev on 04/09/2019
Text Vectorization allows for the mathematical treatment of textual information. Words, phrases, sentences, and paragraphs can be organized as points in high-dimensional space such that closeness in space implies closeness of meaning. HPCC Systems' new TextVectors module supports vectorization for words, phrases, or sentences in a parallelized, high-performance, and user-friendly package.
Roger Dev on 10/05/2018
Decision Tree based learning methods have proven to be some of the most accurate and easy-to-use Machine Learning mechanisms. We call these mechanisms "Learning Trees". We explore the hows and whys of the various Learning Tree methods and provide an overview of our recently upgraded LearningTrees bundle.
Roger Dev on 08/30/2018
Cause and effect lie at the heart of human discourse and knowledge. Yet computer science and mathematics had very little to say on the subject until recently. There are now algorithms that can detect patterns of cause and effect from data. We explore these mechanisms and how they relate to Machine Learning and Artificial Intelligence.
Roger Dev on 04/18/2018
The Myriad Interface allows users of the HPCC Systems Machine Learning bundles to execute multiple independent machine learning activities within a single interface invocation. Learn how this works and how to use it.
Roger Dev on 02/14/2018
HPCC Systems provides a rich set of Machine Learning tools. This article provides an overview of the available bundles, and a tutorial on how to install and use them.
Roger Dev on 01/22/2018
A quick but potent intro to Machine Learning for those who are new to the subject. This article provides enough of the basic theory and terminology to make you dangerous.
Roger Dev on 09/13/2017

One of the main pieces of preliminary work involved in the major refactoring of the HPCC Systems Machine Learning Library was to productize PBblas as the backbone for Matrix Operations. The Parallel-Block Basic Linear Algebra Subsystem (PBblas) provides a mechanism for adapting matrix operations to Big Data and parallel processing on HPCC Systems clusters.
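
To illustrate the general idea behind block-partitioned matrix operations, here is a minimal Python sketch of a blocked matrix multiply. It is not PBblas code and does not use the PBblas API; the function names `blocked_matmul` and `naive_matmul` and the `block` parameter are hypothetical, chosen only for this example. The point it shows is that each output block depends only on one row-strip of the left matrix and one column-strip of the right matrix, so the blocks could in principle be computed independently on different nodes.

```python
from typing import List

Matrix = List[List[float]]


def naive_matmul(a: Matrix, b: Matrix) -> Matrix:
    """Reference product, used only to check the blocked version."""
    n, k, m = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]


def blocked_matmul(a: Matrix, b: Matrix, block: int) -> Matrix:
    """Multiply a (n x k) by b (k x m), working one block at a time.

    Hypothetical illustration: in a distributed setting, each (i0, j0)
    output block could be assigned to a different worker.
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    for i0 in range(0, n, block):          # row-strip of the output
        for j0 in range(0, m, block):      # column-strip of the output
            for p0 in range(0, k, block):  # accumulate partial products
                for i in range(i0, min(i0 + block, n)):
                    for j in range(j0, min(j0 + block, m)):
                        acc = 0.0
                        for p in range(p0, min(p0 + block, k)):
                            acc += a[i][p] * b[p][j]
                        c[i][j] += acc
    return c


if __name__ == "__main__":
    a = [[1, 2, 3], [4, 5, 6]]
    b = [[7, 8], [9, 10], [11, 12]]
    # The blocked result should match the straightforward product.
    print(blocked_matmul(a, b, block=2) == naive_matmul(a, b))
```

The block size here is arbitrary; a real system would presumably choose it based on available memory per node and the cost of moving blocks between nodes.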