Skip to main content

HPCC Systems blog contributors are engineers, data scientists, and fellow community members who want to share knowledge, tips, and other helpful information happening in the HPCC Systems community. Check this blog regularly for insights into how HPCC Systems technology can put big data analytics to work for your own needs.

Russ Whitehead on 11/06/2019
In order to conduct the complex analytics that bring meaning to data, big data platforms require access to massive amounts of potentially sensitive data. And, no matter how powerful or easy to use a big data platform is, it can become a serious liability if it isn’t properly secured. Which begs the question: What is required to properly secure a big data platform from unauthorized access or data theft?
Jessica Lorti on 11/01/2019
In this month’s “5 Questions” interview series, Flavio Villanustre talks with Allan Wroebel. Allan is a senior software engineer at LexisNexis Risk Solutions and a long-time ECL user. Initially working with data operations, Allan now serves as an ECL developer on both Thor and ROXIE.
Cassandra Walker on 10/22/2019
This ECL Tip spotlights the Enterprise Control Language (ECL) AGGREGATE built-in function. ECL AGGREGATE has been seen by many in the community as ‘complex,’ and as such, has been underused. However, in using AGGREGATE you can be sure you’re playing to the strengths of HPCC Systems.
Russ Whitehead on 09/27/2019
As of HPCC Systems version 7.6.0, a new cryptographic module has been added to the ECL Standard Library. Within this module, there is an assortment of cryptographic features available to ECL developers to utilize in order to safeguard their sensitive data, using industry standard cryptographic algorithms. Features include digital hash algorithms, symmetric and asymmetric encryption and decryption, and digital signatures, all of which can be applied to individual columns within an ECL dataset.
Cassandra Walker on 09/23/2019
This ECL Tip spotlights the Enterprise Control Language (ECL) DISTRIBUTE function. The ECL DISTRIBUTE function redistributes data across all nodes in a cluster. Using the DISTRIBUTE function can help prevent “cluster skew”, by distributing data evenly across all nodes. This function can be used on Thor or the ROXIE cluster.
Jessica Lorti on 09/12/2019
In this month’s “5 Questions” interview series, Flavio Villanustre talks to Luke Pezet, Vice President of Software Architecture at Archway Health, about his dynamic career in technology, his experience with big data platforms and why he ultimately settled on HPCC Systems as his go-to framework.
Cassandra Walker on 08/29/2019
Deep learning is a subset of machine learning that is modeled on the basis of the human brain. It essentially teaches computers what comes naturally to humans (learning by examples). In this blog, we discuss how deep learning models using background knowledge were used to achieve sequence learning on traffic and natural language. We also introduce the deep learning tool, TensorLayer.
Rodrigo Pastrana on 08/29/2019
As of 7.2.0, HPCC Systems platform users have access to a new Web service called WsStore. The service is a general purpose key-value (KV) store. A key-value store is a type of non-relational database that uses a simple key-value pair mechanism to store data in which the key serves as a unique identifier to index associated value entries. Keys and values can be anything--ranging from simple literals to complex compound objects.
Jessica Lorti on 08/15/2019
This month’s “5 Questions” interview series features one of our talented interns, Yash Jain. Pursuing his Bachelor of Engineering at the University of Mumbai, Yash’s internship project is entitled, “Cluster Deployment with Juju Charm.”
Flavio Villanustre on 08/08/2019
As is quite evident today, data can be used to improve just about everything. Our data-centered volunteer work has allowed us to help find missing people and provide education to children in need.