Skip to main content

HPCC Systems blog contributors are engineers and data scientists who for years have enabled LexisNexis customers to use big data to fulfill critical missions, gain competitive advantage, or unearth new discoveries. Check this blog regularly for insights into how HPCC Systems technology can put big data to work for your own organization.

Roger Dev on 10/05/2018
Decision Tree based learning methods have proven to be some of the most accurate and easy-to-use Machine Learning mechanisms. We call these mechanisms "Learning Trees". We explore the hows and whys of the various Learning Tree methods and provide an overview of our recently upgraded LearningTrees bundle.
Jessica Lorti on 09/21/2018
On September 13, 2018, HPCC Systems hosted the latest edition of The Download: Tech Talks. This series of workshops is specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community. In this very special edition of Tech Talks, we featured more of our 2018 HPCC Systems summer interns and the work they are doing with machine learning.
Jessica Lorti on 09/10/2018
Welcome to our new interview series “5 Questions with an HPCC Systems Community Member” where we will highlight some of our most prominent HPCC Systems community members. These are the people who are avid users of our open source platform and can offer real world expertise and best use information. A doctoral candidate at Keiser University and a computer science instructor at Wayne State University, Itauma Itauma is an expert in learning analytics and uses HPCC Systems for his educational research.
Roger Dev on 08/30/2018
Cause and effect lie at the heart of human discourse and knowledge. Yet computer science and mathematics has very little to say on the subject until recently. There are now algorithms that can detect patterns of cause and effect from data. We explore these mechanisms and how they relate to Machine Learning and Artificial Intelligence.
Jessica Lorti on 08/22/2018
On August 2, 2018, HPCC Systems hosted the latest edition of The Download: Tech Talks. This series of workshops is specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community. In this very special edition of Tech Talks, we are featuring some of our 2018 summer interns and the exciting work they are doing.
Arjuna Chala on 08/01/2018
I recently presented at the Open Data Science Conference (OSDC) in Boston. During my presentation, I addressed how a well-designed data lake solution can help solve fundamental data integration problems, and I used a real-world example involving analysis of a New York city taxi company’s route and fare data (which was stored in various formats depending on what was being tracked specifically) to illustrate how data analysis could improve outcomes for cab drivers, cab companies and the local government.
Richard Chapman on 07/19/2018
A Bloom filter, named after its inventor Burton Howard Bloom, is a data structure that can be used to perform a cheap test for the potential presence of a particular value, in a way that is much faster than looking up the value in an index, requiring much less storage than the index would. Note the “potential” there. The Bloom filter can tell you for certain if a value is not present, but it cannot say for certain that a value is present, only that it may be present.
Dan Camper on 07/13/2018
As new ECL programmers, we've all been there.  Using the HPCC Systems big data technology, you've successfully imported a bunch of data, analyzed it using ECL, created aggregated datasets, built an index around the aggregations and written ROXIE code to deliver query results on those aggregations in sub-second time. But then...
Jessica Lorti on 07/06/2018
On June 28, 2018, HPCC Systems hosted the latest edition of The Download: Tech Talks. This series of workshops is specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.
Lorraine Chapman on 06/20/2018

HPCC Systems 7.0.0 Beta is now available for download. It includes all the major new features we expect to be included in the final gold version, targeted for release later in the year. There are some great new features and performance enhancements you might like to test drive and we're looking for your feedback as we make the final tweaks.