Skip to main content

HPCC Systems blog contributors are engineers and data scientists who for years have enabled LexisNexis customers to use big data to fulfill critical missions, gain competitive advantage, or unearth new discoveries. Check this blog regularly for insights into how HPCC Systems technology can put big data to work for your own organization.

Arjuna Chala on 08/01/2018
I recently presented at the Open Data Science Conference (OSDC) in Boston. During my presentation, I addressed how a well-designed data lake solution can help solve fundamental data integration problems, and I used a real-world example involving analysis of a New York city taxi company’s route and fare data (which was stored in various formats depending on what was being tracked specifically) to illustrate how data analysis could improve outcomes for cab drivers, cab companies and the local government.
Richard Chapman on 07/19/2018
A Bloom filter, named after its inventor Burton Howard Bloom, is a data structure that can be used to perform a cheap test for the potential presence of a particular value, in a way that is much faster than looking up the value in an index, requiring much less storage than the index would. Note the “potential” there. The Bloom filter can tell you for certain if a value is not present, but it cannot say for certain that a value is present, only that it may be present.
Dan Camper on 07/13/2018
As new ECL programmers, we've all been there.  Using the HPCC Systems big data technology, you've successfully imported a bunch of data, analyzed it using ECL, created aggregated datasets, built an index around the aggregations and written ROXIE code to deliver query results on those aggregations in sub-second time. But then...
Jessica Lorti on 07/06/2018
On June 28, 2018, HPCC Systems hosted the latest edition of The Download: Tech Talks. This series of workshops is specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.
Rodrigo Pastrana on 06/12/2018

One new feature to look out for in HPCC Systems 7.0.0, is log visualization integration with ECL Watch using ELK (ElasticSearch, Logstash and Kibana). Log visualizations help identify, track and predict important events and trends on HPCC Systems clusters, by spotting interesting patterns and giving you visual clues which are easier to interpret than reading through the log file itself.

Jessica Lorti on 06/08/2018

One of the major challenges facing financial institutions is customer attrition, or churn. Studies suggest it costs five to seven times more to attract new customers than it does to retain current customers. Studies have also shown that companies are more likely to retain customers who engage frequently with a product or service.

Lorraine Chapman on 06/04/2018

Systemd is a system start-up and process manager for Linux. In recent years, it has become the boot manager of choice on newer Linux distributions. It is one of many new features you can try out in our HPCC Systems 7.0.0 Beta version which we are preparing for release soon.

Jessica Lorti on 05/23/2018
On May 17, 2018, HPCC Systems hosted the latest edition of The Download: Tech Talks. This series of workshops is specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.
Jessica Lorti on 05/11/2018
The idea of self-driving cars has long fascinated mankind, but most thought it was a topic so far fetched it could only exist in a child’s cartoon. Yet today, this dream is becoming a reality.
Jessica Lorti on 04/27/2018
In 2016, more than 4000 workers died on the job in the US, and there were nearly 3 million private-sector work-related injuries. Yet a key component of workplace safety, the protective hard hat, has remained largely unchanged over the past 60 years. GuardHat, Inc. aims to save lives and prevent injuries with a smart hardhat — a wearable, Internet of Things platform in an industrial hardhat form-factor.