HPCC Systems blog contributors are engineers and data scientists who for years have enabled LexisNexis customers to use big data to fulfill critical missions, gain competitive advantage, or unearth new discoveries. Check this blog regularly for insights into how HPCC Systems technology can put big data to work for your own organization.

Cassandra Walker on 01/17/2020
James McMullan, Sr. Software Engineer at LexisNexis Risk Solutions, gave an overview of the Spark-HPCC Plugin & Connector in a breakout session at the 2019 HPCC Systems Community Day. This presentation also included an introduction to Apache Zeppelin, a demonstration of a random forest model created in Spark, and a discussion about the future of the Spark-HPCC Ecosystem.
Jessica Lorti on 01/13/2020
In this month’s “5 Questions” interview series, Flavio Villanustre talks with Anthony Fishbeck. Anthony is a senior architect for LexisNexis Risk Solutions and the creator of the Enterprise Services Platform (ESP), a highly adaptive and extensible component that serves as the front end of HPCC Systems.
Roger Dev on 01/07/2020
The GNN (Generalized Neural Network) bundle provides an ECL interface to Keras and TensorFlow. Using GNN, an ECL developer can construct, train, and utilize arbitrarily complex neural networks, including Classical, Convolutional, and Recurrent networks. These networks can be used to analyze complex data such as images, video, and time series.
Lorraine Chapman on 12/16/2019
This competition provides students working on HPCC Systems related projects with an opportunity to showcase their achievements. Thirteen students presented on projects ranging from platform-related features and machine learning library contributions to interesting use cases and research results. Read on to meet the students and judges, see the posters, and learn which were the winning entries.
Flavio Villanustre on 12/09/2019
Flavio Villanustre (VP, Technology and CISO, LexisNexis Risk Solutions) recounts the advice he recently gave to a friend who found that increasing the amount of memory on his Thor cluster didn't improve performance as much as expected. Read on to hear Flavio's advice about how to approach improving the performance of Thor and why increasing the amount of memory isn't necessarily the answer.
Cassandra Walker on 12/02/2019
The RELX Group Information Assurance and Data Protection organization (IADP) provides oversight of privacy, security, and compliance practices as part of the company’s comprehensive risk mitigation program. The IADP generally works with the Risk Solutions and Legal and Professional businesses, focusing on PII (personally identifiable information) and SPII (sensitive personally identifiable information) available through LexisNexis online products.
Bahar Fardanian on 11/20/2019
Wouldn’t it be great to show our future generation what the tech industry looks like? In a great move, CodeDay is showing our younger generation, high school and middle school students, the tools and technologies that are being used in today’s world. This gives them an idea of what their future looks like and how they can take a role in it when it comes to college.
Lili Xu on 11/14/2019
In this blog, I will introduce another clustering bundle: the DBSCAN Bundle, a highly scalable and parallelized implementation of the DBSCAN algorithm. DBSCAN is a density-based unsupervised machine learning algorithm that automatically clusters data into subclasses or groups.
Lorraine Chapman on 11/14/2019
The proposal application period for internships in 2020 is now open. The closing deadline for proposal applications is Friday, March 20, 2020. But don't wait until the last minute! Read on to find out more about our intern program, how it works, and how to apply.
Arjuna Chala on 11/08/2019
The next generation of Data Scientists will be tasked with solving problems around what IDC predicts will be 175 zettabytes of data by 2025. To help universities educate students on solving Big Data problems at scale, LexisNexis Risk Solutions is promoting the open source HPCC Systems Data Lake platform by sponsoring hackathons, workshops, internships, and research programs.