Skip to main content
Submitted by SharonVan on 14 August 2020

September 17, 2020 - Webinar
11:00am ET

Join us as we continue this webinar series specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.

Featured topics and speakers include three of our 2020 HPCC Systems summer interns and their projects.

Matthias Murray, Masters in Data Science, New College of Florida, will present on his work: Applying HPCC Systems Word Vectors to SEC Filings.

This project involves reporting on the current status of vectorisation and NLP representation of SEC filings and then compiling identified SEC filing cases and their intersection from a LexisNexis perspective. He will need to sort and transform SEC data, creating a function to convert the data into a format required by the HPCC Systems Word Vectors ML bundle. More information about this project is available in the associated JIRA issue.

Robert Kennedy, Research Assistant, Florida Atlantic University, will speak on Implementing a Multi-node, Multi-GPU Accelerated Deep Learning Algorithm using GNN.

During his 2020 internship, he aims to expand on our existing GNN bundle to improve our GPU accelerated neural network training. By the end of his internship, HPCC Systems will be able to train neural networks, at scale, across many GPUs, across many GPU enabled nodes using different parallelisation techniques that are suited to deep learning tasks. Robert's work will increase the robustness of the underlying GNN library by identifying areas for improvement while documenting best practices to be used when training neural networks on GPUs using the GNN bundle. More information about this project is available in the associated JIRA issue.

Vannel Zeufack, Masters in Computer Science, Kennesaw State University, will share his project on Implementing a Preprocessing Bundle for the HPCC Systems ML Library.

The purpose of his 2020 project is to make the data preprocessing phase of machine learning on HPCC Systems easier and faster. He also plans to produce a preprocessing bundle tutorial to demonstrate how the different modules in the preprocessing bundle could be used together to easily prepare data for a machine learning project. More information about this project is available in the associated JIRA issue.

Register for the webinar

Submit a talk for an upcoming episode!

  • Have a new success story to share?
  • Want to pitch a new use case?
  • Have a new HPCC Systems application you want to demo?
  • Want to share some helpful ECL tips and code sample?
  • Have a new suggestion for the roadmap?

It’s easy! All you need to do is submit a talk title and brief abstract to If chosen, you will be asked to present remotely for an upcoming 20-minute tech talk.