Skip to main content

End-to-end big data in a massively scalable supercomputing platform.

Open-source. Easy to use. Proven.

Try Now

Discover HPCC Systems

This truly open source solution allows you to quickly process, analyze, and understand large data sets, even data stored in massive, mixed schema data lakes.


Easier to update.

Easier to learn.

Easier to program.

Easier to integrate data.

Easier to manage clusters.

Join Us! October 15-16, 2019

Call for Poster Abstracts open until September 30

What's New?

New Release Available:
HPCC Systems 7.4.0 Gold

Download now and find out more about how to take advantage of new features and improvements in Java Embed, Spark, ECL Watch, ECL Language, our Standard Library, and more! View the release notes for the full list of changes. The supporting documentation is available on our website and we recommend that you browse the Red Book for additional information about specific items.

New Blog:
ECL Tips - A Tiny Trove of TABLE Tidbits

This ECL Tip spotlights the Enterprise Control Language (ECL) TABLE function. The TABLE function is a versatile tool for ETL (extract, transform, load) operations, and was one of the first ECL statements available, well before the family of TRANSFORM functions.

5 Questions with
Dr. Taghi M. Khoshgoftaar

Read and Listen as Flavio Villanustre and Dr Khoshgoftaar discuss big data, Khoshgoftaar’s interest in academics and education, and his time as a mentor. With more than 750 published journals and conference papers, Dr. Khoshgoftaar is recognized as one of the foremost experts in the field.

Summer of Learning- Free Training Offer

To help you get started with HPCC Systems, one of the easiest to use open source big data platforms, we're offering complimentary access to our extensive online training library over the summer. Offer expires Aug 31, 2019

Modern IDE support for data programming

Try playing with the code on our virtual playground and sample dataset.

Download the ECL IDE

Visit the ECL Playground

Get the VS Code ECL Extension

The HPCC Systems Platform

ETL Engine (Thor)

Extract, Transform, and Load your data using a powerful scripting language (ECL) specifically developed to work with data.

Query Engine (ROXIE)

An index based search engine to perform real-time queries. SOAP, XML, REST, and SQL are all supported interfaces.

Data Management Tools

Data Profiling, Data Cleansing, Snapshot Data Updates and consolidation, Job Scheduling and automation are some of the key features.

Machine Learning Tools

In place (supporting distributed linear algebra) predictive modeling functionality to perform Linear Regression, Logistic Regression, Decision Trees, and Random Forests.

Check out our blog

  • Jessica Lorti
    5 days 1 hour ago
    This month’s “5 Questions” interview series features one of our talented interns, Yash Jain. Pursuing his Bachelor of Engineering at the University of Mumbai, Yash’s internship project is entitled, “Cluster Deployment with Juju Charm.”
  • Flavio Villanustre
    1 week 4 days ago
    As is quite evident today, data can be used to improve just about everything. Our data-centered volunteer work has allowed us to help find missing people and provide education to children in need.
  • Cassandra Walker
    1 week 6 days ago
    Time series forecasting is an important statistical tool for predicting future events, needs, trends, etc., and can be applied to a variety of data sources. Jeremy Meier and David Noh, recent graduates of Clemson University’s Computer Science program, spoke at HPCC Systems Tech Talk episode 23 about the basic principles and components of time series forecasting using modern machine learning methods. This blog gives insight into their semester-long project, which focused on time series analysis and forecasting using financial datasets. 

Browse all blog entries.

Follow us on Twitter

What people are saying about HPCC Systems